Report Number: CS-TN-98-74
Institution: Stanford University, Department of Computer Science
Title: Predicate Rewriting for Translating Boolean Queries in a
Heterogeneous Information System
Author: Chang, Chen-Chuan K.
Author: Garcia-Molina, Hector
Author: Paepcke, Andreas
Date: May 1998
Abstract: Searching over heterogeneous information sources is difficult
in part because of the non-uniform query languages. Our
approach is to allow users to compose Boolean queries in one
rich front-end language. For each user query and target
source, we transform the user query into a subsuming query
that can be supported by the source but that may return extra
documents. The results are then processed by a filter query
to yield the correct final results. In this paper we
introduce the architecture and associated mechanism for query
translation. In particular, we discuss techniques for
rewriting predicates in Boolean queries into native subsuming
forms, which is a basis for translating complex queries. In
addition, we present experimental results for evaluating the
cost of post-filtering. We also discuss the drawbacks of this
approach and cases when it may not be effective. We have
implemented prototype versions of these mechanisms and
demonstrated them on heterogeneous Boolean systems.
http://i.stanford.edu/pub/cstr/reports/cs/tn/98/74/CS-TN-98-74.pdf