Report Number: CS-TN-98-74
Institution: Stanford University, Department of Computer Science
Title: Predicate Rewriting for Translating Boolean Queries in a Heterogeneous Information System
Author: Chang, Chen-Chuan K.
Author: Garcia-Molina, Hector
Author: Paepcke, Andreas
Date: May 1998
Abstract: Searching over heterogeneous information sources is difficult in part because of the non-uniform query languages. Our approach is to allow users to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final results. In this paper we introduce the architecture and associated mechanism for query translation. In particular, we discuss techniques for rewriting predicates in Boolean queries into native subsuming forms, which is a basis of translating complex queries. In addition, we present experimental results for evaluating the cost of post-filtering. We also discuss the drawbacks of this approach and cases when it may not be effective. We have implemented prototype versions of these mechanisms and demonstrated them on heterogeneous Boolean systems.
http://i.stanford.edu/pub/cstr/reports/cs/tn/98/74/CS-TN-98-74.pdf