Minimal difference query and view matching -> Monitor Keywords
Fresh Patents
Monitor Patents Patent Organizer File a Provisional Patent Browse Inventors Browse Industry Browse Agents Browse Locations
site info Site News  |  monitor Monitor Keywords  |  monitor archive Monitor Archive  |  organizer Organizer  |  account info Account Info  |  
08/16/07 - USPTO Class 707 |  246 views | #20070192282 | Prev - Next | About this Page  707 rss/xml feed  monitor keywords

Minimal difference query and view matching

USPTO Application #: 20070192282
Title: Minimal difference query and view matching
Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference can be employed to further substantially the scope of view matching where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts. (end of abstract)



Agent: Amin. Turocy & Calvin, LLP - Cleveland, OH, US
Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
USPTO Applicaton #: 20070192282 - Class: 707002000 (USPTO)

Related Patent Categories: Data Processing: Database And File Management Or Data Structures, Database Or File Accessing, Access Augmentation Or Optimizing

Minimal difference query and view matching description/claims


The Patent Description & Claims data below is from USPTO Patent Application 20070192282, Minimal difference query and view matching.

Brief Patent Description - Full Patent Description - Patent Application Claims
  monitor keywords

BACKGROUND

[0001] Data-intensive applications such as decision support and e-commerce rely on being able to integrate data from various sources. To accomplish this task, a data transformation query is created between a data source and target. A variety of issues have to be addressed in identifying such a query. The data is often inconsistent owing to factors such as data entry errors and missing information. These inconsistencies must be removed before the data can be loaded and used for analysis. This is further compounded by the presence of mismatches between the source and the target schemas that need to be reconciled. As a result, the problem of data integration is widely recognized to be a significant challenge.

[0002] The space of reasonable transformation queries between data source and target can be enormous, and it is very difficult for users to consider and even conceive of all possible options. This is aggravated by the fact that a user may not understand the source data fully. As a result, users need to try different queries iteratively until a satisfactory result is obtained. Previously published work has thus identified the need for interactive tools that help users understand the impact of a transformation query.

[0003] In such an interactive environment, it is natural to reason about the difference between queries. Even a small change to a transformation query, such as changing a join column, relaxing an equi-join to a join that exploits string similarities (also known as similarity joins), changing the thresholds for similarity comparison or adding an extra join can have substantial impact on the results of the query. It is therefore very natural to ask whether such a change produces tuples that are expected at the result but were previously absent or suppresses tuples that were erroneously generated.

[0004] Of course, the difference between queries can be computed in SQL (Structure Language Query), using the EXCEPT, EXCEPT ALL or MINUS clause. However, the performance of this approach is highly inadequate, especially when the two queries are closely related to each other. In particular, assume there are two queries Q.sub.1 and Q.sub.2. Conventionally, the difference is computed by executing both Q.sub.1 and Q.sub.2 and then determining the difference.

SUMMARY

[0005] The following presents a simplified summary in order to provide a basic understanding of some aspects of the claimed subject matter. This summary is not an extensive overview. It is not intended to identify key/critical elements or to delineate the scope of the claimed subject matter. Its sole purpose is to present some concepts in a simplified form as a prelude to the more detailed description that is presented later.

[0006] Briefly described the subject innovation pertains to systems and methods of efficiently computing the difference between queries. In accordance with one aspect of the subject innovation, a principled approach is employed based on identifying joins that can be eliminated when computing the query difference. This is characterized by defining the notion of a minimal difference query (MDQ) that roughly corresponds to minimizing the number of joins. Systems and methods are disclosed for obtaining the minimal difference query that are adapted to both set and bag semantics.

[0007] In accordance with another aspect of the subject innovation, the systems and methods can be extended to exploit known or determined metadata including but not limited to functional dependencies (e.g., key constraints) and inclusion dependencies (e.g., foreign key constraints). This metadata can be employed to improve query difference performance further by providing an opportunity to eliminate additional joins, for example based on redundant conjuncts.

[0008] According to yet another aspect of the subject innovation, the query difference system and method can be exploited to further the scope of view matching technology in the case when the results of a query are only partially covered by a view. This can be accomplished by, among other things, incorporating the aforementioned difference operations into view matching. In an interactive environment, this strategy widens the possibility of using the results of previously executed queries in answering a query. This approach is especially useful when the queries are closely related, as is often the case in an interactive environment.

[0009] Additionally or alternatively, aspects of the subject innovation can be employed in an analytical tool (e.g., client and/or server side) useful in contexts including but not limited to data cleaning, data integration and schema matching. For example, utilizing such a tool an entity or individual can reason whether a specific change to a query produces tuples that were expected but not found or suppresses tuples that were erroneously returned.

[0010] To the accomplishment of the foregoing and related ends, certain illustrative aspects of the claimed subject matter are described herein in connection with the following description and the annexed drawings. These aspects are indicative of various ways in which the subject matter may be practiced, all of which are intended to be within the scope of the claimed subject matter. Other advantages and novel features may become apparent from the following detailed description when considered in conjunction with the drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] FIG. 1 is a block diagram of a query difference system.

[0012] FIG. 2 is a block diagram of a difference component.

[0013] FIG. 3 is a block diagram of a query difference system.

[0014] FIG. 4 is a block diagram of a constraint difference component.

[0015] FIG. 5 is a block diagram of a query difference system.

[0016] FIG. 6 is a diagram of query and view portions with respect to view matching.

[0017] FIG. 7 is a block diagram of a view matching system.

[0018] FIG. 8 is a flow chart diagram of a method of generating a minimal set difference query.

[0019] FIG. 9 is a flow chart diagram of a method of generating a minimal set difference query exploiting constraints.

[0020] FIG. 10 is a flow chart diagram of a method of generating a bag difference.

[0021] FIG. 11 is a flow chart diagram of a view matching methodology.

Continue reading about Minimal difference query and view matching...
Full patent description for Minimal difference query and view matching

Brief Patent Description - Full Patent Description - Patent Application Claims

Click on the above for other options relating to this Minimal difference query and view matching patent application.
###
monitor keywords

How KEYWORD MONITOR works... a FREE service from FreshPatents
1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored.
3. Each week you receive an email with patent applications related to your keywords.  
Start now! - Receive info on patent apps like Minimal difference query and view matching or other areas of interest.
###


Previous Patent Application:
Methods and systems for multi-pattern searching
Next Patent Application:
Optimizing aggregate processing
Industry Class:
Data processing: database and file management or data structures

###

FreshPatents.com Support
Thank you for viewing the Minimal difference query and view matching patent info.
IP-related news and info


Results in 0.10748 seconds


Other interesting Feshpatents.com categories:
Daimler Chrysler , DirecTV , Exxonmobil Chemical Company , Goodyear , Intel , Kyocera Wireless , 174
filepatents (1K)

* Protect your Inventions
* US Patent Office filing
patentexpress PATENT INFO