08/16/07 - USPTO Class 707 | #20070192297

Minimal difference query and view matching

USPTO Application #: 20070192297
Title: Minimal difference query and view matching
Abstract: The subject disclosure pertains to efficient computation of the difference between queries by exploiting commonality between them. A minimal difference query (MDQ) is generated that roughly corresponds to removal of as many joins as possible while still accurately representing the query difference. The minimal difference query can be employed to substantially extend the scope of view matching to cases where a query is not wholly subsumed by a view. Additionally, the minimal difference query can be employed as an analytical tool in various contexts.



Agent: Amin, Turocy & Calvin, LLP - Cleveland, OH, US
Inventors: Kaushik Shriraghav, Venkatesh Ganti, Xin Dong
USPTO Application #: 20070192297 - Class: 707 3 (USPTO)


CROSS REFERENCE TO RELATED APPLICATIONS

[0001]This application is a continuation of U.S. patent application Ser. No. 11/352,628, filed on Feb. 13, 2006, entitled "MINIMAL DIFFERENCE QUERY AND VIEW MATCHING", the entirety of which is incorporated herein by reference.

BACKGROUND

[0002]Data-intensive applications such as decision support and e-commerce rely on being able to integrate data from various sources. To accomplish this task, a data transformation query is created between a data source and target. A variety of issues have to be addressed in identifying such a query. The data is often inconsistent owing to factors such as data entry errors and missing information. These inconsistencies must be removed before the data can be loaded and used for analysis. This is further compounded by the presence of mismatches between the source and the target schemas that need to be reconciled. As a result, the problem of data integration is widely recognized to be a significant challenge.

[0003]The space of reasonable transformation queries between data source and target can be enormous, and it is very difficult for users to consider and even conceive of all possible options. This is aggravated by the fact that a user may not understand the source data fully. As a result, users need to try different queries iteratively until a satisfactory result is obtained. Previously published work has thus identified the need for interactive tools that help users understand the impact of a transformation query.

[0004]In such an interactive environment, it is natural to reason about the difference between queries. Even a small change to a transformation query, such as changing a join column, relaxing an equi-join to a join that exploits string similarities (also known as a similarity join), changing the thresholds for similarity comparison, or adding an extra join, can have a substantial impact on the results of the query. It is therefore natural to ask whether such a change produces tuples that are expected in the result but were previously absent, or suppresses tuples that were erroneously generated.

[0005]Of course, the difference between queries can be computed in SQL (Structured Query Language) using the EXCEPT, EXCEPT ALL or MINUS clause. However, this approach performs poorly, especially when the two queries are closely related to each other. In particular, assume there are two queries Q1 and Q2. Conventionally, the difference is computed by executing both Q1 and Q2 in full and then determining the difference, so any work the two queries have in common is repeated.
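The conventional approach described above can be illustrated with a minimal sketch. The table names, columns, and data below are hypothetical and are not taken from the application; SQLite is used only because it ships with Python, and it supports EXCEPT (set semantics) but not EXCEPT ALL or MINUS. Both queries run the same join in full before the results are subtracted, which is the redundancy the background points to.

```python
import sqlite3

def query_difference(conn, q1, q2):
    """Baseline difference: evaluate both queries in full, then subtract.

    Uses SQL EXCEPT, so duplicates are removed (set semantics).
    """
    return conn.execute(f"{q1} EXCEPT {q2}").fetchall()

# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, cust_id INTEGER, total INTEGER);
    INSERT INTO customer VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO orders VALUES (10, 1, 50), (11, 1, 150), (12, 2, 300);
""")

# Two closely related queries: same join, different selection threshold.
Q1 = ("SELECT o.id FROM orders o JOIN customer c ON o.cust_id = c.id "
      "WHERE o.total > 40")
Q2 = ("SELECT o.id FROM orders o JOIN customer c ON o.cust_id = c.id "
      "WHERE o.total > 100")

diff = sorted(query_difference(conn, Q1, Q2))
print(diff)
```

Here the join between orders and customer is evaluated once for each query even though it is identical in both.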

SUMMARY

[0006]The following presents a simplified summary in order to provide a basic understanding of some aspects of the claimed subject matter. This summary is not an extensive overview. It is not intended to identify key/critical elements or to delineate the scope of the claimed subject matter. Its sole purpose is to present some concepts in a simplified form as a prelude to the more detailed description that is presented later.

[0007]Briefly described, the subject innovation pertains to systems and methods of efficiently computing the difference between queries. In accordance with one aspect of the subject innovation, a principled approach is employed based on identifying joins that can be eliminated when computing the query difference. This is characterized by defining the notion of a minimal difference query (MDQ) that roughly corresponds to minimizing the number of joins. Systems and methods are disclosed for obtaining the minimal difference query that are adapted to both set and bag semantics.

[0008]In accordance with another aspect of the subject innovation, the systems and methods can be extended to exploit known or determined metadata including but not limited to functional dependencies (e.g., key constraints) and inclusion dependencies (e.g., foreign key constraints). This metadata can be employed to improve query difference performance further by providing an opportunity to eliminate additional joins, for example based on redundant conjuncts.
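As a rough illustration of how such constraints can make a join redundant, the sketch below uses a hypothetical schema (not from the application) in which orders.cust_id is a non-null foreign key referencing the key customer.id. Under those two assumptions every order joins with exactly one customer, so a join that contributes no output columns can be dropped without changing the result under either set or bag semantics. This is a generic example of constraint-based join elimination, not the specific procedure of the disclosure.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # have SQLite enforce the FK
conn.executescript("""
    -- Hypothetical schema: customer.id is a key (functional dependency),
    -- orders.cust_id is a NOT NULL foreign key (inclusion dependency).
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        cust_id INTEGER NOT NULL REFERENCES customer(id),
        total INTEGER
    );
    INSERT INTO customer VALUES (1, 'Ann'), (2, 'Bob');
    INSERT INTO orders VALUES (10, 1, 50), (11, 1, 150), (12, 2, 300);
""")

# The join only checks that a matching customer exists; the constraints
# already guarantee exactly one match per order.
with_join = ("SELECT o.id, o.total FROM orders o "
             "JOIN customer c ON o.cust_id = c.id")
without_join = "SELECT o.id, o.total FROM orders o"

rows_with_join = sorted(conn.execute(with_join).fetchall())
rows_without_join = sorted(conn.execute(without_join).fetchall())
print(rows_with_join == rows_without_join)
```

If cust_id were nullable, or the foreign key were not enforced, the two queries could differ and the join could not safely be removed.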

[0009]According to yet another aspect of the subject innovation, the query difference system and method can be exploited to further the scope of view matching technology in the case when the results of a query are only partially covered by a view. This can be accomplished by, among other things, incorporating the aforementioned difference operations into view matching. In an interactive environment, this strategy widens the possibility of using the results of previously executed queries in answering a query. This approach is especially useful when the queries are closely related, as is often the case in an interactive environment.
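A small sketch of the partial-coverage idea follows, assuming a hypothetical orders table and a previously materialized result named cached_view (both invented for illustration). The new query is answered by combining the cached rows with a remainder query that fetches only the rows the view does not cover. The remainder predicate is written by hand here; deriving it automatically is the part the disclosure addresses.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, total INTEGER NOT NULL);
    INSERT INTO orders VALUES (1, 20), (2, 80), (3, 120), (4, 200);
    -- Stored result of an earlier query: orders with total < 100.
    CREATE TABLE cached_view AS
        SELECT id, total FROM orders WHERE total < 100;
""")

# New query: wider than the view, so the view covers it only partially.
full_query = "SELECT id, total FROM orders WHERE total < 150"

# Remainder: the part of the new query that the view does not cover.
remainder_query = ("SELECT id, total FROM orders "
                   "WHERE total >= 100 AND total < 150")

cached_rows = conn.execute("SELECT id, total FROM cached_view").fetchall()
remainder_rows = conn.execute(remainder_query).fetchall()
combined = sorted(cached_rows + remainder_rows)

# Direct evaluation, used here only to check the combined answer.
direct = sorted(conn.execute(full_query).fetchall())
print(combined == direct)
```

The benefit in practice depends on the remainder being cheaper to evaluate than the full query, which is more likely when successive queries are closely related.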

[0010]Additionally or alternatively, aspects of the subject innovation can be employed in an analytical tool (e.g., client and/or server side) useful in contexts including but not limited to data cleaning, data integration and schema matching. For example, utilizing such a tool, an entity or individual can reason about whether a specific change to a query produces tuples that were expected but not found, or suppresses tuples that were erroneously returned.
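The kind of question such a tool answers can be sketched as follows. The source and target tables and their contents are hypothetical, and a case-insensitive comparison stands in for a real similarity join. The helper changed_tuples (an illustrative name, not from the application) reports which tuples a revised query gains and which it loses relative to the original, using the plain EXCEPT baseline rather than a minimal difference query.

```python
import sqlite3

def changed_tuples(conn, old_query, new_query):
    """Return (gained, lost) tuples when old_query is replaced by new_query."""
    gained = conn.execute(f"{new_query} EXCEPT {old_query}").fetchall()
    lost = conn.execute(f"{old_query} EXCEPT {new_query}").fetchall()
    return sorted(gained), sorted(lost)

# Hypothetical data with an inconsistency in capitalization.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE source (name TEXT);
    CREATE TABLE target (name TEXT);
    INSERT INTO source VALUES ('Acme Corp'), ('globex'), ('Initech');
    INSERT INTO target VALUES ('Acme Corp'), ('Globex'), ('Umbrella');
""")

# Original query: exact equi-join on name.
before = "SELECT s.name FROM source s JOIN target t ON s.name = t.name"
# Revised query: relaxed, case-insensitive match (stand-in for similarity).
after = ("SELECT s.name FROM source s JOIN target t "
         "ON lower(s.name) = lower(t.name)")

gained, lost = changed_tuples(conn, before, after)
print("gained:", gained)
print("lost:", lost)
```

A user inspecting the gained tuples can judge whether the relaxation recovers matches that were expected, while the lost tuples show whether any previously returned rows were suppressed.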

[0011]To the accomplishment of the foregoing and related ends, certain illustrative aspects of the claimed subject matter are described herein in connection with the following description and the annexed drawings. These aspects are indicative of various ways in which the subject matter may be practiced, all of which are intended to be within the scope of the claimed subject matter. Other advantages and novel features may become apparent from the following detailed description when considered in conjunction with the drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0012]FIG. 1 is a block diagram of a query difference system.

[0013]FIG. 2 is a block diagram of a difference component.

[0014]FIG. 3 is a block diagram of a query difference system.

[0015]FIG. 4 is a block diagram of a constraint difference component.

[0016]FIG. 5 is a block diagram of a query difference system.

[0017]FIG. 6 is a diagram of query and view portions with respect to view matching.

[0018]FIG. 7 is a block diagram of a view matching system.

[0019]FIG. 8 is a flow chart diagram of a method of generating a minimal set difference query.

[0020]FIG. 9 is a flow chart diagram of a method of generating a minimal set difference query exploiting constraints.

Industry Class:
Data processing: database and file management or data structures
