Document processing system, method and program -> Monitor Keywords
Fresh Patents
Monitor Patents Patent Organizer File a Provisional Patent Browse Inventors Browse Industry Browse Agents Browse Locations
site info Site News  |  monitor Monitor Keywords  |  monitor archive Monitor Archive  |  organizer Organizer  |  account info Account Info  |  
04/03/08 - USPTO Class 707 |  76 views | #20080082570 | Prev - Next | About this Page  707 rss/xml feed  monitor keywords

Document processing system, method and program

USPTO Application #: 20080082570
Title: Document processing system, method and program
Abstract: To perform processing at a high speed to identify some of the structure type definitions in a given definition set to which a received XML document matches. There are provided a decision rule generation section which generates a decision rule for determining to which one of a plurality of DTDs an XML document matches on the basis of unique elements of the DTDs, and a DTD determination section which determines to which one of the plurality of DTDs the XML document to be processed matches on the basis of the decision rule generated by the decision rule generation section. (end of abstract)



Agent: Louis Paul Herzberg - Monsey, NY, US
Inventors: Naohiko Uramoto, Hiroshi Maruyama, Kento Tamura
USPTO Applicaton #: 20080082570 - Class: 707102000 (USPTO)

Related Patent Categories: Data Processing: Database And File Management Or Data Structures, Database Schema Or Data Structure, Generating Database Or Data Structure (e.g., Via User Interface)

Document processing system, method and program description/claims


The Patent Description & Claims data below is from USPTO Patent Application 20080082570, Document processing system, method and program.

Brief Patent Description - Full Patent Description - Patent Application Claims
  monitor keywords

FIELD OF THE INVENTION

[0001] The present invention is directed to processing a document described in a structured language such as the extensible markup language (XML). More particularly it is directed to determining which of a plurality of prepared structure type definitions a document to be processed matches.

BACKGROUND ART

[0002] In the field of information communication through networks, systems in which data, etc., are exchanged by using XML formats are becoming prevalent. Ordinarily, not a single XML document format but a plurality of XML document formats exist as the format of XML documents exchanged as business data, etc., between companies. For example, with respect to a leasing contract system, formats for a quotation request, a contract document, a request item inquiry, etc., are conceivable.

[0003] In XML, a format is designated by using a DTD. More specifically, DTD files such as a quotation DTD and a contract DTD are respectively prepared for formats. Although it is technically possible to write a plurality of formats by one DTD but in actuality different DTDs are ordinarily used for a plurality of formats.

[0004] In some case, there is a need to recognize the format of an XML document by identifying a DTD to which the XML document matches. For example, in a case where processing according to the contents of a transferred XML document is performed on the XML document, a key for an electronic signature is used according to the format. In such a case, if a DTD has been explicitly designated for the XML document to be processed, it is apparent that the DTD to which the XML document matches can be identified. However, if no explicit designation has been made, processing for actually verifying the XML document with respect to DTDs is required.

[0005] In some case, no DTD is designated for XML document. For example, no DTD can be designated in a situation described below. In recent years, system construction through Web services has been spotlighted and such Web services use a simple object access protocol (SOAP) for exchange of XML documents. The SOAP specification functions as an envelop containing an XML document. That is, an XML document is transferred by being wrapped in a SOAP envelop in accordance with a SOAP. A SOAP uses not a DTD but an XML schema as a structure type definition (schema). Therefore, even if an XML document contained in a SOAP envelop matches a predetermined DTD, it is not possible to designate the DTD for the XML document by DOCTYPE declaration. Also in the case of an XML document with an electronic signature, a DTD cannot be designated for the XML document due to the same mechanism.

[0006] As described above, if there is a need to identify a DTD for an XML document, and if no DTD is explicitly designated for the XML document to be processed, processing for actually verifying the XML document with respect to DTDs is required to identify a DTD for the XML document.

[0007] In the case of an XML document processed in accordance with a SOAP or by processing with an electronic signature, a DTD for this XML document cannot be designated by DOCTYPE declaration. Therefore, processing for actually verifying the XML document with respect to DTDs is also required to identify a DTD for the XML document. In this case, to enable identification of the DTD for the XML document in a processed state, the DTD for the XML document may be described in a SOAP header or the like. However, there is no standard method for designating a DTD.

[0008] As described, it is necessary in some case to perform processing for verification with respect to DTDs for recognition of the format of an XML document to be processed. In a case where an XML document to be processed has a multiplicity of formats, however, a process for verifying the XML document with respect to all DTDs is troublesome and time-consuming. The same problem is also encountered with a structure type definition (e.g., the above-mentioned XML schema) other than DTDs.

SUMMARY OF THE INVENTION

[0009] Therefore an aspect of the present invention is, assuming that a set of a structure type definitions are given, to enable processing for ascertaining to which one of the structure type definitions a received XML document matches be performed at a high speed.

[0010] Another aspect of the present invention is to realize a system for performing such processing with no influence on existing system configurations and implementations of XML processors.

[0011] In an example embodiment to achieve these aspects, the present invention is realized as a document processing system constructed as described below. The document processing system has a decision rule generation section for generating a decision rule for determining whether a document described in a structured language such as XML matches at least one of a plurality of structure type definitions (e.g., DTDs) on the basis of unique elements of the structure type definitions, and a determination section for determining whether the document to be processed matches at least one of the structure type definitions on the basis of the decision rule generated by the decision rule generation section.

[0012] Another example embodiment of a document processing system in accordance with the present invention has an input section for inputting an XML document to be processed, an XML parser for analyzing the XML document input by the input section, and a DTD (Document Type Definition) determination section for determining at least one of a group of DTDs to which the XML document matches by determination as to whether or not an element unique to the DTD is included in the XML document on the basis of the result of analysis by the XML parser.

[0013] Still another document processing system in accordance with the present invention has a nondeterministic finite state automaton generation section for generating a nondeterministic finite state automaton corresponding to a group of structure type definitions (e.g., DTDs) to at least one of which a document described in a structured language such as XML should match, a deterministic finite state automaton generation section by converting the nondeterministic finite state automaton generated by the nondeterministic finite state automaton generation section to a deterministic finite state automaton, and a determination section for determining on the basis of the deterministic finite state automaton generated by the determinative automaton generation section whether the document to be processed matches at least one of the group of structural type definitions.

[0014] The present invention is also realized as a document processing method for processing a document described in a structured language such as XML by using a computer. That is, this document processing method includes the step of generating a decision rule for determining whether a document to be processed matches at least one of a plurality of structure type definitions (e.g., DTDs) on the basis of unique elements of the structure type definitions, and the step of determining whether the document to be processed matches at least one of the structure type definitions on the basis of the generated decision rule.

[0015] Another document processing method in accordance with the present invention includes the step of inputting and analyzing the XML document and examining whether or not an element unique to any one of a group of DTDs (Document Type Definitions) to at least one of which the XML document to be processed should match is included in the XML document based on the result of the analysis, and identifying the DTD to which the XML document matches.

[0016] Still another document processing method in accordance with the present invention includes a first group of steps of reading out from a memory a group of structure type definitions to at least one of which the document should match and generating a first nondeterministic finite state automaton corresponding to each of structure type definitions included in the group of structural type definitions, a second group of steps of combining the first nondeterministic finite state automatons into a second nondeterministic finite state automaton corresponding to the whole of the group of structural type definitions, a third group of steps of converting the second nondeterministic finite state automaton into a deterministic finite state automaton, and a fourth group of steps of determining whether the document to be processed matches at least one of the group of structural type definitions on the basis of the deterministic finite state automaton.

[0017] Also, the present invention may be realized as a program for realizing the above-described document processing system by using a computer or making a computer execute processes corresponding to the steps in the document processing method. This program may be provided by being distributed for example in a state of being stored in a magnetic disk, an optical disk, a semiconductor memory or any other recording medium or distributed over a network.

BRIEF DESCRIPTION OF THE DRAWINGS

[0018] FIG. 1 is a diagram schematically showing an example of a hardware configuration of a computer suitable for implementation of an XML document determination system in accordance with a first embodiment of the present invention;

[0019] FIG. 2 is a block diagram for explaining the configuration of the XML document determination system in accordance with the first embodiment;

[0020] FIG. 3 is a flowchart for explaining the operation of a decision rule generation section in the first embodiment;

Continue reading about Document processing system, method and program...
Full patent description for Document processing system, method and program

Brief Patent Description - Full Patent Description - Patent Application Claims

Click on the above for other options relating to this Document processing system, method and program patent application.
###
monitor keywords

How KEYWORD MONITOR works... a FREE service from FreshPatents
1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored.
3. Each week you receive an email with patent applications related to your keywords.  
Start now! - Receive info on patent apps like Document processing system, method and program or other areas of interest.
###


Previous Patent Application:
Content management framework for use with a system for application development
Next Patent Application:
Method and system for customizing a user interface to an on-demand database service
Industry Class:
Data processing: database and file management or data structures

###

FreshPatents.com Support
Thank you for viewing the Document processing system, method and program patent info.
IP-related news and info


Results in 0.14599 seconds


Other interesting Feshpatents.com categories:
Daimler Chrysler , DirecTV , Exxonmobil Chemical Company , Goodyear , Intel , Kyocera Wireless , 174
filepatents (1K)

* Protect your Inventions
* US Patent Office filing
patentexpress PATENT INFO