|
FREE patent keyword monitoring and additional FREE benefits. |
|
|
Data Processing: Speech Signal Processing, Linguistics, Language Translation, And Audio Compression/decompression > Linguistics > Dictionary Building, Modification, Or Prioritization Dictionary Building, Modification, Or PrioritizationDictionary Building, Modification, Or Prioritization patent applications listed are from June 2005 to current and include Date, Patent Application Number, Patent Title, Patent Abstract summary and are linked to the corresponding patent application page.11/16/06 - 20060259295 - Language interface and apparatus therefor A method of organizing linguistic elements, comprising creating a plurality of talk topics, wherein each talk topic relates to a particular subject; defining a first set of linguistic element categories, each of the linguistic element categories representing a communicative intent; providing an association between the first set of linguistic element ... 11/16/06 - 20060259294 - Voice recognition system and method A method of matching an utterance comprising a word to a listing in a directory using an automated speech recognition system by forming a word list comprising a selection of words from the listings in the directory; using the automated speec recognition system to determine the best possible matches of ... 09/21/06 - 20060212288 - Topic specific language models built from large numbers of documents Forming and/or improving a language model based on data from a large collection of documents, such as web data. The collection of documents is queried using queries that are formed from the language model. The language model is subsequently improved using the information thus obtained. The improvement is used to ... 09/21/06 - 20060212287 - Method for data processing with a view to extracting the main attributes of a product A method is provided for data processing with a view to determining the main attributes of a product defined by a description including a plurality of words in which, for each word, one determines whether it belongs to one or more predetermined sets or glossaries. Then, for each word that ... 09/14/06 - 20060206313 - Dictionary learning method and device using the same, input method and user terminal device using the same This invention provides a dictionary learning method, said method comprising the steps of: learning a lexicon and a Statistical Language Model from an untagged corpus; integrating the lexicon, the Statistical Language Mode and subsidiary word encoding information into a small size dictionary. And this invention also provides an input method ... 09/14/06 - 20060206312 - Systems and methods for retrieving data Systems, methods, and storage mediums for retrieving data are provided. A syntax analysis is performed of a retrieval request according to data definition information in a data dictionary so as to convert the retrieval request into a query statement executable by a database. The database comprises an unnormalized data structure ... 09/14/06 - 20060206311 - System and method of multilingual rights data dictionary System and method of a multilingual rights data dictionary is provided. The system and method provides a multilingual RDD registry to local system for translating rights terms. The system includes the RDD registry and processor for translating the rights term by referring to the registry. ... 09/07/06 - 20060200343 - Enhanced data storage An electronic device that includes a stored data stream composed of dictionary entries. Each dictionary entry or coded data correlates with video or audio (spoken, music or other). Spoken words will have one or more dictionary entries to allow for different quality enunciations, pronunciations, inflections, accents and tones. These qualities ... 09/07/06 - 20060200342 - System for processing sentiment-bearing text The present invention provides a system for identifying, extracting, clustering and analyzing sentiment-bearing text. In one embodiment, the invention implements a pipeline capable of accessing raw text and presenting it in a highly usable and intuitive way. ... 08/10/06 - 20060178869 - Classification filter for processing data for creating a language model The method and apparatus utilize a filter to remove a variety of non-dictated words from data based on probability and improve the effectiveness of creating a language model. ... 07/27/06 - 20060167680 - System and method for optimizing run-time memory usage for a lexicon A system and method of extracting information from a lexicon and using the information with a computer software program. Lexicon data is arranged for a particular language using Unicode values or other uniquely defined code values for each character of word of the language. A location array is then created ... 07/20/06 - 20060161423 - Systems and methods for automatically categorizing unstructured text Systems, methods and software products analyze messages of a message stream based upon human generated concept recognizers. A sample set of messages, representative of messages from the message stream, are analyzed to determine interesting or useful categories. Text categorization engines are then trained, using the sample set and text classifiers ... 06/29/06 - 20060142997 - Predictive text entry and data compression method for a mobile communication terminal The invention relates to mobile terminals. According to the invention, the communication terminal includes a predictive editor application for entering text. The editor is used for editing text for message handling, phonebook editing and searching, etc. The terminal further includes compression and/or decompression software. The invention further relates to a ... 06/15/06 - 20060129384 - Document based character ambiguity resolution Methods and apparatus for document based ambiguous character resolution. An application searches a document for words that do not contain ambiguous characters and adds them to a dictionary, then searches the document for words that do contain ambiguous characters. For each ambiguous word, a set of candidate solutions is created ... 06/15/06 - 20060129383 - Text processing method and system A method of processing text is provided, in which each word or sequence of words is checked against a lexicon of words and sequences of words each having, associated therewith a score on at least one personality scale, which can be a multi-dimensional scale for representing various personality traits. These ... 05/25/06 - 20060111896 - Projecting dependencies to generate target language dependency structure In one embodiment of the present invention, a decoder receives a dependency tree as a source language input and accesses a set of statistical models that produce outputs combined in a log linear framework. The decoder also accesses a table of treelet translation pairs and returns a target dependency tree ... 05/25/06 - 20060111895 - Method and apparatus for determining the meaning of natural language A method for processing natural language includes receiving an information input string. Referent tridbits corresponding to stimuli in the information input string are generated. Assert tridbits defining relationships between the referent tridbits are generated. A language processing system including a rules database and a meaning engine. The rules database is ... 05/11/06 - 20060100858 - System and method for generating markup language text templates A method of generating a markup language text template comprises identifying a variable text element in a source language text string and assigning a first predefined symbol to the variable text element, identifying a grammatical rule for the variable text element and assigning a second predefined symbol to the variable ... 05/11/06 - 20060100857 - Custom collation tool A user interface is provided to facilitate a collation creation process that automatically establishes collation support for sorted linguistic data. Through this user interface, the provider of the sorted linguistic data may participate in the collation creation process by answering queries concerning the sorted linguistic data. The provider's input is ... 04/06/06 - 20060074636 - Language learning system and method A language-learning system and a method thereof are provided. The system includes a memory unit for storing learning data comprising a first databank for storing exercise data and a second database for storing lookup data, an operating interface for users to input commands, a functional module for performing processes according ... 03/30/06 - 20060069547 - Creating a speech recognition grammar for alphanumeric concepts A method and system to generate a grammar adapted for use by a speech recognizer includes receiving a representation of an alphanumeric expression. For instance, the representation can take the form of a regular expression or a mask. The grammar is generated based on the representation. ... 03/02/06 - 20060047502 - Method and apparatus for building semantic structures using self-describing fragments A method and apparatus for identifying a semantic structure from text includes processing the input text to identify self-describing fragments of the input text based on a hierarchical schema defining a domain with at least one top-level node and child nodes. Each identified self-describing fragment includes hierarchical context of a ... 02/16/06 - 20060036430 - System and method for domain-based natural language consultation A technique for domain-based natural language dialogue includes a program that combines a broad-coverage parser with a general-purpose interpreter and a knowledge base to handle unrestricted sentences in a domain, such as the medical self-help domain. The broad-coverage parser may have more than 40,000 words in its dictionary. The general-purpose ... 01/26/06 - 20060020448 - Method and apparatus for capitalizing text using maximum entropy A method and apparatus are provided for selecting a form of capitalization for a text by determining a probability of a capitalization form for a word using a weighted sum of features. The features are based on the capitalization form and a context for the word. ... 01/12/06 - 20060009966 - Method and system for extracting information from unstructured text using symbolic machine learning A method (and structure) of extracting information from text, includes parsing an input sample of text to form a parse tree and using user inputs to define a machine-labeled learning pattern from the parse tree. ... 01/05/06 - 20060004564 - Apparatus and methods for pronunciation lexicon compression A compressed pronunciation lexicon file is generated from a source pronunciation lexicon using a pronunciation prediction algorithm in a multi-output mode. The pronunciation prediction algorithm may generate a deterministic ordered list of phoneme strings from the textual representation of a particular word. The compressed pronunciation lexicon file may include a ... 12/08/05 - 20050273318 - Method and system for retrieving confirming sentences A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engine defines indexing units based upon ... 11/10/05 - 20050251385 - Information processing apparatus, information processing method and recording medium From a word set output section, a word is inputted to an optimum word train output section along with a concept notation function which is a function for representing a matter that the word indicates. The optimum word train output section calculates, on the basis of respective concept notation functions, ... 11/10/05 - 20050251384 - Word extraction method and system for use in word-breaking A method, computer readable medium and system are provided which collect new words for addition to a lexicon for an agglutinative language. Sentences in the agglutinative language are retrieved from documents, for example from web pages. New word candidate character strings are identified in the retrieved sentences. The identified new ... 11/03/05 - 20050246161 - Time series data analysis apparatus and method A text data storage unit stores a plurality of text data having attribute data and time data. A dictionary storage unit stores a plurality of events each associated with text data. An analysis condition indication unit indicates an analysis target as attribute data and an analysis condition as an event ... 10/20/05 - 20050234709 - System and method of generating dictionary entries A system for automatically generating a dictionary from full text articles extracts <term, definition> pairs from full text articles and stores the <term, definition> pairs as dictionary entries. The system includes a computer readable corpus having a plurality of documents therein. A pattern processing module (120) and a grammar processing ... 10/13/05 - 20050228644 - Generic user interface testing framework with rules-based wizard A system and method for providing a generic user interface testing framework, together with a rules based wizard for use therewith. The rules based wizard removes the necessity of test developers having to learn the abstract environment, parameters and directives used to implement the UI test automation system. A user ... 09/29/05 - 20050216256 - Configurable formatting system and method A configurable formatting system and method for generating a desired representation of an expression within a word list includes a dictionary database, a working list module, a formatting module, and a configuration file. The dictionary database stores categories containing words and translation rules. The configuration file contains variants to the ... 09/08/05 - 20050197829 - Word collection method and system for use in word-breaking A method, computer readable medium and system are provided which collect new words for addition to a lexicon for an agglutinative language. In the method, a log of queries submitted to a search engine is obtained. The log of queries is sorted to obtain sorted queries. The sorted queries are ... 08/25/05 - 20050187758 - Method of multilingual speech recognition by reduction to single-language recognizer engine components In some speech recognition applications, not only the language of the utterance is not known in advance, but also a single utterance may contain words in more than one language. At the same time, it is impractical to build speech recognizers for all expected combinations of languages. Moreover, business needs ... 08/04/05 - 20050171761 - Disambiguation language model A language model for a language processing system such as a speech recognition system is constructed from training corpus formed from associated characters, word phrases and context cues. A method and apparatus for generating the training corpus used to train the language model and a system or module using such ... 08/04/05 - 20050171760 - Visual thesaurus A visual thesaurus system and method for displaying a selected term in association with its one or more meanings, other words to which it is related, and further relationship information. The results of a search are presented in a directed graph that provides more information than an ordered list. When ... 06/30/05 - 20050143972 - System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; ... 06/23/05 - 20050137856 - Full-text index module consistency checking Consistency between the components used to generate and query a full-text index is determined and if a mismatch is detected, an error may be surfaced. A structure including information associated with each component used to build an index is programmatically compared with currently available components. The structure may be interrogated ... 06/09/05 - 20050125220 - Method for constructing lexical tree for speech recognition Disclosed is a method for constructing a lexical tree for speech recognition, wherein, even though a name included in an address book in a communication device such as a cellular phone and a word such as “house/office/cellular phone” are sequentially and successively uttered, the method allows the uttered speech to ... 06/02/05 - 20050119876 - Speech recognition method using a single transducer Input data are translated into a lexical output sequence. Sub-lexical entities and various possible combinations of the entities are identified as states ei and ej of first and second language models, respectively, intended to be stored, with an associated likelihood value and a table having memory areas. Each memory area ... ### FreshPatents.com Support |