|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
12/2011 | Recent | 13: Jun | May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression December patent applications/inventions, industry category 12/11Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 12/29/2011 > 30 patent applications in 13 patent subcategories. patent applications/inventions, industry category
20110320185 - Systems and methods for machine translation: Systems and methods for machine translation are presented. Embodiments of the systems and methods comprise receiving a phrase table, the phrase table comprising a bi-phrase having a source phrase in a source language and a parallel translated target phrase in a target language; replacing a word in the source and/or... Agent:
20110320190 - Automated sentence planning in a task classification system: The invention relates to a task classification system (900) that interacts with a user. The task classification system (900) may include a recognizer (920) that may recognize symbols in the user's input communication, and a natural language understanding unit (900) that may determine whether the user's input communication can be... Agent: At&t Intellectual Property Ii, L.p.
20110320186 - Entity recognition: A key advantage is the ability to extract terms from documents based on the combination of a limited number of sub-concepts. This avoids the need for the prior identification of all possible terms that current methods require. A second key advantage is the ability to introduce or remove concepts and... Agent: Rolls-royce PLC
20110320187 - Natural language question answering system and method based on deep semantics: In a computer system, systems and methods for automatically answering natural language questions using deep semantics are provided. Methods include receiving a natural language question, mapping it into one or more deductive database queries that captures one or more intents behind the question, computing one or more result sets of... Agent: Experienceon Ventures S.l.
20110320189 - Systems and methods for filtering dictated and non-dictated sections of documents: A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identifies portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing... Agent: Dictaphone Corporation
20110320191 - Text creation system and method: A text creation system and method is described. Input text is provided in an authoring language and may be provided in one or more rendering languages and/or writing styles. The input text is analyzed to determine the semantic content of the input, and the semantic information is stored in a... Agent:
20110320188 - Web-based speech recognition with scripting and semantic objects: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as,... Agent: Eliza Corporation
20110320192 - Gateway apparatus and method and communication system: A gateway apparatus receives a call control signal and/or a packet with voice data stored therein in a predetermined protocol from a packet transfer apparatus on a mobile high-speed network and converts the received protocol into a circuit-switched protocol used when an RNC connects to a circuit switching equipment on... Agent:
20110320193 - Speech encoding device, speech decoding device, speech encoding method, and speech decoding method: Provided is a speech encoding device that is capable of performing encoding in an extension encoder even when the core encoder and core decoder of each layer have been interchanged, and that is also capable of performing high precision encoding by using the appropriate codec for each situation. The speech... Agent: Panasonic Corporation
20110320194 - Decoder with embedded silence and background noise compression: There is provided a method for use by a speech encoder to encode an input speech signal. The method comprises receiving the input speech signal; determining whether the input speech signal includes an active speech signal or an inactive speech signal; low-pass filtering the inactive speech signal to generate a... Agent: Mindspeed Technologies, Inc.
20110320195 - Method, apparatus and system for linear prediction coding analysis: The present invention relates to communication technologies and discloses a method, an apparatus and a system for Linear Prediction Coding (LPC) analysis to improve LPC prediction performance and simplify analysis operation. The method includes: obtaining signal feature information of at least one sample point of input signals; comparing and analyzing... Agent:
20110320196 - Method for encoding and decoding an audio signal and apparatus for same: A method for coding and decoding an audio signal or speech signal and an apparatus adopting the method are provided.... Agent: Samsung Electronics Co., Ltd.
20110320198 - Interactive environment for performing arts scripts: One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles and a set of information associated with each role in the set of roles. An active role in the... Agent:
20110320199 - Method and apparatus for fusing voiced phoneme units in text-to-speech: According to one embodiment, an apparatus for fusing voiced phoneme units in Text-To-Speech, includes a reference unit selection module configured to select a reference unit from the plurality of units based on pitch cycle information of the each unit and the number of pitch cycles of the target segment. The... Agent: Kabushiki Kaisha Toshiba
20110320197 - Method for indexing multimedia information: c
20110320200 - Speaker recognition in a multi-speaker environment and comparison of several voice prints to many: One-to-many comparisons of callers' voice prints with known voice prints to identify any matches between them. When a customer communicates with a particular entity, such as a customer service center, the system makes a recording of the real-time call including both the customer's and agent's voices. The system segments the... Agent: American Express Travel Related Services Company, Inc.
20110320202 - Location verification system using sound templates: A system using sound templates is presented that may receive a first template for an audio signal and compares it to templates from different sound sources to determine a correlation between them. A location history database is created that assists in identifying the location of a user in response to... Agent:
20110320201 - Sound verification system using templates: An audio signal verification system is presented for verifying the sound is from a predetermined source. Various methods for analyzing the sound are presented and the various methods may be combined to vary degrees to determine an appropriate correlation with a predefined pattern. Moreover a confidence level or other indication... Agent:
20110320203 - Method and system for identifying and correcting accent-induced speech recognition difficulties: A system for use in speech recognition includes an acoustic module accessing a plurality of distinct-language acoustic models, each based upon a different language; a lexicon module accessing at least one lexicon model; and a speech recognition output module. The speech recognition output module generates a first speech recognition output... Agent: Nuance Communications, Inc.
20110320207 - Coding, modification and synthesis of speech segments: The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component and comparison between the phase value of said component and a... Agent: Telefonica, S.a.
20110320205 - Electronic book reader: An electronic book reader includes a display, an audio output device, a text obtaining module, a storing module, a text displaying module, a text analyzing module, a text highlighting module, a speech synthesis module, a player module, and a synchronization control module. The text obtaining module obtains a text from... Agent: Hon Hai Precision Industry Co., Ltd.
20110320206 - Electronic book reader and text to speech converting method: An electronic book reader includes a text obtaining module, a text highlighting module, a speech synthesis module, a player module, and a synchronization control module. The text obtaining module obtains a selected segment of a text. The text highlighting module highlights the selected segment. The speech synthesis module converts the... Agent: Hon Hai Precision Industry Co., Ltd.
20110320204 - Systems and methods for input device audio feedback: Systems, methods, apparatuses and computer program products configured to provide sound feedback for input devices are described. Embodiments take input from a digitizer, such as input using as stylus/pen, and produce sound feedback to enhance the user's input interface experience. Embodiments thus provide a user with a more realistic interface... Agent: Lenovo (singapore) Pte. Ltd.
20110320208 - Page identification method for audio book: A page identification method for audio book with a main housing, a plurality of pages, a plurality of light blocking panels, an audio record and playback electronic circuit including microphone, speaker, power source, record switch and playback switch, a microprocessor and a plurality of light sensing devices. The top surface... Agent:
20110320212 - Audio signal encoding method, audio signal decoding method, encoding device, decoding device, audio signal processing system, audio signal encoding program, and audio signal decoding program: When a frame immediately preceding an encoding target frame to be encoded by a first encoding unit operating under a linear predictive coding scheme is encoded by a second encoding unit operating under a coding scheme different from the linear predictive coding scheme, the encoding target frame can be encoded... Agent:
20110320214 - Dual streaming with exchange of fec streams by audio sinks: A system and method is described herein in which an audio source wirelessly transmits audio content to a first audio sink over one wireless link and to a second audio sink over another wireless link. The two audio sinks also exchange forward error correction (FEC) streams over a link between... Agent: Broadcom Corporation
20110320209 - Frequency domain multiband dynamics compressor with automatically adjusting frequency band boundary locations: A multiband dynamics compressor implements a frequency-domain solution for addressing unwanted magnitude peaks which may occur at the crossover frequency (boundary) between two adjacent frequency bands. The solution proposes making slight adjustments to the frequency band boundary locations, for example on a frame-by-frame basis, in order to prevent a spectral... Agent: Stmicroelectronics, Inc.
20110320211 - Method and apparatus for processing signal: A method and an apparatus for processing a signal are provided. The method includes: obtaining an energy average value of each sub-band for a current frame frequency-domain signal; obtaining a current frame modification coefficient of each sub-band for the current frame frequency-domain signal according to a spectral envelope and the... Agent:
20110320210 - Multiband dynamics compressor with spectral balance compensation: A multiband dynamics compressor implements a solution for minimizing unwanted changes to the long-term frequency response. The solution essentially proposes undoing the multiband compression in a controlled manner using much slower smoothing times. In this regard, the compensation provided acts more like an equalizer than a compressor. What is applied... Agent: Stmicroelectronics, Inc.
20110320213 - Time-warping of decoded audio signal after packet loss: A technique is described for use in a decoder configured to decode a series of frames representing an encoded audio signal. The technique is for transitioning between a lost frame and one or more received frames following the lost frame in the series of frames. In accordance with the technique,... Agent: Broadcom Corporation12/22/2011 > 25 patent applications in 18 patent subcategories. patent applications/inventions, industry category
20110313754 - Language translation of selected content in a web conference: A method for translating selected content in a web conference may include receiving, by a processing device, a selected area in an image from a shared application in a web conference. The selected area may contain text for translation into a chosen language. The method may also include performing an... Agent: International Business Machines Corporation
20110313755 - Multilanguage web page translation system and method for translating a multilanguage web page and providing the translated web page: A system and method for translating a multilingual web page are provided. The method includes receiving an attempt of a user to access a specific web site through the Internet, grasping in which country a language registered by a user, a language used in an area corresponding to the IP... Agent:
20110313757 - Systems and methods for advanced grammar checking: In embodiments of the present invention improved capabilities are described for a method of grammar checking, comprising providing a first level of grammar checking through a computer-based grammar checking facility to grammar check a body of text provided by a source in order to improve the grammatical correctness of the... Agent: Applied Linguistics LLC
20110313756 - Text sizer (tm): This invention called Text Sizer ™ is an innovative method and system for changing the length of a body of text. It may be embodied in the following steps. First, a first text segment may be selected in a body of text. Second, alternative text segments are automatically identified, wherein... Agent:
20110313758 - Method and arrangement for processing of speech quality estimate: Method and arrangement for processing of a speech quality estimate, which involve adaption of a speech quality estimate based on information related to the bandwidth of a reference signal used when determining said speech quality estimate, such that the adapted speech quality estimate is independent of the bandwidth of the... Agent: Telefonaktiebolaget L M Ericsson (publ)
20110313759 - Method for changing the caller voice during conversation in voice communication device: The invention relates to a cellular phone terminal system and in particular to a method for changing caller's voice of speech signal during conversation. The cellular phone terminal system has a filter for filtering signal. The method comprises the steps of: waiting for a caller voice selector key input for... Agent:
20110313760 - Signal decomposition, analysis and reconstruction: The present invention provides a system and method for representing quasi-periodic (“qp”) waveforms comprising, representing a plurality of limited decompositions of the qp waveform, wherein each decomposition includes a first and second amplitude value and at least one time value. In some embodiments, each of the decompositions is phase adjusted... Agent: Digital Intelligence, L.L.C.
20110313761 - Method for encoding signal, and method for decoding signal: The present disclosure relates to a method, apparatus, and system for encoding and decoding signals. The encoding method includes: converting a first-domain signal into a second-domain signal; performing Linear Prediction (LP) processing and Long-Term Prediction (LTP) processing for the second-domain signal; obtaining a long-term flag according to decision criteria; obtaining... Agent:
20110313762 - Speech output with confidence indication: A method, system, and computer program product are provided for speech output with confidence indication. The method includes receiving a confidence score for segments of speech or text to be synthesized to speech. The method includes modifying a speech segment by altering one or more parameters of the speech proportionally... Agent: International Business Machines Corporation
20110313763 - Pickup signal processing apparatus, method, and program product: According to one embodiment, a pickup signal processing apparatus includes microphones, a sound determining unit, a signal level calculating unit, a setting unit, and a calculating unit. The sound determining unit determines whether pickup signals picked up by the microphones are signals from a neighboring sound source or a background... Agent: Kabushiki Kaisha Toshiba
20110313764 - System and method for latency reduction for automatic speech recognition using partial multi-pass results: A system and method is provided for reducing latency for automatic speech recognition. In one embodiment, intermediate results produced by multiple search passes are used to update a display of transcribed text.... Agent: At&t Intellectual Property Ii, L.p.
20110313765 - Conversational subjective quality test tool: A method for assessing quality of conversational speech between nodes of a communication network (1), comprising establishing a voice communication session via the communication network (1) between a user at a user terminal (2) and a virtual subject system (4), the virtual subject system (4) and user terminal (2) being... Agent: Alcatel Lucent
20110313766 - Identification of people using multiple types of input: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The... Agent: Microsoft Corporation
20110313768 - Compound gesture-speech commands: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized... Agent:
20110313767 - System and method for data intensive local inference: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating an accent source. A system practicing the method collects data associated with customer specific services, generates country-specific or dialect-specific weights for each service in the customer specific services list, generates a summary weight based on an aggregation of... Agent: At&t Intellectual Property I, L.p.
20110313769 - Method and system for automatically detecting morphemes in a task classification system using lattices: In an embodiment, a lattice of phone strings in an input communication of a user may be recognized, wherein the lattice may represent a distribution over the phone strings. Morphemes in the input communication of the user may be detected using the recognized lattice. Task-type classification decisions may be made... Agent: At&t Intellectual Property Ii, L.p.
20110313770 - Electronic emergency messaging system: An electronic alert apparatus comprises a radio frequency receiver configured to receive and identify an emergency message preamble that indicates an impending transmission of an emergency message sent at a first data rate and an emergency message addressed to a shared device address sent at a second data rate. The... Agent:
20110313771 - Method and device for audibly instructing a user to interact with a function: A method for audibly instructing a user to interact with a function. A function is associated with a user-written selectable item. The user-written selectable item is recognized on a surface. In response to recognizing the user-written selectable item, a first instructional message related to the operation of the function is... Agent: Leapfrog Enterprises, Inc.
20110313772 - System and method for unit selection text-to-speech using a modified viterbi approach: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for speech synthesis. A system practicing the method receives a set of ordered lists of speech units, for each respective speech unit in each ordered list in the set of ordered lists, constructs a sublist of speech units from a... Agent: At&t Intellectual Property I, L.p.
20110313773 - Search apparatus, search method, and program: A search apparatus includes a sound recognition unit which recognizes input sound, a user information estimation unit which estimates at least one of a physical condition and emotional demeanor of a speaker of the input sound based on the input sound and outputs user information representing the estimation result, a... Agent:
20110313774 - Methods, systems, and products for measuring health: Methods, systems, and products measure health data related to a user. A spoken phrase is received and time-stamped. The user is identified from the spoken phrase. A window of time is determined from a semantic content of the spoken phrase. A sensor measurement is received and time-stamped. A difference in... Agent:
20110313776 - System and method for controlling devices that are connected to a network: A system, method and computer-readable medium for controlling devices connected to a network. The method includes receiving an utterance from a user for remotely controlling a device in a network; converting the received utterance to text using an automatic speech recognition module; accessing a user profile in the network that... Agent: At&t Intellectual Property Ll, L.p.
20110313775 - Television remote control data transfer: A computer-implemented method for information sharing between a portable computing device and a television system includes receiving a spoken input from a user of the portable computing device, by the portable computing device, submitting a digital recording of the spoken query from the portable computing device to a remote server... Agent: Google Inc.
20110313777 - Apparatus, method and computer program for obtaining a parameter describing a variation of a signal characteristic of a signal: An apparatus for obtaining a parameter describing a variation of a signal characteristic of a signal on the basis of actual transform-domain parameters describing the audio signal in transform-domain includes a parameter determinator. The parameter determinator is configured to determine one or more model parameters of a transform-domain variation model... Agent: Fraunhofer-gesellschaft Zur Foerderung Der Angewandten Forschung E.v.
20110313778 - Method and apparatus for adaptively encoding and decoding high frequency band: Provided are a method and apparatus for encoding and decoding an audio signal. According to the present application, a signal of a high frequency band above a preset frequency band is adaptively encoded or decoded in the time domain or in the frequency domain by using a signal of a... Agent: Samsung Electronics Co., Ltd12/15/2011 > 22 patent applications in 16 patent subcategories. patent applications/inventions, industry category
20110307240 - Data modeling of multilingual taxonomical hierarchies: Translations are provided as a property in multilingual taxonomical hierarchies. Translations for each node in a tree structure are associated with the node of a primary language as labels, where each node can have a plurality of labels. If the translation into a secondary language does not exist, a default... Agent: Microsoft Corporation
20110307241 - Enhanced speech-to-speech translation system and methods: A speech translation system and methods for cross-lingual communication that enable users to improve and modify content and usage of the system and easily abort or reset translation. The system includes a speech recognition module configured for accepting an utterance, a machine translation module, an interface configured to communicate the... Agent: Mobile Technologies, LLC
20110307242 - Method for realtime spoken natural language translation and apparatus therefor: A method and apparatus for performing real time automatic translation of the spoken language are provided. A spoken language input is received at one end comprising at least one source language and delivered at other end comprising at least one target language using natural language processing technology and language translation... Agent:
20110307244 - Joint optimization for machine translation system combination: A joint optimization strategy is employed for combining translation hypotheses from multiple machine-translation systems. Decisions on word alignment, between the hypotheses, ordering, and selection of a combined translation output are made jointly in accordance with a set of features. Additional features that model alignment and ordering behavior are also provided... Agent: Microsoft Corporation
20110307243 - Multilingual runtime rendering of metadata: Translated versions of metadata are provided to a user at runtime based on a working language selection. Translated versions of metadata in secondary languages are associated with an original version in a primary language as properties instead of being separate items within an infrastructure hierarchy. The property may be selected... Agent: Microsoft Corporation
20110307245 - Word alignment method and system for improved vocabulary coverage in statistical machine translation: A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus of text strings provides pairs of text strings, primarily sentences, in source and target languages. A first alignment between a text string pair creates links therebetween. Each link links a single token... Agent: Xerox Corporation
20110307246 - Methods and systems for changing a communication quality of a communication session based on a meaning of speech data: Methods and systems are described for changing a communication quality of a communication session based on a meaning of speech data. Speech data exchanged between clients participating in a communication session is parsed. A meaning of the parsed speech data is determined. An action is performed to change a communication... Agent:
20110307247 - Method and system for lexical navigation of items: A method and a system for lexical navigation of a corpus of items are provided. For example, the method may include generating a data structure in a non-transitory, computer readable medium. The data structure may include a number of items, a number of keywords, and a frequency that each of... Agent:
20110307248 - Encoder, decoder, and method therefor: Provided is an encoder which can effectively encode/decode spectrum data of a broad frequency signal in a high frequency range, can dramatically reduce the number of the arithmetic operations to be performed, and can improve the quality of the decoded signal. The encoder comprises a first layer coding unit (202)... Agent: Panasonic Corporation
20110307249 - Method and acoustic signal processing system for interference and noise suppression in binaural microphone configurations: A method determines a bias reduced noise and interference estimation in a binaural microphone configuration with a right and a left microphone signal at a time-frame with a target speaker active. The method includes a determination of the auto power spectral density estimate of the common noise formed of noise... Agent: Siemens Medical Instruments Pte. Ltd.
20110307250 - Modular speech recognition architecture: A speech recognition system is provided. The speech recognition system includes a speech recognition module; a plurality of domain specific dialog manager modules that communicate with the speech recognition module to perform speech recognition; and a speech interface module that that communicates with the plurality of domain specific dialog manager... Agent: Gm Global Technology Operations, Inc.
20110307251 - Sound source separation using spatial filtering and regularization phases: Described is a multiple phase process/system that combines spatial filtering with regularization to separate sound from different sources such as the speech of two different speakers. In a first phase, frequency domain signals corresponding to the sensed sounds are processed into separated spatially filtered signals including by inputting the signals... Agent: Microsoft Corporation
20110307252 - Using utterance classification in telephony and speech recognition applications: Described is the use of utterance classification based methods and other machine learning techniques to provide a telephony application or other voice menu application (e.g., an automotive application) that need not use Context-Free-Grammars to determine a user's spoken intent. A classifier receives text from an information retrieval-based speech recognizer and... Agent: Microsoft Corporation
20110307253 - Speech and noise models for speech recognition: An audio signal generated by a device based on audio input from a user may be received. The audio signal may include at least a user audio portion that corresponds to one or more user utterances recorded by the device. A user speech model associated with the user may be... Agent: Google Inc.
20110307254 - Speech recognition involving a mobile device: A system and method of speech recognition involving a mobile device. Speech input is received (202) on a mobile device (102) and converted (204) to a set of phonetic symbols. Data relating to the phonetic symbols is transferred (206) from the mobile device over a communications network (104) to a... Agent:
20110307255 - System and method for conversion of speech to displayed media data: Specifically, the invention contemplates a method where the program converts a spoken word to a text string, compares that text string to an image library containing media data that is associated with the text string, and if the text string matches a text string in the library, projects the media... Agent: Logoscope LLC
20110307256 - Systems and methods for providing network-based voice authentication: A system enables voice authentication via a network. The system may include an intelligent voice response engine operatively coupled to the network for receiving transaction or access requests from a plurality of telecommunications devices over the network. A speech recognition and verification services engine may be operatively coupled to the... Agent: Verizon Business Global LLC
20110307257 - Methods and apparatus for real-time interaction analysis in call centers: A method and system for indicating in real time that an interaction is associated with a problem or issue, comprising: receiving a segment of an interaction in which a representative of the organization participates; extracting a feature from the segment; extracting a global feature associated with the interaction; aggregating the... Agent: Nice Systems Ltd.
20110307258 - Real-time application of interaction anlytics: A method and apparatus for providing real-time assistance related to an interaction associated with a contact center, comprising steps or components for: receiving at least a part of an audio signal of an interaction captured by a capturing device associated with an organization, and metadata information associated with the interaction;... Agent: Nice Systems Ltd.
20110307259 - System and method for audio content navigation: A system and method for communicating one or more audio files through a network. One or more original files of an original web site are converted into one or more audio files. An indication is provided to a user that the one or more original files are available as the... Agent:
20110307260 - Multi-modal gender recognition: Gender recognition is performed using two or more modalities. For example, depth image data and one or more types of data other than depth image data is received. The data pertains to a person. The different types of data are fused together to automatically determine gender of the person. A... Agent:
20110307261 - Quantizing a joint-channel-encoded audio signal: Provided are, among other things, systems, methods and techniques for quantizing a joint-channel-encoded audio signal, e.g., by: (a) obtaining an audio signal that includes a plurality of channels, with each channel including a block of samples; (b) segmenting the samples within each of a plurality of the blocks into quantization... Agent:12/08/2011 > 29 patent applications in 19 patent subcategories. patent applications/inventions, industry category
20110301934 - Machine based sign language interpreter: A computer implemented method for performing sign language translation based on movements of a user is provided. A capture device detects motions defining gestures and detected gestures are matched to signs. Successive signs are detected and compared to a grammar library to determine whether the signs assigned to gestures make... Agent: Microsoft Corporation
20110301936 - Interpretation terminals and method for interpretation through communication between interpretation terminals: A method for interpreting a dialogue between two terminals includes establishing a communication channel between interpretation terminals of two parties in response to an interpretation request; specifying a language of an initiating party and a language of the other party in each of the interpretation terminals of the two parties... Agent: Electronics And Telecommunications Research Institute
20110301935 - Locating parallel word sequences in electronic documents: Systems and methods for automatically extracting parallel word sequences from comparable corpora are described. Electronic documents, such as web pages belonging to a collaborative online encyclopedia, are analyzed to locate parallel word sequences between electronic documents written in different languages. These parallel word sequences are then used to train a... Agent: Microsoft Corporation
20110301937 - Electronic reading device: The present invention provides an electronic reading device. At the device, a voice is captured by a capturing unit, and then the reference information stored in a storing unit is received by a processing unit for converting the voice to a visual image signal based on the reference information. Afterwards,... Agent: E Ink Holdings Inc.
20110301939 - Methods and systems for selecting a language for text segmentation: Methods and systems for selecting a language for text segmentation are disclosed. In one embodiment, at least a first candidate language and a second candidate language associated with a string of characters are identified, at least a first segmented result associated with the first candidate language and a second segmented... Agent: Google Inc.
20110301938 - Multilingual tagging of content with conditional display of unilingual tags: One or more computers are programmed to obtain an identifier of a natural language (“session language”). Additionally, the one or more computers are programmed to create and store in a computer memory, a webpage to be displayed to the user, including at least a title of a piece of content.... Agent: Oracle International Corporation
20110301940 - Free text voice training: A system and method provide acoustic training of a voice or speech recognition engine and/or voice or speech recognition software application. Instead of requiring a user to read from a prepared or predetermined script, the system and method described herein enable acoustic training using any free text spoken phrases provided... Agent:
20110301942 - Method and apparatus for full natural language parsing: The method and apparatus for discriminative natural language parsing, uses a deep convolutional neural network adapted for text and a structured tag inference in a graph. In the method and apparatus, a trained recursive convolutional graph transformer network, formed by the deep convolutional neural network and the graph, predicts “levels”... Agent: Nec Laboratories America, Inc.
20110301941 - Natural language processing method and system: A computer implemented natural language processing method, the method including the steps of: analysing a sentence string within textual information to determine sub-components of the sentence string, assigning one or more unique tokens to each determined sub-component, determining a probability of use that a determined sub-component has one or more... Agent: Syl Research Limited
20110301943 - System and method of dictation for a speech recognition command system: In embodiments of the present invention, a system and computer-implemented method for enabling dictation may include parsing standard reports in order to identify a plurality of logical phrases in the report used for discrete sections and descriptions. In the report method, the phrases may be parsed and identifier words throughout... Agent: Redstart Systems, Inc.
20110301944 - Diver audio communication system: An underwater communications system is provided that transmits electromagnetic and/or magnetic signals to a remote receiver. The transmitter includes a data input. A digital data compressor compresses data to be transmitted. A modulator modulates compressed data onto a carrier signal. An electrically insulated, magnetic coupled antenna transmits the compressed, modulated... Agent:
20110301945 - Speech signal processing system, speech signal processing method and speech signal processing program product for outputting speech feature: A speech signal processing system which outputs a speech feature, divides an input speech signal into frames so that each pair of consecutive frames have a frame shift length equal to at least one period of the speech signal and have an overlap equal to at least a predetermined length,... Agent: International Business Machines Corporation
20110301947 - Systems, processes and integrated circuits for rate and/or diversity adaptation for packet communications: Packets of real-time information are sent with a source rate greater than zero kilobits per second, and a time or path or combined time/path diversity rate initially being zero kilobits per second. This results in a quality of service QoS, optionally measured at the sender or the receiver. When the... Agent: Texas Instruments Incorporated
20110301946 - Tone determination device and tone determination method: Disclosed is a tone determination device that determines the tonality of an input signal using correlations between the frequency components of a current frame with the frequency components of the preceding frame, such that the tone determination device is able to decrease the calculation complexity. In the device, a vector... Agent: Panasonic Corporation
20110301948 - Echo-related decisions on automatic gain control of uplink speech signal in a communications device: A method for performing a call between a near-end user and a far-end user, which includes the following operations performed during the call by the near-end user's communications device. Automatic gain control (AGC) is performed to update a gain applied to an uplink speech signal. A frame is detected in... Agent: Apple Inc.
20110301949 - Speaker-cluster dependent speaker recognition (speaker-type automated speech recognition): In an example embodiment, there is disclosed herein an automatic speech recognition (ASR) system that employs speaker clustering (or speaker type) for transcribing audio. A large corpus of audio with corresponding transcripts is analyzed to determine a plurality of speaker types (e.g., dialects). The ASR system is trained for each... Agent:
20110301950 - Speech input device, speech recognition system and speech recognition method: A device for speech input includes a speech input unit configured to convert a speech of a user to a speech signal; an angle detection unit configured to detect an angle of the speech input unit; a distance detection unit configured to detect a distance between the speech input unit... Agent: Kabushiki Kaisha Toshiba
20110301951 - Electronic questionnaire: A questionnaire is presented to a user in a more efficient manner in which the user is more likely to participate. The questionnaire is sent electronically to the user's vehicle and presented audibly to the user. The user responds audibly to the questions in the questionnaire. The user's responses are... Agent:
20110301952 - Speech recognition processing system and speech recognition processing method: The present invention provides a speech recognition processing system in which speech recognition processing is executed parallelly by plural speech recognizing units. Before text data as the speech recognition result is output from each of the speech recognizing units, information indicating each speaker is parallelly displayed on a display in... Agent: Nec Corporation
20110301953 - System and method of multi model adaptation and voice recognition: A method of multi model adaptation according to the exemplary embodiment of the present invention includes: selecting any one model designated by a speaker; extracting a feature vector used in a voice model from an inputted voice of the speaker; adapting the extracted feature vector by using a predetermined pronunciation... Agent: Seoby Electronic Co., Ltd
20110301955 - Predicting and learning carrier phrases for speech input: Predicting and learning users' intended actions on an electronic device based on free-form speech input. Users' actions can be monitored to develop of a list of carrier phrases having one or more actions that correspond to the carrier phrases. A user can speak a command into a device to initiate... Agent: Google Inc.
20110301956 - Information processing apparatus, information processing method, and program: An information processing apparatus includes an image analysis unit that executes a process for analyzing an image captured by a camera, a speech analysis unit that executes a process for analyzing speech input from a microphone, and a data processing unit that receives a result of the analysis conducted by... Agent:
20110301957 - System and/or method for audibly prompting a patient with a motion device: A patient notification and response system comprises a communications network, a server, a workstation, and a remote motion device. The server is operatively connected to the communications network and comprises a database and a script generator. The database comprises script programs, where each script program is associated with a patient... Agent: Roy-g-biv Corporation
20110301958 - System-initiated speech interaction: Whenever an event occurs on a computing system which will accept a response from a user of the system, the system automatically determines whether or not to enable speech interaction with the system for the event response. Whenever speech interaction is enabled with the system for the event response, the... Agent: Microsoft Corporation
20110301959 - Voice acquisition system for a vehicle: A voice acquisition system for a vehicle includes an interior rearview mirror assembly attached at an inner portion of the windshield of a vehicle equipped with the interior rearview mirror assembly. The interior rearview mirror assembly includes at least two microphones for receiving audio signals within a cabin of the... Agent: Donnelly Corporation
20110301960 - Coding apparatus, coding method, decoding apparatus, decoding method, and program: A coding apparatus includes a generation unit configured to generate first coding information used for first coding of a first audio signal and second coding information used for second coding of a second audio signal, and generate third coding information used for the first coding of the second audio signal... Agent:
20110301961 - Method and apparatus for encoding and decoding audio signal using adaptive sinusoidal coding: A method and an apparatus for encoding and decoding audio signals using adaptive sinusoidal coding are provided. The audio signal encoding method includes the steps of dividing a synthesized audio signal into a plurality of sub-bands, calculating the energy of each sub-band, selecting a predetermined number of sub-bands having a... Agent:
20110301962 - Stereo encoding method and apparatus: A stereo encoding method and apparatus are provided, so as to reduce distortion caused by delay adjustment. The stereo encoding method includes: extracting a current interchannel delay of a stereo signal and a previous delay adjacent to the current interchannel delay; performing adjustment frame judgment according to characteristics of the... Agent:12/01/2011 > 20 patent applications in 15 patent subcategories. patent applications/inventions, industry category
20110295589 - Locating paraphrases through utilization of a multipartite graph: A method is described herein that includes acts of receiving a selection of a first phrase in a first language and executing a random walk over a computer-implemented multipartite graph, wherein the multipartite-graph includes a first set of nodes that are representative of phrases in the first language, a second... Agent: Microsoft Corporation
20110295590 - Acoustic model adaptation using geographic information: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, adapting one or... Agent: Google Inc.
20110295593 - Automated message attachment labeling using feature selection in message content: Embodiments are directed towards an automated machine learning framework to extract keywords within a message that are relevant to an attachment to the message. The machine learning model finds a set of relevant sentences within the message determined to be relevant to the one or more attachments based on identification... Agent: Yahoo! Inc.
20110295595 - Document processing, template generation and concept library generation method and apparatus: The present invention relates to document processing method and apparatus which can edit a natural language and generate a machine-processable document; a template generating method and apparatus which can be used for document processing method and apparatus; a concept library generating method and apparatus which can be used for the... Agent: International Business Machines Corporation
20110295592 - Survey analysis and categorization assisted by a knowledgebase: The disclosure generally relates to knowledge retrieval using a knowledgebase storing general and/or expert knowledge. In particular, the disclosure relates to using an enhanced knowledgebase to implement a tool for analysis and categorization of surveys.... Agent: Bank Of America Corporation
20110295591 - System and method to acquire paraphrases: An automatic paraphrase acquisition technique is provided. A common theme of the various embodiments described herein resides in careful design of simple tasks that can elicit the necessary information for the automated process. These tasks are performed quickly and inexpensively. By gathering the results produced, paraphrases can be generated automatically... Agent: Palo Alto Research Center Incorporated
20110295594 - System, method, and program for processing text using object coreference technology: System, method and program product for text processing using object coreference technology. In particular, the invention provides a text processing method which includes, acquiring text to be processed; extracting subject words and entity words corresponding to the subject words from the text; grouping the subject words; determining entity words that... Agent: International Business Machines Corporation
20110295596 - Digital voice recording device with marking function and method thereof: A digital voice recording device includes a storage unit, a display unit, and a processing unit. The processing unit includes a recording module, a storing module, a marking module, and a playing module. The recording module converts audio into digital signals, and records the digital signals into an audio file.... Agent: Hon Hai Precision Industry Co., Ltd.
20110295597 - System and method for automated analysis of emotional content of speech: A method and apparatus for automated analysis of emotional content of speech is presented. Telephony calls are routed via a network such as public service telephone network (PSTN) and delivered to an interactive voice response system (IVR) where prerecorded or synthesized prompts guide a caller to speech responses. Speech responses... Agent:
20110295598 - Systems, methods, apparatus, and computer program products for wideband speech coding: Methods of audio coding are described in which an excitation signal for a first frequency band of the audio signal is used to calculate an excitation signal for a second frequency band of the audio signal that is separated from the first frequency band.... Agent: Qualcomm Incorporated
20110295599 - Aligning scheme for audio signals: Methods, devices, and computer programs described herein may segment a reference signal that corresponds to a non-degraded signal into a plurality of reference signal segments; generate filter coefficients based on each reference signal segment; and filter each reference signal segment with its corresponding generated filter coefficients. The methods, devices, and... Agent: Telefonaktiebolaget Lm Ericsson (publ)
20110295600 - Apparatus and method determining weighting function for linear prediction coding coefficients quantization: An apparatus determining a weighting function for line prediction coding coefficients quantization converts a linear prediction coding (LPC) coefficient of an input signal into one of a line spectral frequency (LSF) coefficient and an immitance spectral frequency (ISF) coefficient and determines a weighting function associated with one of an importance... Agent: Samsung Electronics Co., Ltd.
20110295601 - System and method for automatic identification of speech coding scheme: Methods and systems for extracting speech from such packet streams. The methods and systems analyze the encoded speech in a given packet stream, and automatically identify the actual speech coding scheme that was used to produce it. These techniques may be used, for example, in interception systems where the identity... Agent:
20110295602 - Apparatus and method for model adaptation for spoken language understanding: An apparatus and a method are provided for building a spoken language understanding model. Labeled data may be obtained for a target application. A new classification model may be formed for use with the target application by using the labeled data for adaptation of an existing classification model. In some... Agent: At&t Intellectual Property Ii, L.p.
20110295603 - Speech recognition accuracy improvement through speaker categories: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition. In one aspect, a computer-based method includes receiving a speech corpus at a speech management server system that includes multiple speech recognition engines tuned to different speaker types; using the speech recognition engines to... Agent:
20110295604 - System and method for automatic verification of the understandability of speech: Disclosed herein are systems, methods, and computer-readable storage media for processing a message received from a user to determine whether an estimate of intelligibility is below an intelligibility threshold. The method includes recognizing a portion of a user's message that contains the one or more expected utterances from a critical... Agent: At&t Intellectual Property Ii, L.p.
20110295605 - Speech recognition system and method with adjustable memory usage: This speech recognition system provides a function that is capable of adjusting memory usage according to the different target resources. It extracts a sequence of feature vectors from input speech signal. A module for constructing search space reads a text file and generates a word-level search space in an off-line... Agent: Industrial Technology Research Institute
20110295606 - Contextual conversion platform: A contextual conversion platform, and method for converting text-to-speech, are described that can convert content of a target to spoken content. Embodiments of the contextual conversion platform can identify certain contextual characteristics of the content, from which can be generated a spoken content input. This spoken content input can include... Agent:
20110295607 - System and method for recognizing emotional state from a speech signal: A computerized method, software, and system for recognizing emotions from a speech signal, wherein statistical and MFCC features are extracted from the speech signal, the MFCC features are sorted to provide a basis for comparison between the speech signal and reference samples, the statistical and MFCC features are compared between... Agent:
20110295608 - Methods for improving high frequency reconstruction: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilising high frequency reconstruction (HFR). It utilises a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the... Agent:Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130613:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 0.84114 seconds