Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents
FreshPatents.com Logo FreshPatents.com icons
Monitor Keywords Patent Organizer File a Provisional Patent Browse Inventors Browse Industry Browse Agents



USPTO Class 704  |  Browse by Industry: Previous - Next | All     monitor keywords
03/2010 | Recent  |  13: May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan |  | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn |  | 2008 | 2007 |

Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression March listing by industry category 03/10

Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application.
  
03/25/2010 > patent applications in patent subcategories. listing by industry category

20100076745 - Apparatus and method of detecting community-specific expression: Conventional publications concerning collections of community specific expressions include collections of technical terms including nouns and compound nouns in technical fields. However, application to new expressions other than nouns is difficult. Even in the field of collection of unknown words and new words, the objective is limited substantially to nouns,... Agent: Hewlett-packard Company Intellectual Property Administration

20100076746 - Computerized statistical machine translation with phrasal decoder: A computerized system for performing statistical machine translation with a phrasal decoder is provided. The system may include a phrasal decoder trained prior to run-time on a monolingual parallel corpus, the monolingual parallel corpus including a machine translation output of source language documents of a bilingual parallel corpus and a... Agent: Microsoft Corporation

20100076747 - Mass electronic question filtering and enhancement system for audio broadcasts and voice conferences: A system for providing electronic filtering and enhancement for audio broadcasts and voice conferences. The system can comprise one or more computing devices configured to record one or more spoken segments, wherein the one or more spoken segments are comprised of utterances. The system can also include one or more... Agent: Wolf Greenfield & Sacks, P.C.

20100076748 - Computer-based device for generating multilanguage threat descriptions concerning computer threats: A computer-based device for generating multilanguage threat descriptions concerning computer threats like phishing and malware including viruses, worms, trojans, adware, spyware and other security-related risks comprises a database storing data as templates and objects relevant for the threat description, an interaction portion including output means for displaying said templates and... Agent: Henry M Feiereisen, LLC Henry M Feiereisen

20100076749 - Language processing system, language processing method, language processing program, and recording medium: A language processing system according to the present invention includes: an input device 1 that receives an input of an input document; and a unit selecting dictionary 22 that selects a document-information-attached user dictionary that is a user dictionary to which document information is attached. The unit selecting dictionary 22... Agent: Mr. Jackson Chen

20100076750 - System for low-latency animation of talking heads: Methods and apparatus for rendering a talking head on a client device are disclosed. The client device has a client cache capable of storing audio/visual data associated with rendering the talking head. The method comprises storing sentences in a client cache of a client device that relate to bridging delays... Agent: At & T Legal Department - Ndq

20100076752 - Automated data cleanup: The described implementations relate to automated data cleanup. One system includes a language model generated from language model seed text and a dictionary of possible data substitutions. This system also includes a transducer configured to cleanse a corpus utilizing the language model and the dictionary.... Agent: Microsoft Corporation Patent Group Docketing Dept.

20100076751 - Voice recognition system: A voice recognition system used for onboard equipment having a genre database (DB) that stores search target vocabularies in accordance with respective genres. It has a mike 1 for outputting speech sounds as spoken data; a first voice recognition dictionary 2a for recognizing words of search target genres in the... Agent: Birch Stewart Kolasch & Birch

20100076753 - Dialogue generation apparatus and dialogue generation method: A dialogue generation apparatus includes a reception unit configured to receive a first text from a dialogue partner, an information storage unit configured to store profile information specific to a person who can be the dialogue partner and a fixed-pattern text associated with the person, a presentation unit configured to... Agent: Ohlandt, Greeley, Ruggiero & Perle, L.L.P.

20100076754 - Low-delay transform coding using weighting windows: The invention relates to transform coding/decoding of a digital audio signal represented by a succession of frames, using windows of different lengths. For the coding within the meaning of the invention, it is sought to detect (51) a particular event, such as an attack, in a current frame (Ti): and,... Agent: Mckenna Long & Aldridge LLP

20100076755 - Decoding apparatus and audio decoding method: A decoding apparatus that uses a less number of hierarchical layers and a less amount of calculation to obtain a decoded signal having a high quality in terms of audibility. In the decoding apparatus, a first layer decoding part (152) decodes a first layer encoded data. A second layer decoding... Agent: Greenblum & Bernstein, P.L.C

20100076756 - Spatio-temporal speech enhancement technique based on generalized eigenvalue decomposition: The present invention describes a speech enhancement method using microphone arrays and a new iterative technique for enhancing noisy speech signals under low signal-to-noise-ratio (SNR) environments. A first embodiment involves the processing of the observed noisy speech both in the spatial- and the temporal-domains to enhance the desired signal component... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, L.L.P.

20100076757 - Adapting a compressed model for use in speech recognition: A speech recognition system includes a receiver component that receives a distorted speech utterance. The speech recognition also includes an adaptor component that selectively adapts parameters of a compressed model used to recognize at least a portion of the distorted speech utterance, wherein the adaptor component selectively adapts the parameters... Agent: Microsoft Corporation

20100076759 - Apparatus and method for recognizing a speech: A noisy vector is extracted from a noisy speech, which is a clean speech on which a noise is superimposed. A noise parameter of the noise is estimated from the noisy vector. A prior distribution parameter of a clean vector of the clean speech is already stored. A joint Gaussian... Agent: Turocy & Watson, LLP

20100076758 - Phase sensitive model adaptation for noisy speech recognition: A speech recognition system described herein includes a receiver component that receives a distorted speech utterance. The speech recognition also includes an updater component that is in communication with a first model and a second model, wherein the updater component automatically updates parameters of the second model based at least... Agent: Microsoft Corporation

20100076762 - Coarticulation method for audio-visual text-to-speech synthesis: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an... Agent: At & T Legal Department - Ndq

20100076761 - Decoding-time prediction of non-verbalized tokens: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.... Agent: Robert Plotkin, PC

20100076760 - Dialog filtering for filling out a form: The invention discloses a system and method for filling out a form from a dialog between a caller and a call center agent. The caller and the caller center agent can have the dialog in the form of telephone conversation, instant messaging chat or email exchange. The system and method... Agent: Ibm Corporation

20100076763 - Voice recognition search apparatus and voice recognition search method: A voice recognition search apparatus includes: a dictionary create unit creating a first voice recognition dictionary from a search subject data; a voice acquisition unit acquiring first and second voices; a voice recognition unit creating first and second text data by recognizing the first and second voices using the first... Agent: Turocy & Watson, LLP

20100076764 - Method of dialing phone numbers using an in-vehicle speech recognition system: A method of dialing phone numbers using an in-vehicle speech recognition system includes receiving speech input at a vehicle, separating the speech input into a word segment and a digit segment, identifying the letters in a word segment, converting the letters in the word segment to digits, and operating an... Agent: General Motors Corporation C/o Reising, Ethington, Barnes, Kisselle, P.C.

20100076765 - Structured models of repitition for speech recognition: Described is a technology by which a structured model of repetition is used to determine the words spoken by a user, and/or a corresponding database entry, based in part on a prior utterance. For a repeated utterance, a joint probability analysis is performed on (at least some of) the corresponding... Agent: Microsoft Corporation

20100076766 - Method for producing indicators and processing apparatus and system utilizing the indicators: The present invention discloses a method for producing graphical indicators and interactive systems for utilizing the graphical indicators. On the surface of an object, visually negligible graphical indicators are provided. The graphical indicators and main information, i.e. text or pictures, co-exist on the surface of object. The graphical indicators do... Agent: Juan Carlos A. Marquez C/o Stites & Harbison PLLC

20100076767 - Text to speech conversion of text messages from mobile communication devices: A method includes providing a user interface, at a mobile communication device, that includes a first area to receive text input and a second area to receive an identifier associated with an addressee device. The text input and the identifier are received via the user interface. A short message service... Agent: Toler Law Group

20100076768 - Speech synthesizing apparatus, method, and program: Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change amount calculation unit that calculates prosody change amount of each candidate segment based on prosody information of candidate segments and the target... Agent: Young & Thompson

20100076769 - Speech enhancement employing a perceptual model: Speech enhancement based on a psycho-acoustic model is disclosed that is capable of preserving the fidelity of speech while sufficiently suppressing noise including the processing artifact known as “musical noise”.... Agent: Dolby Laboratories Inc.

20100076770 - System and method for improving the performance of voice biometrics: A System and Method for Improving the Performance of Voice biometrics is provided wherein a digitized audio signal originating from at least one input client device is compressed (standards-based or proprietary) or uncompressed, the signal optionally being passed to a network which then passes the uncompressed signal to at least... Agent: Stoll Keenon Ogden PLLC

20100076771 - Voice signal processing apparatus and voice signal processing method: A voice signal processing apparatus and method includes determining maximum amplitude values of a plurality of different voice frame signals obtained by giving different amounts of phase shift to frequency components of voice frame signals having a predetermined length which are divided from a digital voice signal, and selecting a... Agent: Staas & Halsey LLP

20100076772 - Methods and apparatuses for encoding and decoding object-based audio signals: An audio decoding method and apparatus and an audio encoding method and apparatus which can efficiently process object-based audio signals are provided. The audio decoding method includes receiving a downmix signal and object-based side information, the downmix signal comprising at least two downmix channel signals; extracting gain information from the... Agent: Fish & Richardson P.C.

20100076773 - Secure audio stream scramble system: A process for distributing digital audio sequences according to a nominal flux format including a succession of fields, each of which includes at least one digital block clusterizing a selected number of coefficients corresponding to single audio elements that are digitally coded inside the flux and utilized by audio decoders... Agent: Connolly Bove Lodge & Hutz LLP

20100076774 - Audio decoder: An audio decoder (100) comprising: effect means, decoding means, and rendering means. The effect means (500) generate modified down-mix audio signals from received down-mix audio signals. Said received down-mix audio signals comprise a down-mix of a plurality of audio objects. Said modified down-mix audio signals are obtained by applying effects... Agent: Philips Intellectual Property & Standards

  
03/18/2010 > patent applications in patent subcategories. listing by industry category

20100070261 - Method and apparatus for detecting errors in machine translation using parallel corpus: A method for automatically detecting errors in machine translation using a parallel corpus includes analyzing morphemes of a target language sentence in the parallel corpus and a machine-translated target language sentence, corresponding to a source language sentence, to classify the morphemes into words; aligning by words and decoding, respectively, a... Agent: Staas & Halsey LLP

20100070262 - Adapting cross-lingual information retrieval for a target collection: A method and system for generating a bilingual dictionary that maps words of the source language to words of a target language is provided. A Cross-Lingual Information Retrieval (“CLIR”) system accesses a parallel collection that is comprised of a parallel source collection and a parallel target collection, and generates a... Agent: Perkins Coie LLP/msft

20100070264 - Apparatus and method for changing language in mobile communication terminal: An apparatus and a method for supporting many languages in a mobile communication terminal are provided. In the method, at least two installable languages are determined from a multi language image file comprising language packages of at least two languages. One of the at least two installable languages is selected... Agent: JeffersonIPLaw, LLP

20100070265 - Apparatus, system, and method for multilingual regulation management: An apparatus, system, and method are disclosed for automatically displaying regulations in a first language from a search in a second language under the invention, a regulation storage module is configured to store a regulation of a country and associated information in a first language in a database. A regulation... Agent: Kunzler & Mckenzie

20100070263 - Speech data retrieving web site system: A speech data retrieving Web site system is provided which may improve erroneous indexing with participation of a user by allowing the user to correct text data obtained by conversion using a speech recognition technique. Speech data published on a Web is converted into text data by a speech recognition... Agent: Rankin, Hill & Clark LLP

20100070267 - Method and apparatus for qos improvement with packet voice transmission over wireless lans: A method for improving packetized speech transmitted over a wireless LAN is disclosed. Speech packets transmitted over the wireless LAN are monitored for errors. Any of the speech packets found to have errors are replaced with synthesized speech packets. The synthesized speech packets may be created from a vocal tract... Agent: Mr. Brian S. Mudge Kenyon & Kenyon

20100070266 - Performance metrics for telephone-intensive personnel: Systems and methods for generating performance metrics to monitor and/or enhance the performance of telephone-intensive personnel are disclosed. The method generally includes detecting voice activity on a receive and/or a transmit channel in a communications system, outputting voicing decision outputs based on the detecting, storing the voicing decision outputs over... Agent: Plantronics, Inc.IPDepartment/legal

20100070268 - Multimodal unification of articulation for device interfacing: A system for a multimodal unification of articulation includes a voice signal modality to receive a voice signal, and a control signal modality which receives an input from a user and generates a control signal from the input which is selected from predetermined inputs directly corresponding to the phonetic information.... Agent: Jong Hyun Park

20100070269 - Adding second enhancement layer to celp based core layer: In an embodiment, a method of transmitting an input audio signal is disclosed. A first coding error of the input audio signal with a scalable codec having a first enhancement layer is encoded, and a second coding error is encoded using a second enhancement layer after the first enhancement layer.... Agent: Slater & Matsil, L.L.P.

20100070270 - Celp post-processing for music signals: In one embodiment, a method of receiving a decoded audio signal that has a transmitted pitch lag is disclosed. The method includes estimating pitch correlations of possible short pitch lags that are smaller than a minimum pitch limitation and have an approximated multiple relationship with the transmitted pitch lag, checking... Agent: Slater & Matsil, L.L.P.

20100070272 - method and an apparatus for processing a signal: An apparatus for processing an encoded signal and method thereof are disclosed, by which an audio signal can be compressed and reconstructed in higher efficiency. An audio signal processing method includes the steps of identifying whether a coding type of the audio signal is a music signal coding type using... Agent: Birch Stewart Kolasch & Birch

20100070271 - Transmission error concealment in audio signal: A method of concealing transmission error in a digital audio signal, wherein a signal that has been decoded after transmission is received, the samples decoded while the transmitted data is valid are stored, at least one short-term prediction operator and one long-term prediction operator are estimated as a function of... Agent: Cohen, Pontani, Lieberman & Pavane LLP

20100070273 - Speech synthesis and voice recognition in metrologic equipment: An electronic test equipment apparatus is provided. A metrologic device is adapted for creating stimulus signals and capturing responses from electronic devices under test (DUTs). An auditory device is in communication with the metrologic device. The auditory device is adapted for converting an output of the metrologic device to an... Agent: Honeywell/ifl Patent Services

20100070274 - Apparatus and method for speech recognition based on sound source separation and sound source identification: An apparatus for a speech recognition based on source separation and identification includes: a sound source separator for separating mixed signals, which are input to two or more microphones, into sound source signals by using independent component analysis (ICA), and estimating direction information of the separated sound source signals; and... Agent: Staas & Halsey LLP

20100070275 - Speech to message processing: Voice message processors are configured to produce text representations of voice messages. The text representations can be compacted based on one or more abbreviation libraries or rule libraries. Abbreviation processing can be applied to produce a compact text representation based on display properties of a destination device or to enhance... Agent: At&t Legal Department - Dwt

20100070276 - Method and apparatus for interaction or discourse analytics: A method and apparatus for analyzing and segmenting a vocal interaction captured in a test audio source, the test audio source captured within an environment. The method and apparatus first use text and acoustic features extracted from the interaction with tagging information, for constructing a model. Then, at production time,... Agent: Soroker-agmon Advocate And Patent Attorneys

20100070277 - Voice recognition device, voice recognition method, and voice recognition program: A voice recognition device that recognizes a voice of an input voice signal, comprises a voice model storage unit that stores in advance a predetermined voice model having a plurality of detail levels, the plurality of detail levels being information indicating a feature property of a voice for the voice... Agent: Mr. Jackson Chen

20100070278 - Method for creating a speech model: A transformation can be derived which would represent that processing required to convert a male speech model to a female speech model. That transformation is subjected to a predetermined modification, and the modified transformation is applied to a female speech model to produce a synthetic children's speech model. The male... Agent: Kaplan Gilman & Pergament LLP

20100070279 - Piecewise-based variable -parameter hidden markov models and the training thereof: A speech recognition system uses Gaussian mixture variable-parameter hidden Markov models (VPHMMs) to recognize speech under many different conditions. Each Gaussian mixture component of the VPHMMs is characterized by a mean parameter μ and a variance parameter Σ. Each of these Gaussian parameters varies as a function of at least... Agent: Microsoft Corporation

20100070280 - Parameter clustering and sharing for variable-parameter hidden markov models: A speech recognition system uses Gaussian mixture variable-parameter hidden Markov models (VPHMMs) to recognize speech. The VPHMMs include Gaussian parameters that vary as a function of at least one environmental conditioning parameter. The relationship of each Gaussian parameter to the environmental conditioning parameter(s) is modeled using a piecewise fitting approach,... Agent: Microsoft Corporation

20100070282 - Method and apparatus for improving transaction success rates for voice reminder applications in e-commerce: Methods and apparatuses are disclosed for improving transaction success rates for voice reminder applications in e-commerce. In one embodiment of the invention, the voice reminder applications in e-commerce utilizes a network-based text-to-speech (TTS) alert system, which can generate a purchase reminder associated with a recipient's potential purchase. The network-based text-to-speech... Agent: Invent Capture, LLC

20100070281 - System and method for audibly presenting selected text: Disclosed herein are methods for presenting speech from a selected text that is on a computing device. This method includes presenting text on a touch-sensitive display and having that text size within a threshold level so that the computing device can accurately determine the intent of the user when the... Agent: At & T Legal Department - Ndq

20100070283 - Voice emphasizing device and voice emphasizing method: A voice emphasizing device emphasizes in a speech a “strained rough voice” at a position where a speaker or user of the speech intends to generate emphasis or musical expression. Thereby, the voice emphasizing device can provide the position with emphasis of anger, excitement, tension, or an animated way of... Agent: Wenderoth, Lind & Ponack L.L.P.

20100070285 - method and an apparatus for processing an audio signal: The present invention includes receiving a plurality of frame data including first frame data and second frame data encoded by at least one coding schemes, obtaining first flag information indicating whether the first frame data and the second frame data are encoded by frequency domain transform coding scheme, respectively, decoding... Agent: Birch Stewart Kolasch & Birch

20100070287 - Adapting masking thresholds for encoding a low frequency transient signal in audio data: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set... Agent: Hickman Palermo Truong & Becker LLP/apple Inc.

20100070284 - Method and an apparatus for processing a signal:

20100070286 - Technique for controlling codec selection along a complex call path: The invention relates to a technique of operating a call control node controlling at least one section of a call path. The call path includes between two opposite edge nodes a multi-section harmonization path along which codec selection is to be harmonized. A method embodiment of the technique, wherein the... Agent: Ericsson Inc.

  
03/11/2010 > patent applications in patent subcategories. listing by industry category

20100063794 - Method and apparatus for translating hand gestures: A sign language recognition apparatus and method is provided for translating hand gestures into speech or written text. The apparatus includes a number of sensors on the hand, arm and shoulder to measure dynamic and static gestures. The sensors are connected to a microprocessor to search a library of gestures... Agent: Blank Rome LLP

20100063795 - Data processing device, data processing method, and data processing program: [PROBLEMS] To provide a data processing device such as a text mining device capable of extracting characteristic structures properly even in case a plurality of words indicating identical contents or a plurality of words semantically associated are contained in input data. [MEANS FOR SOLVING PROBLEMS] Association node extraction unit (22)... Agent: Sughrue Mion, PLLC

20100063797 - Discovering question and answer pairs: The present invention provides a new approach to extracting question-answer pairs from online forums. The system develops a classification-based technique to discover questions in forums using sequential patterns automatically extracted from both questions and non-question sentences in forums as features. Once the questions are discovered, the system discovers the answers.... Agent: Perkins Coie LLP/msft

20100063798 - Error-detecting apparatus and methods for a chinese article: The invention discloses an error-detecting method for a Chinese article, handling a Chinese sentence including a first erroneous Chinese character string in a first location. The method includes subdividing the first erroneous Chinese character string into a plurality of first subgroups, wherein each of the first subgroups consists of two... Agent: Birch Stewart Kolasch & Birch

20100063800 - Method, system and software for implementing an automated call routing application in a speech enabled call center environment: A system, method and software for implementing an automated call routing application in a speech enabled call center environment are provided. In operation, the invention provides for the identification of a call center transaction selection from a natural language user utterance and the invocation of one or more scripts operable... Agent: At&t Legal Department - Jw Attn: Patent Docketing

20100063799 - Process for constructing a semantic knowledge base using a document corpus: Related free-text documents, a corpus, are used to empirically derive a semantic knowledge base through a method in which documents are segmented into unique sentences, and then used to define sentential propositions which are arranged in a knowledge hierarchy. The method takes compound natural language sentences and transforms them to... Agent: Logical Semantics, Inc.

20100063796 - Word sense disambiguation using emergent categories: Disclosed herein is a computer implemented method and system for word sense disambiguation in a natural language sentence. The natural language sentence is parsed for identifying possible parts of speech for each term and identifying possible phrase structures. Terms comprising one or more linguistic roles are identified. The possible sense... Agent: Ashok Tankha

20100063802 - Adaptive frequency prediction: In one embodiment, a method of transceiving an audio signal is disclosed. The method includes providing low band spectral information having a plurality of spectrum coefficients and predicting a high band extended spectral fine structure from the low band spectral information for at least one subband, where the high band... Agent: Slater & Matsil, L.L.P.

20100063801 - Postfilter for layered codecs: A scalable decoder device (50) for signals representing audio comprises a primary decoder (21) connected to an input (40). The primary decoder (21) is arranged to provide a primary decoded signal (23) based on received parameters (4). A primary postfilter (31) is connected to the primary decoder (23) to provide... Agent: Ericsson Inc.

20100063803 - Spectrum harmonic/noise sharpness control: A transmitted data that includes audio data and a transmitted spectral sharpness parameter representing a spectral harmonic/noise sharpness of a plurality of subbands are received. A measured spectral sharpness parameter is estimated from received audio data. The transmitted spectral sharpness parameter is compared with the measured spectral sharpness parameter. A... Agent: Slater & Matsil, L.L.P.

20100063804 - Adaptive sound source vector quantization device and adaptive sound source vector quantization method: Provided is an adaptive sound source vector quantization device which can always perform a pitch cycle search with a resolution appropriate for any section of the pitch cycle search range of a second sub-frame when a pitch cycle search range of the second sub-frame changes in accordance with a pitch... Agent: Greenblum & Bernstein, P.L.C

20100063806 - Classification of fast and slow signal: Low bit rate audio coding such as BWE algorithm often encounters conflict goal of achieving high time resolution and high frequency resolution at the same time. In order to achieve best possible quality, input signal can be first classified into fast signal and slow signal. This invention focuses on classifying... Agent: Yang Gao

20100063805 - Non-causal postfilter: A decoder arrangement comprising a receiver input for parameters of frame-based coded signals and a decoder arranged to provide frames of decoded audio signals based on the parameters. The receiver input and/or the decoder is arranged to establish a time difference between the occasion when parameters of a first frame... Agent: Ericsson Inc.

20100063808 - Spectral envelope coding of energy attack signal: MDCT or FFT-based audio coding algorithms often have the problem named here spectral pre-echoes when coding an energy attack signal. This invention presents several possibilities to avoid the spectral pre-echoes existing in decoded signal segment before the energy attack point. The spectral envelope before the attack point can be improved... Agent: Yang Gao

20100063807 - Subtraction of a shaped component of a noise reduction spectrum from a combined signal: A system and methods of subtraction of a shaped component of a noise reduction spectrum from a combined signal are disclosed. In an embodiment, a method includes identifying a selected frequency component using a corresponding frequency component of a noise sample spectrum. A noise set is comprised of the noise... Agent: Texas Instruments Incorporated

20100063809 - Double talk detector: A double talk detector for controlling the echo path estimation in a telecommunication system by indicating when a received coded speech signal is dominated by a non-echo signal; i.e., that so-called double talk exists. This is determined by extracting LSPs from a coded speech frame of the received coded speech... Agent: Ericsson Inc.

20100063812 - Efficient temporal envelope coding approach by prediction between low band signal and high band signal: This invention proposes a more efficient way to quantize temporal envelope shaping of high band signal by benefiting from energy relationship between low band signal and high band signal; if low band signal is well coded or it is coded with time domain codec such as CELP, temporal envelope shaping... Agent: Yang Gao

20100063810 - Noise-feedback for spectral envelope quantization: A method of transmitting an input audio signal is disclosed. A current spectral magnitude of the input audio signal is quantized. A quantization error of a previous spectral magnitude is fed back to influence quantization of the current spectral magnitude. The feeding back includes adaptively modifying a quantization criterion to... Agent: Slater & Matsil, L.L.P.

20100063811 - Temporal envelope coding of energy attack signal by using attack point location: A method of transceiving an audio signal is disclosed. An input audio signal is provided. It is determined whether an energy attack signal exists within the input audio signal and a decision flag is set if the energy attack signal exists. A temporal location of the energy attack point in... Agent: Slater & Matsil, L.L.P.

20100063813 - System and method for multidimensional gesture analysis: Hand gestures are translated by first detecting the hand gestures with an electronic sensor and converting the detected gestures into respective electrical transfer signals in a frequency band corresponding to that of speech. These transfer signals are inputted in the audible-sound frequency band into a speech-recognition system where they are... Agent: K.f. Ross P.C.

20100063814 - Apparatus, method and computer program product for recognizing speech: A speech recognition apparatus includes a document input unit configured to input a document including a reference term which a user refers to; a vocabulary storage unit configured to store a vocabulary list including a group of notation information, reading information and part of speech; a hypernym hyponym relation storage... Agent: Turocy & Watson, LLP

20100063815 - Real-time transcription: A computing system accepts audio from one or more sources, parses the audio into chunks, and transcribes the chunks in substantially real time. Some transcription is performed automatically, while other transcription is performed by humans who listen to the audio and enter the words spoken and/or the intent of the... Agent: Bingham Mchale LLP

20100063816 - Method and system for parsing of a speech signal: A method for processing an analog speech signal for speech recognition. The analog speech signal is sampled to produced a sampled speech signal. The sampled speech signal is framed into multiple frames of the sampled speech signal. The absolute value of the sampled speech signal is integrated within the frames... Agent: The Law Office Of Michael E. Kondoudis

20100063817 - Acoustic model registration apparatus, talker recognition apparatus, acoustic model registration method and acoustic model registration processing program: When a talker utters for the N utterances and the utterance sounds of the N utterances are input through the microphone 1, the sound feature quantity extraction part 4 extracts sound feature quantities which indicate the acoustic features of the input utterance sounds, wherein each sound feature quantity has one-to-one... Agent: Sughrue Mion, PLLC

20100063820 - Correlating video images of lip movements with audio signals to improve speech recognition: A speech recognition device can include an audio signal receiver configured to receive audio signals from a speech source, a video signal receiver configured to receive video signals from the speech source, and a processing unit configured to process the audio signals and the video signals. In addition, the speech... Agent: Mcandrews Held & Malloy, Ltd

20100063819 - Language model learning system, language model learning method, and language model learning program: A language model learning system for learning a language model on an identifiable basis relating to a word error rate used in speech recognition. The language model learning system (10) includes a recognizing device (101) for recognizing an input speech by using a sound model and a language model and... Agent: Young & Thompson

20100063818 - Multi-tiered voice feedback in an electronic device: This invention is directed to providing voice feedback to a user of an electronic device. Because each electronic device display may include several speakable elements (i.e., elements for which voice feedback is provided), the elements may be ordered. To do so, the electronic device may associate a tier with the... Agent: Kramer Levin Naftalis & Frankel LLP

20100063821 - Hands-free and non-visually occluding object information interaction system: Technologies are described herein for providing a hands-free and non-visually occluding interaction with object information. In one method, a visual capture of a portion of an object is received through a hands-free and non-visually occluding visual capture device. An audio capture is also received from a user through a hands-free... Agent: Hope Baldauff Hartman, LLC

20100063822 - Communication system for speech disabled individuals: A communication system that is specifically designed for the needs of speech impaired individuals, particularly aphasia victims, makes use of a speech generating mobile terminal communication device (SGMTD) (12) that is designed to be hand held and operated by a speech disabled individual. The SGMTD includes a database of audio... Agent: Jones, Tullar & Cooper, P.C.

20100063823 - Method and system for generating dialogue managers with diversified dialogue acts: A method to generate dialogue manager (DM) is provided, in which a plurality DMs with the same purpose but having different dialogue acts is automatically generated according to a DM designed by a designer. An automatic aiding tool facilitates the design of a dialogue flow and the adjustment of DM... Agent: Jianq Chyun Intellectual Property Office

20100063824 - Apparatus and method for widening audio signal band: An audio signal band expanding apparatus (100a) includes a harmonic generator (3) that receives an input audio signal having a predetermined band and generates, based on the input audio signal, harmonic signals, and an adder (2) that adds the harmonic signals generated by the harmonic generator (3) to the input... Agent: Wenderoth, Lind & Ponack L.L.P.

20100063826 - Computation apparatus and method, quantization apparatus and method, audio encoding apparatus and method, and program: A computation apparatus includes: a range calculation section for calculating a range of an input value that can give a predetermined discrete value obtained by discretizing a computation result of a nonlinear operation; and a discrete value output section for outputting, when the input value is input, the predetermined discrete... Agent: Wolf Greenfield & Sacks, P.C.

20100063827 - Selective bandwidth extension: A method of receiving an audio signal includes measuring a periodicity of the audio signal to determine a checked periodicity. At least one best available subband is determined. At least one extended subband is composed, wherein composing includes reducing a ratio of composed harmonic components to composed noise components if... Agent: Slater & Matsil, L.L.P.

20100063825 - Systems and methods for memory management and crossfading in an electronic device: Systems and methods are disclosed for the management of memory used in a crossfading operation in an electronic device. In one embodiment, a processor is used to alternately decode two audio streams, one which is being faded out and one which is being faded in to implement a crossfade. The... Agent: Apple Inc. C/o Fletcher Yoder, PC

20100063828 - Stream synthesizing device, decoding unit and method: A stream synthesizing device includes an input unit which inputs at least two coded signals each including a first downmix acoustic signal and an extended signal, each of first downmix acoustic signals being obtained by coding an acoustic signal into which at least two sound signals are downmixed, and the... Agent: Wenderoth, Lind & Ponack L.L.P.

  
03/04/2010 > patent applications in patent subcategories. listing by industry category

20100057432 - Method and apparatus for improving word alignment quality in a multilingual corpus: A method for improving word alignment quality in a multilingual corpus including a plurality of corresponding sentence pairs between any two languages among a first language, a second language and at least one other language and word alignment information between each of the plurality of corresponding sentence pairs, the method... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, L.L.P.

20100057431 - Method and apparatus for language interpreter certification: A process provides, if a language interpreter candidate is a beginning level language interpreter candidate, a preliminary self-assessment and a language proficiency test. Further, the process provides, if the language interpreter has a predetermined amount of entry level language interpreter experience or has completed beginning level language interpreter requirements, an... Agent: Patent Ingenuity, PC

20100057430 - Multiple language communication system: A method and system for communicating in more than one language includes a communication system of cards. In one embodiment, the communication system of cards is for a communicator to communicate in a first language to a receiver in a second language. The communication system includes a communication card comprising... Agent: Tod T. Tumey

20100057434 - Image processing apparatus, image processing method, computer-readable medium and computer data signal: An image processing apparatus includes an image receiving unit, a writing detection unit, a writing deletion unit, a character recognition unit, a character string generation unit, a translation unit and a translation image generation unit. The image receiving unit receives an image including a writing. The writing detection unit detects... Agent: Sughrue-265550

20100057436 - Method and portable system for phonetic language translation using brian interface: A phonetic language translation system receives audio output from an audible program presented to the user, so as to identify any speech signal contained within the audio output. The speech signals are broken down into recognizable phonemes which make up the most basic elements of speech in spoken languages. The... Agent: Johnson Manuel-devadoss ("johnson Smith")

20100057435 - System and method for speech-to-speech translation: Disclosed herein are systems and methods for receiving an input speech sample in a first language and outputting a translated speech sample in a second language in the unique voice of a user. According to several embodiments, a translation system includes a translation mode performing the above functions and a... Agent: Stoel Rives LLP - Slc

20100057433 - Systems and methods for providing translations of applications using decentralized contributions: Various embodiments of the present invention provide systems and methods for providing a translation for a set of one or more terms or phrases related to a software application using decentralized contributions. In particular, various embodiments provide systems and methods by which multiple users of the application contribute translations for... Agent: Alston & Bird LLP

20100057437 - Machine-translation apparatus using multi-stage verbal-phrase patterns, methods for applying and extracting multi-stage verbal-phrase patterns: A machine-translation apparatus using multi-level verbal-phrase patterns includes: a simple sentence generation unit for generating an input simple sentence; a basic verbal-phrase pattern-matching unit for trying a match of a semantic code of each case component of the input simple sentence with basic verbal-phrase patterns; a default verbal-phrase pattern matching... Agent: Staas & Halsey LLP

20100057438 - Phrase-based statistics machine translation method and system: A phrase-based statistics machine translation method includes for phrases in an input sentence, performing fuzzy matching in a pre-constructed phrase table. In the method, by performing fuzzy matching on the phrases, high quality translations can be generated for long phrases in the input sentence, thus the quality of the translation... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, L.L.P.

20100057439 - Portable storage medium storing translation support program, translation support system and translation support method: A portable storage medium storing a translation support program supporting translation of an original document being document data containing Japanese and a foreign language for expressing a word of one language in another language includes: correcting the correction target character contained in the original document in accordance with the correction... Agent: Greer, Burns & Crain

20100057441 - Information processing apparatus and operation setting method: An information processing apparatus according to an embodiment of the present invention includes an input unit into which a language used by a user and a usage country are input, a plurality of information providing units that provide program information a display unit that displays the program information, a decision... Agent: Lerner, David, Littenberg, Krumholz & Mentlik

20100057440 - Multi-language support in preboot environment: Systems and methods for providing multi-language support in a pre-boot environment are supplied. User interface type information, such as keyboard type information and translation tables, are ascertained and provided to the pre-boot environment of the apparatus, allowing the apparatus to properly receive and/or translate multi-language inputs in an appropriate fashion.... Agent: Ference & Associates LLC

20100057442 - Device, method, and program for determining relative position of word in lexical space: The position of a word in the lexical space is determined stably and highly accurately by arbitrarily setting a predetermined initial condition, determining the occurrence frequency and cooccurrence relationship of the word under a given condition, and minimizing the difference between the values of the occurrence frequency and cooccurrence and... Agent: Hewlett-packard Company Intellectual Property Administration

20100057443 - Systems and methods for responding to natural language speech utterance: Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user specific profile data to achieve a natural environment... Agent: Pillsbury Winthrop Shaw Pittman, LLP

20100057444 - Method and system of extending battery life of a wireless microphone unit: A method of extending battery life of a wireless microphone unit includes muting the wireless microphone unit responsive to a mute signal from a base station unit, transmitting, by the wireless microphone unit, compressed muted audio data, wherein the compressed muted audio data is compressed via a first compression scheme,... Agent: Winstead PC

20100057445 - System and method for automatically adjusting floor controls for a conversation: A system and method for automatically adjusting floor controls for a conversation is provided. Audio streams are received, which each originate from an audio source. Floor controls for a current configuration including at least a portion of the audio streams are maintained. Conversational characteristics shared by two or more of... Agent: Cascadia Intellectual Property

20100057446 - Encoding device and encoding method: Provided is an encoding device which can obtain a sound quality preferable for auditory sense even if the number of information bits is small. The encoding device includes a shape quantization unit (111) having: a section search unit (121) which searches for a pulse for each of bands into which... Agent: Greenblum & Bernstein, P.L.C

20100057447 - Parameter decoding device, parameter encoding device, and parameter decoding method: Provided is a parameter decoding device which performs parameter compensation process so as to suppress degradation of a main observation quality in a prediction quantization. The parameter decoding device includes amplifiers (305-1 to 305-M) which multiply inputted quantization prediction residual vectors xn−1 to xn-M by a weighting coefficient β1 to... Agent: Greenblum & Bernstein, P.L.C

20100057448 - Multicodebook source-dependent coding and decoding: A method for coding data, includes: grouping data into frames; classifying the frames into classes; for each class, transforming the frames belonging to the class into filter parameter vectors, which are extracted from the frames by applying a first mathematical transformation; for each class, computing a filter codebook based on... Agent: Finnegan, Henderson, Farabow, Garrett & Dunner LLP

20100057449 - Apparatus and method of enhancing quality of speech codec: An apparatus and method of improving the quality of a speech codec are provided. In the method, a first energy of a signal decoded by a core codec is calculated, and a second energy of a signal decoded by a low-band enhancement mode is calculated. Then, when the first energy... Agent: Ladas & Parry LLP

20100057451 - Distributed speech recognition using one way communication: A speech recognition client sends a speech stream and control stream in parallel to a server-side speech recognizer over a network. The network may be an unreliable, low-latency network. The server-side speech recognizer recognizes the speech stream continuously. The speech recognition client receives recognition results from the server-side recognizer in... Agent: Robert Plotkin, PC

20100057450 - Hybrid speech recognition: A hybrid speech recognition system uses a client-side speech recognition engine and a server-side speech recognition engine to produce speech recognition results for the same speech. An arbitration engine produces speech recognition output based on one or both of the client-side and server-side speech recognition results.... Agent: Robert Plotkin, PC

20100057452 - Speech interfaces: The described implementations relate to speech interfaces and in some instances to speech pattern recognition techniques that enable speech interfaces. One system includes a feature pipeline configured to produce speech feature vectors from input speech. This system also includes a classifier pipeline configured to classify individual speech feature vectors utilizing... Agent: Microsoft Corporation

20100057453 - Voice activity detection system and method: Discrimination between at least two classes of events in an input signal is carried out in the following way. A set of frames containing an input signal is received, and at least two different feature vectors are determined for each of said frames. Said at least two different feature vectors... Agent: Ibm Corporation

20100057454 - System and method for echo cancellation: An echo canceller for improved recognition and removal of an echo from a communication device. The echo canceller can dynamically reduce echo using an improved energy estimator and an improved adaptive filter. The improved energy estimator can determine if conversation is in a single talk period or a double talk... Agent: Qualcomm Incorporated

20100057458 - Image processing apparatus, image processing program and image processing method: Regarding audio data related to document data, an image processing apparatus pertaining to the present invention generates text data by using a speech recognition technology in advance, and determines delimiter positions in the text data and the audio data in correspondence. In a keyword search, if a keyword is detected... Agent: Buchanan, Ingersoll & Rooney PC

20100057455 - Method and system for 3d lip-synch generation with data-faithful machine learning: A method for generating three-dimensional speech animation is provided using data-driven and machine learning approaches. It utilizes the most relevant part of the captured utterances for the synthesis of input phoneme sequences. If highly relevant data are missing or lacking, then it utilizes less relevant (but more abundant) data and... Agent: Park Law Firm

20100057457 - Speech recognition system and program therefor: An unknown word is additionally registered in a speech recognition dictionary by utilizing a correction result, and a new pronunciation of the word that has been registered in a speech recognition dictionary is additionally registered in the speech recognition dictionary, thereby increasing the accuracy of speech recognition. The start time... Agent: Rankin, Hill & Clark LLP

20100057460 - Verbal labels for electronic messages: Verbal labels for electronic messages, as well as systems and methods for making and using such labels, are disclosed. A verbal label is a label containing audio data (such as a digital audio file of a user's voice and/or a speaker template thereof) that is associated with one or more... Agent: Morgan, Lewis & Bockius LLP/google

20100057459 - Voice recognition system for interactively gathering information to generate documents: A voice recognition system for interactively gathering information to generate a document, form, or application. An user establishes a connection with the voice recognition system and provides verbal responses to a plurality of verbal questions generated by voice recognition system to compile a document, form or application. The voice recognition... Agent: Fulbright & Jaworski, LLP

20100057456 - Voice response unit mapping: A system, method and program product for mapping voice response units (VRUs). A system is provided that includes: an interrogation system for interrogating a VRU and gathering a hierarchical set of options associated with the VRU; a map building system for converting the hierarchical set of options into a VRU... Agent: Ibm Corporation

20100057461 - Method and system for creating or updating entries in a speech recognition lexicon: In a method and a system (20) for creating or updating entries in a speech recognition (SR) lexicon (7) of a speech recognition system, said entries mapping speech recognition (SR) phoneme sequences to words, said method comprising entering a respective word, and in the case that the word is a... Agent: Wolf Greenfield & Sacks, P.C.

20100057462 - Speech recognition: The present invention relates to a method for speech recognition of a speech signal comprising the steps of providing at least one codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted such that higher weights are assigned to entries corresponding to frequencies below a... Agent: Sunstein Kann Murphy & Timbers LLP

20100057463 - System and method for generating natural language phrases from user utterances in dialog systems: Embodiments of a dialog system that employs a corpus-based approach to generate responses based on a given number of semantic constraint-value pairs are described. The system makes full use of the data from the user input to produce dialog system responses in combination with a template generator. The system primarily... Agent: Courtney Staniford & Gregory LLP

20100057466 - Method and apparatus for scrolling text display of voice call or message during video display session: A method and communication device disclosed includes displaying a video on a display, converting voice audio data to textual data by applying voice-to-text conversion, and displaying the textual data as scrolling text displayed along with the video on the display and either above, below or across the video. The method... Agent: Qualcomm Incorporated

20100057464 - System and method for variable text-to-speech with minimized distraction to operator of an automotive vehicle: A text-to-speech (TTS) system implemented in an automotive vehicle is dynamically tuned to increase intelligibility over a wide variety of vehicle operating states and environmental conditions by tuning characteristics of the synthesized voice in response to measured operating states. To decrease distractions to an operator of the vehicle, an embodiment... Agent: Capitol City Techlaw, PLLC

20100057465 - Variable text-to-speech for automotive application: A text-to-speech (TTS) system implemented in an automotive vehicle is dynamically tuned to improve intelligibility over a wide variety of vehicle operating states and environmental conditions. In one embodiment of the present invention, a TTS system is interfaced to one or more vehicle sensors to measure parameters including vehicle speed,... Agent: Capitol City Techlaw, PLLC

20100057467 - Speech synthesis with dynamic constraints: A method is disclosed for providing speech parameters to be used for synthesis of a speech utterance. In at least one embodiment, the method includes receiving an input time series of first speech parameter vectors, preparing at least one input time series of second speech parameter vectors consisting of dynamic... Agent: Harness, Dickey & Pierce, P.L.C

20100057468 - Binary-caching for xml documents with embedded executable code: A method, system and voice browser execute voice applications to perform a voice-based function. A document is retrieved and parsed to create a parse tree. Script code is created from the parse tree, thereby consuming part of the parse tree to create a reduced parse tree. The reduced parse tree... Agent: Christopher & Weisberg, P.A.

20100057469 - Method and system for ordering content using a voice menu system: A method and system for ordering content includes a voice menu system and a phone device communicating a phone signal to the voice menu system. The voice menu system determines the phone number associated with the phone device through the phone signal and generates a voice prompt for recording a... Agent: The Directv Group, Inc. Patent Docket Administration

20100057470 - System and method for voice-enabled media content selection on mobile devices: A system for voice-enabled location and execution for playback of media content selections stored on a media content playback device has a voice input circuitry for inputting voice-based commands into the playback device; codec circuitry for converting voice input from analog content to digital content for speech recognition and for... Agent: Stevens Law Group

20100057475 - Method and system for digital gain control in an audio codec: Aspects of a method and system for digital gain control in an audio CODEC are provided. In this regard, in a hardware audio CODEC, a plurality of gain values in decibel format may be generated and may be summed to generate an overall gain value in decibels. Gain values may... Agent: Mcandrews Held & Malloy, Ltd

20100057474 - Method and system for digital gain processing in a hardware audio codec for audio transmission: In a hardware audio CODEC which processes audio signals from a plurality of inputs, voltage and/or power levels of the input audio signals may be adjusted such that the digitally adjusted levels are approximately equal for each of the plurality of inputs. The digital adjustment may comprise, for each audio... Agent: Mcandrews Held & Malloy, Ltd

20100057473 - Method and system for dual voice path processing in an audio codec: Aspects of a method and system for dual voice path processing in an audio CODEC may enable selecting two or more signals received via one or more audio input devices, and filtering and down-sampling each of the selected signals via two or more signal processing branches. Furthermore, an output sample... Agent: Mcandrews Held & Malloy, Ltd

20100057472 - Method and system for frequency compensation in an audio codec: In a method and system for frequency compensation in an audio CODEC, a filter in a hardware audio CODEC may be configured based on power consumption and based on a frequency response of an active output device to which the filter is communicatively coupled. The filter may comprise a plurality... Agent: Mcandrews Held & Malloy, Ltd

20100057471 - Method and system for processing audio signals via separate input and output processing paths: Aspects of a method and system for processing audio signals via separate input and output processing paths are provided. In this regard, a hardware audio CODEC comprising one or more audio inputs and one or more audio outputs and may be enabled to route, via one or more switching elements,... Agent: Mcandrews Held & Malloy, Ltd

20100057476 - Signal bandwidth extension apparatus: A signal bandwidth extension apparatus includes a determination unit which determines whether or not a peak component of the input signal is lacked in the band to be extended, and a control unit which controls to extend the bandwidth when the determination unit determines that the peak component of the... Agent: Frishauf, Holtz, Goodman & Chick, PC

20100057477 - Method and system for multi-band amplitude estimation and gain control in an audio codec: Aspects of a method and system for multi-band amplitude estimation and gain control in an audio CODEC are provided. In this regard, an audio signal may be filtered and delayed to generate one or more sub-band signals, a gain may be applied to each sub-band signal to generate one or... Agent: Mcandrews Held & Malloy, Ltd

Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination


######

RSS FEED for 20130516: xml
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.

######

Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.



###

FreshPatents.com Support - Terms & Conditions

Results in 0.75431 seconds

PATENT INFO