Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents
Fresh Patents
Monitor Patents Patent Organizer File a Provisional Patent Browse Inventors Browse Industry Browse Agents Browse Locations




USPTO Class 704  |  Browse by Industry: Previous - Next | All     monitor keywords
06/2009 | Recent  |  09: Oct | Sept | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan |  | 08: Dec | Nov | Oct | Sp | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan |  | 07: Dec  | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan |  | 06: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | 

Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression inventions 06/09

Recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing format for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application.
  
06/25/2009 > patent applications in patent subcategories.

20090164206 - Method and apparatus for training a target language word inflection model based on a bilingual corpus, a tlwi method and apparatus, and a translation method and system for translating a source language text into a target language translation: The present invention provides a method and apparatus for training a target language word inflection (TLWI) model based on a bilingual corpus, a TLWI method and apparatus, and a translation method and system for translating a source language text into a target language translation. In the method for training a... Agent: Charles N.j. Ruggiero, Esq. Ohlandt, Greeley, Ruggiero & Perle, L.L.P.

20090164208 - Method and apparatus for aligning parallel spoken language corpora: The method for aligning parallel spoken language corpora comprises obtaining a statistics method and dictionaries-based word alignment set from the parallel spoken language corpora, aligning chunks of the parallel spoken language corpora by using the statistics method and dictionaries-based word alignment set, to obtain a chunk alignment set, and aligning... Agent: Charles N.j. Ruggiero, Esq. Ohlandt, Greeley, Ruggiero & Perle, L.L.P.

20090164207 - User device having sequential multimodal output user interace: In one aspect of the exemplary embodiments of this invention an apparatus includes a user interface that contains a plurality of input modalities and a plurality of output modalities, and a data processor coupled with the user interface and configurable to present a user with a content item that includes... Agent: Harrington & Smith, PC

20090164209 - Device and method for capturing and forwarding verbalized comments to a remote location: Disclosed is a device for sending a verbalized comment to a remote computer server. The device includes a processor that executes and operates the various software and hardware components. A microphone is utilized to record a comment. Temporary storage buffers the recorded comment. An auto-dialing application is utilized to automatically... Agent: Gregory Stephens Williams Mullen PC

20090164210 - Codebook sharing for lsf quantization: In accordance with one aspect of the invention, a selector supports the selection of a first encoding scheme or the second encoding scheme based upon the detection or absence of the triggering characteristic in the interval of the input speech signal. The first encoding scheme has a pitch pre-processing procedure... Agent: Farshad Farjami, Esq. Farjami & Farjami LLP

20090164211 - Speech encoding apparatus and speech encoding method: Provided is a voice encoding device for acquiring a satisfactory sound quality by making sufficient use of a tendency according to the noisiness or noiselessness of an input signal to be encoded. In this voice encoding device, a weight adding unit (206) in a searching loop (204) of a fixed... Agent: Greenblum & Bernstein, P.L.C

20090164212 - Systems, methods, and apparatus for multi-microphone based speech enhancement: Systems, methods, and apparatus for processing an M-channel input signal are described that include outputting a signal produced by a selected one among a plurality of spatial separation filters. Applications to separating an acoustic signal from a noisy environment are described, and configurations that may be implemented on a multi-microphone... Agent: Qualcomm Incorporated

20090164213 - Digital media recognition apparatus and methods: One of the embodiments of the invention includes a method of identifying illegal uses of copyright material. The steps of the method preferably include the steps of: (a) providing a primary digital media object, (b) associating an auxiliary construct with the object, (c) transforming the construct using at least one... Agent: Schox PLC

20090164214 - System, method and software program for enabling communications between customer service agents and users of communication devices: The present invention provides a system, method and software application for enabling a customer service agent to efficiently communicate with users of a communication device. When a user enters speech input into his communication device, the speech is converted to text, and the text is displayed to the customer service... Agent: Tina M. Lessani Lessani & Lessani LLP

20090164215 - Device with voice-assisted system: A device with a voice-assisted system is provided by using a voice command to adjust operations. The voice-assisted system includes a voice recognition engine and a control device. The voice recognition engine receives a voice command and outputting a voice signal based on the voice command to the control unit.... Agent: Jianq Chyun Intellectual Property Office

20090164216 - In-vehicle circumstantial speech recognition: A method of circumstantial speech recognition in a vehicle. A plurality of parameters associated with a plurality of vehicle functions are monitored as an indication of current vehicle circumstances. At least one vehicle function is identified as a candidate for user-intended ASR control based on user interaction with the vehicle.... Agent: General Motors Corporation C/o Reising, Ethington, Barnes, Kisselle, P.C.

20090164218 - Method and apparatus for uniterm discovery and voice-to-voice search on mobile device: A method, system and communication device for enabling uniterm discovery from audio content and voice-to-voice searching of audio content stored on a device using discovered uniterms. Received audio/voice input signal is sent to a uniterm discovery and search (UDS) engine within the device. The audio data may be associated with... Agent: Dillon & Yudell, LLP

20090164217 - Multiresolution searching: This invention relates to processing of audio files, and more specifically, to an improved technique of searching audio. More particularly, a method and system for processing audio using a multi-stage searching process is disclosed.... Agent: Occhiuti Rohlicek & Tsao, LLP

20090164219 - Accelerometer-based control of wearable devices: Accelerometer-based orientation and/or movement detection for controlling wearable devices, such as wrist-worn audio recorders and wristwatches. A wrist-worn audio recorder can use an accelerometer to detect the orientation and/or movement of a user's wrist and subsequently activate a corresponding audio-recorder function, for instance recording or playback. A wearable device with... Agent: Enbiomedic

20090164220 - Direct message playback and recording apparatus and method: A sound recording and playback apparatus and associated method, comprising: an audio storage medium; a microphone; a speaker; and a plurality of direct message access buttons, each direct message access button simultaneously associated both with a particular pre-recorded sound sequence stored in the storage medium, and with a particular new... Agent: Law Office Of Jay R. Yablon

20090164227 - Apparatus for processing media signal and method thereof: The present invention relates to an apparatus for processing a media signal and method thereof. A method of processing a media signal according to the present invention includes extracting a downmix signal from a bitstream, extracting at least one of first spatial information and second spatial information from the bitstream,... Agent: Fish & Richardson P.C.

20090164223 - Lossless multi-channel audio codec: A lossless audio codec segments audio data within each frame to improve compression performance subject to a constraint that each segment must be fully decodable and less than a maximum size. For each frame, the codec selects the segment duration and coding parameters, e.g., a particular entropy coder and its... Agent: Dts, Inc.

20090164224 - Lossless multi-channel audio codec: A lossless audio codec segments audio data within each frame to improve compression performance subject to a constraint that each segment must be fully decodable and less than a maximum size. For each frame, the codec selects the segment duration and coding parameters, e.g., a particular entropy coder and its... Agent: Dts, Inc.

20090164226 - Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The difference signal between the PCM signal and the lossy decoder output is lossless encoded, providing an extension bit stream. The invention facilitates enhancing... Agent: Thomson Licensing LLC

20090164225 - Method and apparatus of audio matrix encoding/decoding: A method to audio matrix encode/decode, which encode and decode audio signals of two or more channels into an audio signal of one or more channel while preserving the direction of a sound image includes extracting pieces of sound image information from audio signals of multi channels, encoding and allocating... Agent: Stanzione & Kim, LLP

20090164221 - Methods and apparatuses for encoding and decoding object-based audio signals: Provided are an audio encoding method and apparatus and an audio decoding method and apparatus in which audio signals can be encoded or decoded so that sound images can be localized at any desired position for each object audio signal. The audio decoding method includes extracting a downmix signal and... Agent: Fish & Richardson P.C.

20090164222 - Methods and apparatuses for encoding and decoding object-based audio signals: Provided are an audio encoding method and apparatus and an audio decoding method and apparatus in which audio signals can be encoded or decoded so that sound images can be localized at any desired position for each object audio signal. The audio decoding method includes extracting a downmix signal and... Agent: Fish & Richardson P.C.

  
06/18/2009 > patent applications in patent subcategories.

20090157379 - Language converter with enhanced search capability: A weighted search program is disclosed. The weighted search program may be integrated into a translation program, or the weighted search program may be used independently with an available search engine. When integrated with the translation program, setting and weighting may be combined in a single search. In one embodiment,... Agent: Duke W. Yee

20090157380 - Method and apparatus for providing hybrid automatic translation: The present invention provides a Korean-English hybrid automatic translation method for providing translation from Korean to English, includes: performing a morpheme analysis and a syntactic analysis on a Korean input source text; segmenting the Korean input source text into at least two source text segments, based on the results of... Agent: Ampacc Law Group

20090157381 - Web translation provider: A web translation server discovers a document address for a document. The document is accessed and parsed for text data in a first language. The parsed text data is translated into text data in a second language and stored in a database. A client accesses the document and sends a... Agent: Senniger Powers LLP (msft)

20090157382 - Decision-support expert system and methods for real-time exploitation of documents in non-english languages: A method for real-time exploitation of documents in non-English languages includes processing an input document in into a processed input document, extracting ontology elements from the processed input document to obtain a document digest (DD), statistically scoring each DD to obtain a DD with category scores, refining the DD and... Agent: Pearl Cohen Zedek Latzer, LLP

20090157383 - Voice query extension method and system: A voice query extension method and system. The voice query extension method includes: detecting voice activity of a user from an input signal and extracting a feature vector from the voice activity; converting the feature vector into at least one phoneme sequence and generating the at least one phoneme sequence;... Agent: Staas & Halsey LLP

20090157387 - Connected text data system: A connected text data system for efficiently and accurately translating connected text. The connected text data system includes inputting or receiving connected text, transmitting the connected text to a text iterator, scanning the connected text, identifying a plurality of words in the connected text, and translating the connected text to... Agent: Neustel Law Offices, Ltd.

20090157386 - Diagnostic evaluation of machine translators: A system for evaluating translation quality of a machine translator is discussed. The system includes a bilingual data generator configured to intermittently access a wide area network and generate a bilingual corpus from data received from the wide area network. The method also includes an example extraction component configured to... Agent: Westman Champlin (microsoft Corporation)

20090157385 - Inverse text normalization: Embodiments are directed to efficient multilingual inverse text normalization (ITN) of text in spoken form to produce normalized text for display. Embodiments are directed to preprocessing the multilingual text into a language-independent representation, tokenizing text in spoken form, segmenting the tokenized text into ITN items by grouping consecutive words using... Agent: Banner & Witcoff, Ltd.

20090157388 - Method and device for outputting information and/or status messages, using speech: In a method and device for outputting information and/or messages from at least one device using speech, the information and/or messages required for vocal output are provided in a voice memory, the information and/or messages are read by a processing device according to a demand, and the information and/or messages... Agent: Kenyon & Kenyon LLP

20090157384 - Semi-supervised part-of-speech tagging: A word is selected from a received text and features are identified from the word. The features are applied to a model to identify probabilities for sets of part-of-speech tags. The probabilities for the sets of part-of-speech tags are used to weight scores for possible part-of-speech tags for the selected... Agent: Microsoft Corporation

20090157389 - System and method for computerized psychological content analysis of computer and media generated communications to produce communications management support, indications and warnings of dangerous behavior, assessment of media images, and personnel select: At least one computer-mediated communication produced by or received by an author is collected and parsed to identify categories of information within it. The categories of information are processed with at least one analysis to quantify at least one type of information in each category. A first output communication is... Agent: Antonelli, Terry, Stout & Kraus, LLP

20090157390 - Method and apparatus for discovering and classifying polysemous word instances in web documents: A method and apparatus for discovering polysemous words and classifying polysemous words found in web documents. All document corpi in any natural language have words that have multiple usage contexts or words that have multiple meanings. Semantic analysis is not feasible for classifying all word occurrences in all documents on... Agent: Hickman Palermo Truong & Becker LLP/yahoo! Inc.

20090157391 - Extraction and matching of characteristic fingerprints from audio signals: An audio fingerprint is extracted from an audio sample, where the fingerprint contains information that is characteristic of the content in the sample. The fingerprint may be generated by computing an energy spectrum for the audio sample, resampling the energy spectrum logarithmically in the time dimension, transforming the resampled energy... Agent: Robert A. Hulse Fenwick & West LLP

20090157392 - Providing speech recognition data to a speech enabled device when providing a new entry that is selectable via a speech recognition interface of the device: The present invention discloses a solution for providing a phonetic representation for a content item along with a content item delivered to a speech enabled computing device. The phonetic representation can be specified in a manner that enables it to be added to a speech recognition grammar of the speech... Agent: Patents On Demand, P.A. Ibm-rsw

20090157393 - Encoding device and decoding device: An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency... Agent: Wenderoth, Lind & Ponack L.L.P.

20090157394 - System and method for frequency domain audio speed up or slow down, while maintaining pitch: Presented herein are system(s) and method(s) for frequency domain audio speed up or slow down, while maintaining pitch. An encoded audio signal is received. Frames from the encoded audio signal are retrieved. The frames of the audio signal are transformed into a frequency domain, wherein each of said frames are... Agent: Mcandrews Held & Malloy, Ltd

20090157395 - Adaptive codebook gain control for speech coding: In accordance with one aspect of the invention, a selector supports the selection of a first encoding scheme or the second encoding scheme based upon the detection or absence of the triggering characteristic in the interval of the input speech signal. The first encoding scheme has a pitch pre-processing procedure... Agent: Farshad Farjami, Esq. Farjami & Farjami LLP

20090157396 - Voice data signal recording and retrieving: Embodiments related to recording and retrieving of voice data signals are described and depicted.... Agent: Infineon Technologies Ag Patent Department

20090157397 - Voice rule-synthesizer and compressed voice-element data generator for the same: A voice rule-synthesizer synthesizes a voice waveform based on the voice data stored in a database, which stores a large number of compressed voice data sections in a data stream. Each voice data section is stored as a plurality of frames compressed in a fixed-length frame format. The storage capacity... Agent: Whitham, Curtis & Christofferson & Cook, P.C.

20090157398 - Method and apparatus for detecting noise: A method of and apparatus for detecting noise are provided. The method of detecting noise includes: receiving an input of a voice frame and converting the voice frame into a filter bank vector; converting the converted filter bank vector into band data; calculating a weight Gaussian mixture model (GMM) for... Agent: Staas & Halsey LLP

20090157399 - Apparatus and method for evaluating performance of speech recognition: An apparatus for evaluating the performance of speech recognition includes a speech database for storing N-number of test speech signals for evaluation. A speech recognizer is located in an actual environment and executes the speech recognition of the test speech signals reproduced using a loud speaker from the speech database... Agent: Ampacc Law Group

20090157400 - Speech recognition system and method with cepstral noise subtraction: The invention relates to a speech recognition system and method with cepstral noise subtraction. The speech recognition system and method utilize a first scalar coefficient, a second scalar coefficient, and a determining condition to limit the process for the cepstral feature vector, so as to avoid excessive enhancement or subtraction... Agent: Connolly Bove Lodge & Hutz, LLP

20090157401 - Semantic decoding of user queries: An intelligent query system for processing voiced-based queries is disclosed, which uses semantic based processing to identify the question posed by the user by understanding the meaning of the user's utterance. Based on identifying the meaning of the utterance, the system selects a single answer that best matches the user's... Agent: J. Nicholas Gross, Attorney

20090157403 - Human speech recognition apparatus and method: A speech recognition apparatus generates a feature vector series corresponding to a speech signal, and recognizes a phoneme series corresponding to the feature vector series using sounds corresponding to phonemes and a phoneme language model. In addition, the speech recognition apparatus recognizes vocabulary that corresponds to the recognized phoneme series.... Agent: Staas & Halsey LLP

20090157402 - Method of constructing model of recognizing english pronunciation variation: A method of constructing a model of recognizing English pronunciation variations is used to recognize English pronunciations with different intonations influenced by native languages. The method includes collecting a plurality of sound information corresponding to English expressions; corresponding phonetic alphabets of the native language and English of a region to... Agent: Morris Manning Martin LLP

20090157404 - Grammar weighting voice recognition information: A device receives a voice recognition statistic from a voice recognition application and applies a grammar improvement rule based on the voice recognition statistic. The device also automatically adjusts a weight of the voice recognition statistic based on the grammar improvement rule, and outputs the weight adjusted voice recognition statistic... Agent: Verizon Patent Management Group

20090157405 - Using partial information to improve dialog in automatic speech recognition systems: A method, system and computer readable device for recognizing a partial utterance in an automatic speech recognition (ASR) system where said method comprising the steps of, receiving, by a ASR recognition unit, an input signal representing a speech utterance or word and transcribing the input signal into text, interpreting, by... Agent: Scully, Scott, Murphy & Presser, P.C.

20090157406 - Acoustic signal transmission method and acoustic signal transmission apparatus: The acoustic signal transmission method is based on generating a synthesized sound electrical signal by electrically synthesizing an audible sound signal and another signal different than the audible sound signal at the sending side, and transmitting the synthesized sound electrical signal, and extracting the another signal different than the audible... Agent: Robert E. Krebs, Esq. Burns, Doane, Swecker & Mathis, L.l.p

20090157409 - Method and apparatus for training difference prosody adaptation model, method and apparatus for generating difference prosody adaptation model, method and apparatus for prosody prediction, method and apparatus for speech synthesis: A method includes, generating, for each parameter of the prosody vector, an initial parameter prediction model with a plurality of attributes related to difference prosody prediction and at least part of attribute combinations of the plurality of attributes, in which each of the plurality of attributes and the attribute combinations... Agent: Charles N.j. Ruggiero, Esq. Ohlandt, Greeley, Ruggiero & Perle, L.L.P.

20090157407 - Methods, apparatuses, and computer program products for semantic media conversion from source files to audio/video files: An apparatus for semantic media conversion from source data to audio/video data may include a processor. The processor may be configured to parse source data having text and one or more tags and create a semantic structure model representative of the source data, and generate audio data comprising at least... Agent: Alston & Bird LLP

20090157408 - Speech synthesizing method and apparatus: The present invention relates to a speech synthesizing method and apparatus based on a hidden Markov model (HMM). Among code words that are obtained by quantizing speech parameter instances for each state of an HMM model, a code word closest to a speech parameter generated from an input text using... Agent: Ampacc Law Group

20090157410 - Speech translating system: Disclosed is a speech translating system for translating speech from a first language to a language selected from a set of second languages. The system includes an input unit, a processor, and an output unit. The input unit is capable of receiving the speech in the first language. The processor... Agent: Jay M. Schloff Intellipex PLLC

20090157412 - Method for streaming through a data service over a radio link subsystem: An apparatus for controlling a data rate in a data client for a digital audio broadcasting system includes a buffer for storing data, a codec for coding data, and a control module for controlling a bit rate of the codec in response to a level of the data in the... Agent: Pietragallo Gordon Alfano Bosick & Raspanti LLP

20090157411 - Methods and apparatuses for encoding and decoding object-based audio signals: An audio encoding method and apparatus and an audio decoding method and apparatus are provided. The audio signal decoding method includes extracting a downmix signal and object-based side information from an audio signal; generating a modified downmix signal based on the downmix signal and extracted information which is extracted from... Agent: Fish & Richardson P.C.

20090157413 - Speech encoding apparatus and speech encoding method: There is provided an audio encoding device capable of maintaining continuity of spectrum energy and preventing degradation of audio quality even when a spectrum of a low range of an audio signal is copied at a high range a plurality of times. The audio encoding device (100) includes: an LPC... Agent: Greenblum & Bernstein, P.L.C

  
06/11/2009 > patent applications in patent subcategories.

20090150139 - Method and apparatus for translating a speech: There is provided a method for translating a speech, includes recognizing the speech into a text which includes a long sentence containing a plurality of simple sentences, segmenting the long sentence into the simple sentences, and translating each simple sentence into a sentence of a target language. A long sentence... Agent: Charles N.j. Ruggiero, Esq. Ohlandt, Greeley, Ruggiero & Perle, L.L.P.

20090150142 - Behavior determination apparatus and method, behavior learning apparatus and method, robot apparatus, and medium recorded with program: A robot includes a knowledge acquisition unit for extracting words from external instruction information, a network construction unit for constructing a network from the extracted words and updating weightings between the words, and a behavior determination unit for determining a behavior on the basis of a word network in which... Agent: Kenyon & Kenyon LLP

20090150140 - Efficient stemming of semitic languages: A system for stemming words of Semitic languages, the system including an affix scanner configured to scan a word of a Semitic language for at least one affix according to a predefined scanning sequence and determine if at least one predefined scanning criterion is met, and a stemmer configured to... Agent: Sughrue Mion PLLC Uspto Customer No With Ibm/svl

20090150141 - Method and system for learning second or foreign languages: The present invention provides a method for providing linguistically interesting terms to a user, the method comprising processing a received digital text by a natural language processing technology, and then comparing the processed digital text with a linguistically interesting term database with a plurality of predetermined linguistically interesting terms. When... Agent: Raymond R. Moser Jr., Esq. MoserIPLaw Group

20090150143 - Mdct domain post-filtering apparatus and method for quality enhancement of speech: A post-filtering apparatus and method for speech enhancement in a modified discrete cosine transform (MDCT) domain are disclosed. In the apparatus and method, previous and current MDCT coefficients are used for obtaining a speech spectrum coefficient similar to a real speech spectrum, and a convex function is used for transforming... Agent: Staas & Halsey LLP

20090150145 - Learning word segmentation from non-white space languages corpora: Illustrative embodiments provide a computer implemented method, apparatus, and computer program product for learning word segmentation from non-white space language corpora. In one illustrative embodiment, the computer implemented method receives text input characters and calculates a ratio-measure for each pair of characters in the input characters. The computer implemented method... Agent: Duke W. Yee

20090150146 - Microphone array based speech recognition system and target speech extracting method of the system: A microphone-array-based speech recognition system using a blind source separation (BBS) and a target speech extraction method in the system are provided. The speech recognition system performs an independent component analysis (ICA) to separate mixed signals input through a plurality of microphone into sound-source signals, extracts one target speech spoken... Agent: Ampacc Law Group

20090150144 - Robust voice detector for receive-side automatic gain control: A voice detector improves voice output quality. The voice detector may be incorporated into a cellphone, hands-free car phone, or any other device that provides voice output. The voice detector provides excellent voice output quality even when signal dropouts and other significant signal artifacts are present in the received signal.... Agent: Harman - Brinks Hofer Chicago Brinks Hofer Gilson & Lione

20090150147 - Recording audio metadata for stored images: A method of processing audio signals recorded during display of image data from a media file on a display device to produce semantic understanding data and associating such data with the original media file, includes: separating a desired audio signal from the aggregate mixture of audio signals; analyzing the separated... Agent: Frank Pincelli Patent Legal Staff

20090150148 - Voice recognition apparatus and memory product: A voice recognition apparatus can reduce false recognition caused by matching with respect to the phrases composed of a small number of syllables, when it performs a recognition process, by a pronunciation unit, for voice data based on voice produced by a speaker such as a syllable and further performs... Agent: Staas & Halsey LLP

20090150151 - Audio processing apparatus, audio processing system, and audio processing program: Disclosed herein is an audio processing apparatus for processing a plurality of pieces of audio data of sounds picked up by a plurality of microphones. The apparatus includes: a speaker identification section configured to identify a speaker based on the audio data; a simultaneous speech section identification section configured to,... Agent: Lerner, David, Littenberg, Krumholz & Mentlik

20090150149 - Identifying far-end sound: Frames containing audio data may be received, the audio data having been derived from a microphone array, at least some of the frames containing residual acoustic echo after having acoustic echo partially removed therefrom. Probability distribution functions are determined from the frames of audio data. A probability distribution function comprises... Agent: Microsoft Corporation

20090150150 - System and method for controlling access to a handheld device by validating voice sounds: A method for controlling access to a handheld device (10) by validating voice sounds includes: setting voice characteristics acceptable error margin; storing voice characteristics of the original voice sounds of a user in a memory (12) of the handheld; recording validation voice sounds of the user through a microphone (11)... Agent: PCe Industry, Inc. Att. Steven Reiss

20090150153 - Grapheme-to-phoneme conversion using acoustic data: Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of acoustics and graphonemes (acoustic data, phonemes sequences, grapheme sequences and an alignment between phoneme sequences and grapheme sequences) is described,... Agent: Microsoft Corporation

20090150152 - Method and apparatus for fast search in call-center monitoring: A method and apparatus for indexing one or more audio signals using a speech to text engine and a phoneme detection engine, and generating a combined lattice comprising a text part and a phoneme part. A word to be searched is searched for in the text part, and if not... Agent: Ohlandt, Greeley, Ruggiero & Perle, LLP

20090150154 - Method and system of generating and detecting confusing phones of pronunciation: A method of generating and detecting confusing phones/syllables is disclosed. The method includes a generating stage and a detecting stage. The generating stage includes: (a) input a Mandarin utterance; (b) partition the Mandarin utterance into segmented phones/syllables and generate the most likely route in a recognition net via Forced Alignment... Agent: Joe Mckinney Muncy

20090150155 - Keyword extracting device: The present invention aims at extracting a keyword of conversation without preparations by advanced anticipation of keywords of conversation. A keyword extracting device of the present invention includes an audio input section 101 by way of which a speech sound made by a speaker is input; a speech segment determination... Agent: Pearne & Gordon LLP

20090150156 - System and method for providing a natural language voice user interface in an integrated voice navigation services environment: A conversational, natural language voice user interface may provide an integrated voice navigation services environment. The voice user interface may enable a user to make natural language requests relating to various navigation services, and further, may interact with the user in a cooperative, conversational dialogue to resolve the requests. Through... Agent: Pillsbury Winthrop Shaw Pittman, LLP

20090150157 - Speech processing apparatus and program: A word dictionary including sets of a character string which constitutes a word, a phoneme sequence which constitutes pronunciation of the word and a part of speech of the word is referenced, an entered text is analyzed, the entered text is divided into one or more subtexts, a phoneme sequence... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, P.C.

20090150158 - Portable networked picting device: A portable picting device automatically converts an audio signal from a microphone into a digital data stream, parses a series of words from the digital data stream, and detects any words that match tags in a tag/image database. An image corresponding to the matching tag(s) is then retrieved and transmitted... Agent: Ibm Corporation (jvm)

20090150160 - Systems and methods of performing speech recognition using gestures: Embodiments of the present invention improve methods of performing speech recognition using human gestures. In one embodiment, the present invention includes a speech recognition method comprising detecting a gesture, selecting a first recognition set based on the gesture, receiving a speech input signal, and recognizing the speech input signal in... Agent: Fountainhead Law Group, PC

20090150159 - Voice searching for media files: A consumer electronic device has a controller, a speech processing circuit, and a memory to store media files such as audio or video files. The device allows the user to use his or her voice to fast-forward or rewind through the media file to a desired position. Particularly, the device... Agent: Coats & Bennett/sony Ericsson

20090150165 - Encoding and detecting apparatus: An encoding data processing apparatus generates a marked version of an audio signal provided on an audio channel. The marked copy is generated by embedding data representative of a payload data word into the audio signal. The encoding data processing apparatus comprises a code word generator operable to generate a... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, P.C.

20090150163 - Method and apparatus for multichannel upmixing and downmixing: Loudspeakers in domestic or automotive environments are rarely placed ideally with respect to the sources supplying them, and the stereo and surround images are seldom satisfying. According to the invention there is provided a method and apparatus for combining a precise knowledge about the relative positions of the loudspeakers that... Agent: Stites & Harbison PLLC

20090150162 - Stereo encoding apparatus, stereo decoding apparatus, and their methods: A stereo audio encoding apparatus capable of preventing degradation of the sound quality of a decoded signal, while reducing the encoding bit rate. In the apparatus, a spatial information analyzing part (101) analyzes the spatial information for each of L and R channel signals. A similarity raising part (102) corrects,... Agent: Greenblum & Bernstein, P.L.C

20090150161 - Synchronizing parametric coding of spatial audio with externally provided downmix: Embodiments of the present invention are directed to a binaural cue coding (BCC) scheme in which an externally provided audio signal (e.g., a studio engineering audio signal) is transmitted, along with derived cue codes, to a receiver instead of an automatically downmixcd audio signal. The cue codes are (adaptively) synchronized... Agent: Mendelsohn & Associates, P.C.

20090150164 - Tri-model audio segmentation: Apparatus, methods, and machine readable media that segment audio streams based upon application of three models to the audio stream are disclosed. One method includes extracting audio features from an audio stream and identifying a set of candidate change points between segments of the audio stream based upon the extracted... Agent: Barnes & Thornburg, LLP

  
06/04/2009 > patent applications in patent subcategories.

20090144047 - Methods involving translating text emphasis: An exemplary method for translating emphasis in text, the method comprising, receiving text in a first language, determining a first emphasis associated with the first language in the text, comparing the first emphasis with emphases associated with a second language to determine a second emphasis associated with the second language... Agent: Cantor Colburn LLP - IBM Lotus

20090144048 - Method and device for instant translation: Digital translating, using computer program, internet site or mobile phone, for use by one or more people, involving two or more languages, one after the other or simultaneously. Using technologies including voice recognition, language recognition, voice activation, voice-voice translation and voice-text translation. Automatic language recognition and thereby automatic translation to... Agent: Uzi Ezra Havosha & Partners

20090144049 - Method and system for adaptive transliteration: A system and method for transliteration between two different character-based languages is provided. In some embodiments, the system and method provide transliteration from the Arabic language into Roman-based languages such as English. In some embodiments this system and method allows a user to more easily produce Arabic text on English... Agent: Pepper Hamilton LLP

20090144050 - System and method for augmenting spoken language understanding by correcting common errors in linguistic performance: A method and system for automatic speech recognition are disclosed. The method comprises receiving speech from a user, the speech including at least one speech error, increasing the probabilities of closely related words to the at least one speech error and processing the received speech using the increased probabilities. A... Agent: At & T Legal Department - Ndq

20090144052 - Method and system for providing conversation dictionary services based on user created dialog data: A method for providing a conversation dictionary service includes the steps of: (a) receiving a request for editing conversation expressions from a user; (b) providing the user with a format page for editing the expressions; (c) creating expression data by connecting the information inputted in a conversation entry window and... Agent: Edwards Angell Palmer & Dodge LLP

20090144051 - Method of providing personal dictionary: Disclosed is a method of using a computerized dictionary for building an electronic word list. The method can include accessing a computerized dictionary, opening an entry of the dictionary for a word, the entry comprising a listing of a plurality of descriptions of the word, selecting a first one of... Agent: Knobbe Martens Olson & Bear LLP

20090144053 - Speech processing apparatus and speech synthesis apparatus: An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data. The spectral envelope information does not have a spectral fine structure. A basis storage unit stores N bases (L>N>1). Each basis is differently a frequency band having a maximum as a peak frequency in... Agent: Oblon, Spivak, Mcclelland Maier & Neustadt, P.C.

20090144054 - Embedded system to perform frame switching: The present patent discloses an embedded transient detection module, which improves the quality of the audio encoder, at the same time requires less computational power, as compared to existing schemes. This module uses a long frame, when the input audio signal is in steady state, while a short frame is... Agent: SprinkleIPLaw Group

20090144055 - Audio coding system using temporal shape of a decoded signal to adapt synthesized spectral components: A receiver in an audio coding system receives a signal conveying frequency subband signals representing an audio signal. The subband signals are examined to assess one or more characteristics of the audio signal including temporal shape. Spectral components are synthesized having the one or more assessed characteristics, integrated with the... Agent: Gallagher & Lathrop, A Professional Corporation

20090144056 - Method and computer program product for generating recognition error correction information: A method for providing recognition error correction information, the method includes: obtaining metadata associated with a capture of a media item; and generating recognition error correction information in response to the metadata. The recognition error correction information is to be used in a recognition process selected out of a list... Agent: Ibm Corporation, T.j. Watson Research Center

20090144057 - Method, apparatus, and program for certifying a voice profile when transmitting text messages for synthesized speech: A mechanism is provided for authenticating and using a personal voice profile. The voice profile may be issued by a trusted third party, such as a certification authority. The personal voice profile may include information for generating a digest or digital signature for text messages. A speech synthesis system may... Agent: Ibm Corp (ya) C/o Yee & Associates PC

20090144058 - Restoration of high-order mel frequency cepstral coefficients: A method for estimating high-order Mel Frequency Cepstral Coefficients, the method comprising initializing any of N-L high-order coefficients (HOC) of an MFCC vector of length N having L low-order coefficients (LOC) to a predetermined value, thereby forming a candidate MFCC vector, synthesizing a speech signal frame from the candidate MFCC... Agent: Ibm Corporation, T.j. Watson Research Center

20090144059 - High performance hmm adaptation with joint compensation of additive and convolutive distortions: A method of compensating for additive and convolutive distortions applied to a signal indicative of an utterance is discussed. The method includes receiving a signal and initializing noise mean and channel mean vectors. Gaussian dependent matrix and Hidden Markov Model (HMM) parameters are calculated or updated to account for additive... Agent: Microsoft Corporation

20090144060 - System and method for generating a web podcast service: Disclosed is a system and method for generating a web podcast interview that allows a single user to create his own multi-voices interview from his computer. The method allows the user to enter a set of questions from a text file using a text editor. (Answers may also be entered... Agent: Law Offices Of Ira D. Blecker, P.C.

20090144061 - Systems and methods for generating verbal feedback messages in head-worn electronic devices: Systems and methods for generating and providing verbal feedback messages to wearers of man-machine interface (MMI)-enabled head-worn electronic devices. An exemplary head-worn electronic device includes an MMI and an acoustic signal generator configured to provide verbal acoustic messages to a wearer of the head-worn electronic device in response to the... Agent: Plantronics, Inc.IPDepartment/legal

20090144063 - Method and apparatus for control of randering multiobject or multichannel audio signal using spatial cue: The present research relates to controlling rendering of multi-object or multi-channel audio signals. The present research provides a method and apparatus for controlling rendering of multi-object or multi-channel audio signals based on spatial cues in a process of decoding the multi-object or multi-channel audio signals. To achieve the purpose, the... Agent: Ladas & Parry LLP

20090144062 - Method and apparatus to facilitate provision and use of an energy value to determine a spectral envelope shape for out-of-signal bandwidth content: One provides (101) a digital audio signal having a corresponding signal bandwidth, and then provides (102) an energy value that corresponds to at least an estimate of out-of-signal bandwidth energy as corresponds to that digital audio signal. One then uses (103) the energy value to simultaneously determine both a spectral... Agent: Motorola/fetf

20090144064 - Local pitch control based on seamless time scale modification and synchronized sampling rate conversion: This invention locally controls the pitch of speech and audio signals. The invention is based on a seamless time scale modification (S-TSM) scheme connected to a synchronized sampling rate converter that switches between different time scale factors in a seamless manner and controls pitch during playback in a nearly continuous... Agent: Texas Instruments Incorporated

Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination


######

RSS FEED for 20091112: - PDF
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.

######

Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.



###

FreshPatents.com Support

Results in 1.04099 seconds

filepatents (1K)

* Easy, fast online form
* Protect your Inventions
* US Patent Office filing

Provisional Patent
Utility Patent

- - - - - - - - - - - - - - - - - - - - - -

filetrademarks (1K)

* Fast online form
* Protect your Name/Design
* US Government filing

Trademark Services

- - - - - - - - - - - - - - - - - - - - - -

PATENT INFO