|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
09/2011 | Recent | 13: May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression September class, title,number 09/11Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 09/29/2011 > 24 patent applications in 11 patent subcategories. class, title,number
20110238404 - General digital semantic database for mechanical language translation: A general digital semantic database for mechanical language translation is provided. The database has vocabulary decomposed to part of speech characteristics and semantic characteristics to form inseparable basic semantic points. The vocabulary is regularly ordered according to classes of the semantic characteristic, part of speech characteristic, background and grammatical relation.... Agent:
20110238405 - A translation method and a device, and a headset forming part of said device: A translation method and device (1) enabling a first individual speaking in a first language to converse with a second individual speaking in a second language different from the first language, the device includes translation element (30) translating the first words of the first individual into the second language and... Agent:
20110238406 - Messaging system with translation and method of operation thereof: A method of operation of a messaging system includes: receiving a source message; identifying a phrase of the source message; searching a translation hierarchy for the phrase, the translation hierarchy having multiple dictionaries in a priority order; and translating a target message, for displaying on a device, from the source... Agent: Telenav, Inc.
20110238407 - Systems and methods for speech-to-speech translation: Disclosed herein are systems and methods for receiving an input speech sample in a first language and outputting a translated speech sample in a second language in the unique voice of a user. According to several embodiments, a translation system includes a training mode for developing a voice recognition database... Agent: O3 Technologies, LLC
20110238411 - Document proofing support apparatus, method and program: According to one embodiment, a document proofing support apparatus includes an input unit, an analysis unit, a detection unit, a database unit, a retrieval unit, and a display unit. The input unit is configured to receive input of one of at least one proof document and at least one entry... Agent: Kabushiki Kaisha Toshiba
20110238408 - Semantic clustering: Semantic clustering techniques are described. In various implementations, a conversational agent is configured to perform semantic clustering of a corpus of user utterances. Semantic clustering may be used to provide a variety of functionality, such as to group a corpus of utterances into semantic clusters in which each cluster pertains... Agent:
20110238409 - Semantic clustering and conversational agents: Semantic clustering techniques are described. In various implementations, a conversational agent is configured to perform semantic clustering of a corpus of user utterances. Semantic clustering may be used to provide a variety of functionality, such as to group a corpus of utterances into semantic clusters in which each cluster pertains... Agent:
20110238410 - Semantic clustering and user interfaces: Semantic clustering techniques are described. In various implementations, a conversational agent is configured to perform semantic clustering of a corpus of user utterances. Semantic clustering may be used to provide a variety of functionality, such as to group a corpus of utterances into semantic clusters in which each cluster pertains... Agent:
20110238413 - Domain dictionary creation: Methods, systems, and apparatus, including computer program products, to identify topic words in a collection of documents that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on a document collection and the topic document collection is determined. A candidate topic word divergence... Agent: Google Inc.
20110238412 - Method for constructing pronunciation dictionaries: Embodiments of the invention disclose a system and a method for constructing a pronunciation dictionary by transforming an unaligned entry to an aligned entry. The unaligned entry and the aligned entry include a set of words and a set of pronunciations corresponding to the set of words. The method aligns... Agent:
20110238414 - Telephony service interaction management: A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The method also includes translating speech input to one or more DTMF tones and communicating the one or more DTMF tones to... Agent: Microsoft Corporation
20110238415 - Hybrid speech recognition: A hybrid speech recognition system uses a client-side speech recognition engine and a server-side speech recognition engine to produce speech recognition results for the same speech. An arbitration engine produces speech recognition output based on one or both of the client-side and server-side speech recognition results.... Agent:
20110238416 - Acoustic model adaptation using splines: Described is a technology by which a speech recognizer is adapted to perform in noisy environments using linear spline interpolation to approximate the nonlinear relationship between clean speech, noise, and noisy speech. Linear spline parameters that minimize the error the between predicted noisy features and actual noisy features are learned... Agent: Microsoft Corporation
20110238418 - Method and device for tracking background noise in communication system: A method and a device for tracking background noise in a communication system, where the method includes: calculating a SNR of a current frame according to input audio signals; increasing a frame counter, and calculating tone features and signal steadiness features of the current frame if the SNR of the... Agent: Huawei Technologies Co., Ltd.
20110238417 - Speech detection apparatus: According to one embodiment, a speech detection apparatus includes a first acoustic signal analyzing unit configured to analyze a frequency spectrum of a first acoustic signal, and a feature extracting unit configured to remove a frequency spectrum of the first acoustic signal from a third acoustic signal, which is obtained... Agent: Kabushiki Kaisha Toshiba
20110238419 - Binaural method and binaural configuration for voice control of hearing devices: A binaural configuration and an associated method have/utilize first and second hearing devices for the voice control of the hearing devices by voice commands. The configuration contains a first voice recognition module in the first hearing device and a second voice recognition module in the second hearing device. The second... Agent: Siemens Medical Instruments Pte. Ltd.
20110238420 - Method and apparatus for editing speech, and method for synthesizing speech: According to one embodiment, a method for editing speech is disclosed. The method can generate speech information from a text. The speech information includes phonologic information and prosody information. The method can divide the speech information into a plurality of speech units, based on at least one of the phonologic... Agent: Kabushiki Kaisha Toshiba
20110238421 - Speech output device, control method for a speech output device, printing device, and interface board: A speech output device, a control method for a speech output device, a printer, and an interface board can improve the productivity of foreign language speaking workers in industries such as retailing and food services. A data communication unit 191 acquires print data. A data interpreter 193 analyzes and converts... Agent: Seiko Epson Corporation
20110238422 - Method for sonic document classification: A method to identify and classify a document (5) by weight or thickness based on the sound the document makes while moving through a document transport (30). Using an audio transducer (20), the sound of the document is captured and compared to previously saved and stored characteristics of various weighted... Agent:
20110238423 - Sonic document classification: An apparatus for classifying documents (5) based on sound includes a document transport (30) for transporting a document; an audio transducer (20) for detecting a sonic profile produced by the document as it is transported; and a controller for determining document characteristics based on the sonic profile.... Agent:
20110238426 - Audio decoder, audio encoder, method for decoding an audio signal, method for encoding an audio signal, computer program and audio signal: An audio decoder for providing a decoded audio information on the basis of an entropy encoded audio information includes a context-based entropy decoder configured to decode the entropy-encoded audio information in dependence on a context, which context is based on a previously-decoded audio information in a non-reset state-of-operation. The context-based... Agent:
20110238424 - Method and apparatus for encoding and decoding excitation patterns from which the masking levels for an audio signal encoding and decoding are determined: For the quantisation of spectral data in an audio transform encoder psycho-acoustic information is required, i.e. an approximation of the true masking threshold. According to the invention, for each spectrum to be quantised in the audio signal encoding, an excitation pattern is computed and coded for both long and short... Agent: Thomson Licensing
20110238425 - Multi-resolution switched audio encoding/decoding scheme: An audio encoder for encoding an audio signal has a first coding branch, the first coding branch comprising a first converter for converting a signal from a time domain into a frequency domain. Furthermore, the audio encoder has a second coding branch comprising a second time/frequency converter. Additionally, a signal... Agent:
20110238427 - Signal classification processing method, classification processing device, and encoding system: A signal classification processing method, a classification processing device, and an encoding system are provided. The signal classification processing method includes: obtaining a high band input signal; determining a signal type of the obtained high band input signal according to a time domain characteristic parameter and/or a frequency domain characteristic... Agent: Huawei Technologies Co., Ltd.09/22/2011 > 17 patent applications in 11 patent subcategories. class, title,number
20110231180 - Multi-language closed captioning: A computer server receives, from a remote device, a request for closed caption data, the request specifying media content for which the closed caption data is to be provided. In the server, it is determined whether closed-captioned data for the specified media content is available, and if closed-captioned data for... Agent: Verizon Patent And Licensing Inc.
20110231181 - Web translation provider: A web translation server discovers a document address for a document. The document is accessed and parsed for text data in a first language. The parsed text data is translated into text data in a second language and stored in a database. A client accesses the document and sends a... Agent: Microsoft Corporation
20110231183 - Language model creation device: This device 301 stores a first content-specific language model representing a probability that a specific word appears in a word sequence representing a first content, and a second content-specific language model representing a probability that the specific word appears in a word sequence representing a second content. Based on a... Agent: Nec Corporation
20110231182 - Mobile systems and methods of supporting natural language human-machine interactions: A mobile system is provided that includes speech-based and non-speech-based interfaces for telematics applications. The mobile system identifies and uses context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for users that submit requests and/or commands in multiple domains. The invention creates, stores and... Agent: Voicebox Technologies, Inc.
20110231184 - Correlation of transcribed text with corresponding audio: In one embodiment, a method includes receiving at a communication device an audio communication and a transcribed text created from the audio communication, and generating a mapping of the transcribed text to the audio communication independent of transcribing the audio. The mapping identifies locations of portions of the text in... Agent: Cisco Technology, Inc.
20110231185 - Method and apparatus for blind signal recovery in noisy, reverberant environments: A maximum-kurtosis, distortionless response (MKDR) technique and an extension, the maximum-kurtosis, Wiener estimate (MKWE) technique, are provided. In one form, blind estimates of the speech source's channel response are made from the microphone data and MVDR is applied. The source direction is estimated by finding weights that maximize output kurtosis,... Agent:
20110231186 - Speech detection method: A speech detection method is presented, which includes the following steps. A first voice captured device samples a first signal and a second voice captured device samples a second signal. The first voice captured device is closer to a speech signal source than the second voice captured device. A first... Agent: Issc Technologies Corp.
20110231187 - Voice processing device, voice processing method and program: A voice processing device includes a zone detection unit which detects a voice zone including a voice signal or a non-steady sound zone including a non-steady signal other than the voice signal from an input signal and a filter calculation unit that calculates a filter coefficient for holding the voice... Agent:
20110231188 - System and method for providing an acoustic grammar to dynamically sharpen speech interpretation: The system and method described herein may provide an acoustic grammar to dynamically sharpen speech interpretation. In particular, the acoustic grammar may be used to map one or more phonemes identified in a user verbalization to one or more syllables or words, wherein the acoustic grammar may have one or... Agent: Voicebox Technologies, Inc.
20110231190 - Method of and system for providing adaptive respondent training in a speech recognition application: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a... Agent: Eliza Corporation
20110231189 - Methods and apparatus for extracting alternate media titles to facilitate speech recognition: Techniques for generating a set of one or more alternate titles associated with stored digital media content and updating a speech recognition system to enable the speech recognition system to recognize the set of alternate titles. The system operates on an original media title to extract a set of alternate... Agent: Nuance Communications, Inc.
20110231191 - Weight coefficient generation device, voice recognition device, navigation device, vehicle, weight coefficient generation method, and weight coefficient generation program: A weight coefficient generation device, a speech recognition device, a navigation system, a vehicle, a vehicle coefficient generation method, and a weight coefficient generation program are provided for the purpose of improving a speech recognition performance of place names. In order to address the above purpose, an address database 12... Agent:
20110231193 - Synthesized singing voice waveform generator: Various technologies for generating a synthesized singing voice waveform. In one implementation, the computer program may receive a request from a user to create a synthesized singing voice using the lyrics of a song and a digital file containing its melody as inputs. The computer program may then dissect the... Agent: Microsoft Corporation
20110231192 - System and method for audio content generation: A system and method for generating audio content. Content is automatically retrieved from an original website according to a predetermined schedule to generate retrieved content. The retrieved content is converted to one or more audio file. A hierarchy is assigned to the one or more audio files to provide an... Agent:
20110231194 - Interactive speech preparation: In an embodiment, a method of interactive speech preparation is disclosed. The method may include or comprise displaying an interactive speech application on a display device, wherein the interactive speech application has a text display window. The method may also include or comprise accessing text stored in an external storage... Agent:
20110231196 - Dual-mode encoder, system including same, and method for generating infra-red signals: A dual-mode encoder. The encoder includes a logic device. The logic device includes a first input terminal for receiving a signal associated with an audio source, a second input terminal for receiving a mode instruction signal, and an output terminal for outputting an encoded signal. The logic device is configured... Agent: Unwired Technology LLC
20110231195 - High-frequency bandwidth extension in the time domain: A system extends the high-frequency spectrum of a narrowband audio signal in the time domain. The system extends the harmonics of vowels by introducing a non linearity in a narrow band signal. Extended consonants are generated by a random-noise generator. The system differentiates the vowels from the consonants by exploiting... Agent:09/15/2011 > 30 patent applications in 21 patent subcategories. class, title,number
20110224967 - Method and apparatus for automatically magnifying a text based image of an object: Method and apparatus for capturing a text based source image 5, 5A provided on an object 3 supported on a surface. Positioned above the object 3 is a camera 7 for capturing a view of the text based image 5, 5A. The camera 7, through lens 9 generates a focused... Agent:
20110224969 - Method, a media server, computer program and computer program product for combining a speech related to a voice over ip voice communication session between user equipments, in combination with web based applications: A media server, a method, a computer program and a computer program product for the media server, are provided for combining a speech related to a voice over IP (VoIP) voice communication session between a user equipment A and a user equipment B, with a web based applications. The method... Agent: Telefonaktiebolaget L M Ericsson (publ)
20110224968 - Translation apparatus and translation method: A display section displays an obtained character-string written in a first language. A translation unit extraction section divides the obtained character-string into predetermined translation units, and extracts a character-string for each translation unit. A translation unit translates the extracted character-strings in the translation units into a second language. A display... Agent:
20110224970 - Method and system for providing translation services: A method and system of automatic interpreting in which either one or two telecommunication devices are used. In one arrangement, there is one phone shared by two parties who speak different languages. The languages to be spoken are identified e.g. using an onscreen menu on the phone and an appropriate... Agent:
20110224972 - Localization for interactive voice response systems: A language-neutral speech grammar extensible markup language (GRXML) document and a localized response document are used to build a localized GRXML document. The language-neutral GRXML document specifies an initial grammar rule element. The initial grammar rule element specifies a given response type identifier and a given action. The localized response... Agent: Microsoft Corporation
20110224971 - N-gram selection for practical-sized language models: Described is a technology by which a statistical N-gram (e.g., language) model is trained using an N-gram selection technique that helps reduce the size of the final N-gram model. During training, a higher-order probability estimate for an N-gram is only added to the model when the training data justifies adding... Agent: Microsoft Corporation
20110224973 - System, method and computer program product for dynamically correcting grammar associated with text: In accordance with embodiments, there are provided mechanisms and methods for dynamically correcting grammar associated with text. These mechanisms and methods for dynamically correcting grammar associated with text can enable enhanced data display, simplified language support, etc.... Agent: Salesforce.com, Inc.
20110224974 - Speech recognition and transcription among users having heterogeneous protocols: A system is disclosed for facilitating speech recognition and transcription among users employing incompatible protocols for generating, transcribing, and exchanging speech. The system includes a system transaction manager that receives a speech information request from at least one of the users. The speech information request includes formatted spoken text generated... Agent:
20110224975 - Low-delay audio coder: The present invention relates to methods and devices for encoding and decoding digital audio signals, e.g. a speech signal. An audio coder and a decoder are provided wherein a modeller adds a first distribution model obtained from model parameters of past segments of the digital audio signal and a fixed... Agent: GlobalIPSolutions, Inc
20110224976 - Speech intelligibility predictor and applications thereof: The application relates to a method of providing a speech intelligibility predictor value for estimating an average listener's ability to understand of a target speech signal when said target speech signal is subject to a processing algorithm and/or is received in a noisy environment. The application further relates to a... Agent:
20110224977 - Robot, method and program of controlling robot: A robot may include a driving control unit configured to control a driving of a movable unit that is connected movably to a body unit, a voice generating unit configured to generate a voice, and a voice output unit configured to output the voice, which has been generated by the... Agent: Honda Motor Co., Ltd.
20110224978 - Information processing device, information processing method and program: An information processing device includes an audio-based speech recognition processing unit which is input with audio information as observation information of a real space, executes an audio-based speech recognition process, thereby generating word information that is determined to have a high probability of being spoken, an image-based speech recognition processing... Agent:
20110224979 - Enhancing speech recognition using visual information: Speech recognition device uses visual information to narrow down the range of likely adaptation parameters even before a speaker makes an utterance. Images of the speaker and/or the environment are collected using an image capturing device, and then processed to extract biometric features and environmental features. The extracted features and... Agent: Honda Motor Co., Ltd.
20110224980 - Speech recognition system and speech recognizing method: A speech recognition system according to the present invention includes a sound source separating section which separates mixed speeches from multiple sound sources from one another; a mask generating section which generates a soft mask which can take continuous values between 0 and 1 for each frequency spectral component of... Agent: Honda Motor Co., Ltd.
20110224981 - Dynamic speech recognition and transcription among users having heterogeneous protocols: A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a... Agent:
20110224982 - Automatic speech recognition based upon information retrieval methods: Described is a technology in which information retrieval (IR) techniques are used in a speech recognition (ASR) system. Acoustic units (e.g., phones, syllables, multi-phone units, words and/or phrases) are decoded, and features found from those acoustic units. The features are then used with IR techniques (e.g., TF-IDF based retrieval) to... Agent: C/o Microsoft Corporation
20110224983 - N-gram model smoothing with independently controllable parameters: Described is a technology by which a probability is estimated for a token in a sequence of tokens based upon a number of zero or more times (actual counts) that the sequence was observed in training data. The token may be a word in a word sequence, and the estimated... Agent: Microsoft Corporation
20110224984 - Fast partial pattern matching system and method: Method, system and computer program for determining the matching between a first and a second sampled signals using an improved Dynamic Time Warping algorithm, called Unbounded DTW. It uses a dynamic programming algorithm to find exact start-end alignment points, unknown a priori, being the initial subsampling of the similarity matrix... Agent: Telefonica, S.a.
20110224985 - Model adaptation device, method thereof, and program thereof: A model adaptation device includes a text database that stores a plurality of sentences containing predetermined phonemes; a sentence list that includes a plurality of sentences that describe the contents of the input voice; an input unit to which the input voice is input; a model adaptation unit that performs... Agent:
20110224986 - Voice authentication systems and methods: A method for configuring a voice authentication system employing at least one authentication engine comprises utilising the at least one authentication engine to systematically compare a plurality of impostor voice sample against a voice sample of a legitimate person to derive respective authentication scores. The resultant authentication scores are analysed... Agent:
20110224987 - Detection of voice inactivity within a sound stream: A method for identifying end of voiced speech within an audio stream of a noisy environment employs a speech discriminator. The discriminator analyzes each window of the audio stream, producing an output corresponding to the window. The output is used to classify the window in one of several classes, for... Agent: Applied Voice & Speech Technologies, Inc.
20110224988 - Intracardiac electrogram time frequency noise detection: Systems, methods, and apparatus for identifying and classifying noise of an intracardiac electrogram of a cardiac rhythm management device to prevent inaccurate detection of a cardiac episode are disclosed. In an example, three channels are analyzed to identify and determine whether an episode or noise has been detected.... Agent:
20110224989 - Methods and systems for word tone implementation: Word Tone uses one or more recordings of recognizable scriptures and sets it to an appropriate custom beat or rhythm which enhances the enjoyment of the recorded verse. Further exemplary embodiments enable the user to set up a plurality of profiles for one or more callers such that the electronic... Agent:
20110224990 - Speaker speed conversion system, method for same, and speed conversion device: A speaker speed conversion system includes: a risk site detection unit (22) for detecting sites of risk regarding sound quality from among speech that is received as input, a frame boundary detection unit (23) for searching for a plurality of points that can serve as candidates of frame boundaries from... Agent:
20110224993 - Apparatus and method for processing multi-channel audio signal using space information: An apparatus for and a method of processing a multi-channel audio signal using space information. The apparatus includes: a main coding unit down mixing a multi-channel audio signal by applying space information to surround components included in the multi-channel audio signal, generating side information using the multi-channel audio signal or... Agent:
20110224995 - Coding with noise shaping in a hierarchical coder: A method is provided for hierarchical coding of a digital audio signal comprising, for a current frame of the input signal: a core coding, delivering a scalar quantization index for each sample of the current frame and at least one enhancement coding delivering indices of scalar quantization for each coded... Agent: France Telecom
20110224994 - Energy conservative multi-channel audio coding: The invention relates to the technical field of audio encoding and/or decoding technologies, and thus concerns an overall encoding procedure and associated decoding procedure. The encoding procedure involves at least two signal encoding processes (S1-S3) operating on signal representations of a set of audio input channels, as well as residual... Agent: Telefonaktiebolaget Lm Ericsson (publ)
20110224991 - Scalable lossless audio codec and authoring tool: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload... Agent: Dts, Inc.
20110224992 - Set-top-box with integrated encoder/decoder for audience measurement: Systems and methods are disclosed for encoding audio in a set-top box that is invoked by a user when listening to a broadcast audio signal from a radio, TV, streaming or other audio device. A detection and identification system comprising an audio encoder is integrated in a set-top box, where... Agent:
20110224996 - Adjustable sampling rate converter: Techniques of this disclosure provide for adjustment of a conversion rate of a sampling rate converter (SRC) in real-time. The SRC determines relative timing of generated output samples based on non-approximated integer components that are recursively updated. The SRC may further base relative timing of output samples on a value... Agent: Qualcomm Incorporated09/08/2011 > 17 patent applications in 15 patent subcategories. class, title,number
20110218796 - Transliteration using indicator and hybrid generative features: Described is a transliteration engine/substring decoder that back-transliterates an input string from a source language into an output string in a target language. The transliteration engine may be based upon discriminately weighted indicator features and/or generative models in which the decoder's discriminative parameters are learned. The training data may be... Agent: Microsoft Corporation
20110218797 - Encoder for audio signal including generic audio and speech frames: A method for encoding audio frames by producing a first frame of coded audio samples by coding a first audio frame in a sequence of frames, producing at least a portion of a second frame of coded audio samples by coding at least a portion of a second audio frame... Agent: Motorola, Inc.
20110218798 - Obfuscating sensitive content in audio sources: Techniques implemented as systems, methods, and apparatuses, including computer program products, for obfuscating sensitive content in an audio source representative of an interaction between a contact center caller and a contact center agent. The techniques include performing, by an analysis engine of a contact center system, a context-sensitive content analysis... Agent: Nexdia Inc.
20110218799 - Decoder for audio signal including generic audio and speech frames: A method for decoding audio frames includes producing a first frame of coded audio samples, producing at least a portion of a second frame of coded audio samples, generating audio gap filler samples based on parameters representative of a weighted segment of the first frame of coded audio samples or... Agent: Motorola, Inc.
20110218800 - Method and apparatus for obtaining pitch gain, and coder and decoder: The present invention relates to a method and apparatus for obtaining a pitch gain, and a coder and a decoder. The method includes: obtaining information about an input signal; and obtaining a pitch gain corresponding to the information about the input signal according to the correspondence between the signal information... Agent: Huawei Technologies Co., Ltd.
20110218801 - Method for error concealment in the transmission of speech data with errors: The invention relates to a method for outputting a speech signal. Speech signal frames are received and are used in a predetermined sequence in order to produce a speech signal to be output. If one speech signal frame to be received is not received, then a substitute speech signal frame... Agent: Robert Bosch Gmbh
20110218802 - Continuous speech recognition: A computerized method for continuous speech recognition using a speech recognition engine and a phoneme model. The computerized method inputs a speech signal into the speech recognition engine. Based on the phoneme model, the speech signal is indexed by scoring for the phonemes of the phoneme model and a time-ordered... Agent:
20110218803 - Method and system for assessing intelligibility of speech represented by a speech signal: A method for assessing intelligibility of speech represented by a speech signal includes providing a speech signal and performing a feature extraction on at least one frame of the speech signal so as to obtain a feature vector for each of the at least one frame of the speech signal.... Agent: Deutsche Telekom Ag
20110218804 - Speech processor, a speech processing method and a method of training a speech processor: combining the likelihoods determined by the acoustic model and the language model and outputting a sequence of words identified from said speech input signal, wherein said acoustic model is context based for said speaker, said context based information being contained in said model using a plurality of decision trees, wherein... Agent: Kabushiki Kaisha Toshiba
20110218806 - Determining text to speech pronunciation based on an utterance from a user: Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon includes native phonetic transcriptions (base forms) for non-native (foreign) words which are automatically derived from non-native phonetic transcriptions of the non-native words.... Agent: Nuance Communications, Inc.
20110218807 - Method for automated sentence planning in a task classification system: The invention relates to a method for sentence planning (120) in a task classification system that interacts with a user. The method may include recognizing symbols in the user's input communication and determining whether the user's input communication can be understood. If the user's communication can be understood, understanding data... Agent: At&t Intellectual Property Ll, Lp
20110218805 - Spoken term detection apparatus, method, program, and storage medium: A spoken term detection apparatus includes: processing performed by a processor includes a feature extraction process extracting an acoustic feature from speech data accumulated in an accumulation part and storing an extracted acoustic feature in an acoustic feature storage, a first calculation process calculating a standard score from a similarity... Agent: Fujitsu Limited
20110218808 - System and method for spelling recognition using speech and non-speech input: A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method includes generating an unweighted grammar, selecting a database of words, generating a weighted grammar using the unweighted grammar and a statistical letter model trained on the database of words, receiving speech from a... Agent: At&t Intellectual Property Ii, Lp
20110218809 - Voice synthesis device, navigation device having the same, and method for synthesizing voice message: A voice synthesis device includes: a memory for storing a plurality of recorded voice data; a dividing unit for dividing a text into a plurality of words or phrases, wherein the text is to be converted into a voice message; a verifying unit for verifying whether one of the recorded... Agent: Denso Corporation
20110218810 - System for controlling digital effects in live performances with vocal improvisation: A system for controlling digital effects in live performances with vocal improvisation is described. The system features a complex controller that in one embodiment utilizes several magnetically activated electronic switches attached to a glove that is worn by an artist during a live performance. The switches are activated by a... Agent:
20110218811 - Retaining device for singing balloon and balloon singing incorporating same: A singing balloon includes a balloon body, a balloon retaining body, an oscillator unit, an audio and power unit. When the balloon body is placed within an accommodating space defined by the balloon retaining body and the balloon body is allowed to be completely adhered to the oscillator unit, the... Agent: Medici Creativity Co., Ltd.
20110218812 - Increasing the relevancy of media content: The present invention relates to increasing the relevance of media content communicated to consumers who are consuming the media content. In this regard, at least one of a personal device can be synced with a media device, each of the personal device is associated with at least one of a... Agent:09/01/2011 > 11 patent applications in 9 patent subcategories. class, title,number
20110213607 - Conference system, information processor, conference supporting method and information processing method: Speech given by a speaker in English is recognized. An upper half of a subtitle display area of a display used by a listener is used as a parallel area and a lower half thereof is used as an original area. In the parallel area, a parallel subtitle in which... Agent: Sharp Kabushiki Kaisha
20110213608 - Apparatus and method for rendering multi-lingual text: A computer readable storage medium includes executable instructions, which when executed by a computer, cause the computer to specify a font property file. Font bit map files are created based upon the font property file. An input file with multi-lingual text is received. The font bit map files are accessed... Agent: Sap Ag
20110213609 - Language-independent program instruction: A natural language-independent computer program is constructed. A data element is defined by a graphical representation in a user interface. A data element has a data type and a value. An operator is defined on multiple data elements by association of the graphical representations in the user interface. A natural... Agent: International Business Machines Corporation
20110213610 - Processor implemented systems and methods for measuring syntactic complexity on spontaneous non-native speech data by using structural event detection: Systems and methods are provided for providing a score for a spontaneous non-native speech response to a prompt. A transcription of the spontaneous speech response is accessed. A plurality of clauses are identified within the spontaneous speech response, where identifying a clause includes identifying a beginning boundary and an end... Agent:
20110213612 - Acoustic signal classification system: A system classifies the source of an input signal. The system determines whether a sound source belongs to classes that may include human speech, musical instruments, machine noise, or other classes of sound sources. The system is robust, performing classification despite variation in sound level and noise masking. Additionally, the... Agent: Qnx Software Systems Co.
20110213611 - Method and device for controlling the transport of an object to a predetermined destination: A method and a device control the transport of an object to a predetermined destination. The object is provided with information on a destination to which the object is to be transported. The destination information with which the object is provided is inputted into a speech detection station. A speech... Agent: Siemens Aktiengesellschaft
20110213613 - Automatic language model update: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound... Agent: Google Inc., A Ca Corporation
20110213614 - Method of analysing an audio signal: A method of analysing an audio signal is disclosed. A digital representation of an audio signal is received and a first output function is generated based on a response of a physiological model to the digital representation. At least one property of the first output function may be determined. One... Agent: Newsouth Innovations Pty Limited
20110213615 - Voice authentication system and methods: A method for configuring a voice authentication system comprises ascertaining a measure of confidence associated with a voice sample enrolled with the authentication system. The measure of confidence is derived through simulated impostor testing carried out on the enrolled sample.... Agent: Auraya Pty Ltd
20110213616 - \"system and method for the adaptive use of uncertainty information in speech recognition to assist in the recognition of natural language phrases\": A speech recognition system includes a natural language processing component and an automated speech recognition component distinct from each other such that uncertainty in speech recognition is isolated from uncertainty in natural language understanding, wherein the natural language processing component and an automated speech recognition component communicate corresponding weighted meta-information... Agent:
20110213617 - Audio source system and method: A system includes a computer having a device driver. The device driver includes a detection module to detect an audio input. The device driver includes a selection module to send the audio input to audio hardware after detection of the audio input. The device driver also includes an emulation module... Agent: Sigmatel, Inc.Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130509:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 0.8669 seconds