|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
03/2012 | Recent | 13: May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression March listing by industry category 03/12Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 03/29/2012 > 36 patent applications in 17 patent subcategories. listing by industry category
20120078608 - Simultaneous translation of open domain lectures and speeches: A real-time open domain speech translation system for simultaneous translation of a spoken presentation that is a spoken monologue comprising one of a lecture, a speech, a presentation, a colloquium, and a seminar. The system includes an automatic speech recognition unit configured for accepting sound comprising the spoken presentation in... Agent: Mobile Technologies, LLC
20120078607 - Speech translation apparatus, method and program: According to one embodiment, a speech translation apparatus includes a receiving unit, a first recognition unit, a second recognition unit, a first generation unit, a translation unit, a second generation unit, a synthesis unit. The receiving unit is configured to receive a speech in a first language and convert to... Agent: Kabushiki Kaisha Toshiba
20120078609 - System and method for language translation in a hybrid peer-to-peer environment: An improved system and method are disclosed for peer-to-peer communications. In one example, the method enables an endpoint to send and/or receive audio speech translations to facilitate communications between users who speak different languages.... Agent: Damaka, Inc.
20120078611 - Context-aware conversational user interface: An input handler may receive natural language input associated with a command from a user through a user interface, and a language parser may parse the natural language input to determine parsed natural language input. A context monitor may receive context information associated with the user, and a context parser... Agent: Sap Ag
20120078610 - Determining offer terms from text: Systems, methods, and machine readable and executable instructions are provided for determining offer terms from text. A method for determining offer terms from text can include mapping keywords to a domain of a procurement event, and receiving, to a computing device, an offer text associated with the procurement event. Event-specific... Agent:
20120078616 - Handheld electronic device and associated method enabling spell checking in a text disambiguation environment: An improved handheld electronic device and associated method enable spell checking in a reduced keyboard and disambiguation environment. The improved spell checking routine converts a misspelled word into a canonical version thereof and receives from a dictionary 42 proposed letter for possible acceptance by the spell checking routine. The spell... Agent: Research In Motion Limited
20120078613 - Method, system, and computer readable medium for graphically displaying related text in an electronic document: Disclosed herein are systems and methods for navigating electronic texts. According to an aspect, a method may include receiving search criteria for searching an electronic text. Further, the method may include determining text subgroups within the electronic text. The method may also include determining, for each text subgroup, a similarity... Agent: Rhonda Enterprises, LLC
20120078615 - Multiple touchpoints for efficient text input: Methods and systems for using multiple simultaneous touchpoints of a touch-sensitive keyboard, such as an on-screen keyboard, for more efficient text input are provided. A method for generating text using a touch-sensitive keyboard may include receiving touch input from multiple simultaneous touchpoints. The method may also include determining a text... Agent: Google Inc.
20120078612 - Systems and methods for navigating electronic texts: Disclosed herein are systems and methods for navigating electronic texts. According to an aspect, a method may include determining text subgroups within an electronic text. The method may also include selecting a text seed within one of the text subgroups. Further, the method may include determining a similarity relationship between... Agent: Rhonda Enterprises, LLC
20120078614 - Virtual keyboard for a non-tactile three dimensional user interface: A method, including presenting, by a computer system executing a non-tactile three dimensional user interface, a virtual keyboard on a display, the virtual keyboard including multiple virtual keys, and capturing a sequence of depth maps over time of a body part of a human subject. On the display, a cursor... Agent: Primesense Ltd.
20120078617 - System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling: The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes receiving symbolic input as labeled speech data, overgenerating potential pronunciations based on the symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential... Agent: At&t Intellectual Property I, L.p.
20120078618 - Method and apparatus for generating lattice vector quantizer codebook: A method and an apparatus for generating a lattice vector quantizer codebook are disclosed. The method includes: storing an eigenvector set that includes amplitude vectors and/or length vectors, where the amplitude vectors and/or length vectors are different from each other and correspond to a root leader of a lattice vector... Agent: Huawei Technologies Co., Ltd
20120078619 - Control apparatus and control method: An apparatus may include a control unit to selectively control volume of content sound and volume of speech sound according to a priority assigned to a user corresponding to speech sound and a priority assigned to content data. When volume control is to be performed on a priority basis, the... Agent: Sony Corporation
20120078620 - Robust noise estimation: An enhancement system improves the estimate of noise from a received signal. The system includes a spectrum monitor that divides a portion of the signal at more than one frequency resolution. Adaptation logic derives a noise adaptation factor of the received signal. A plurality of devices tracks the characteristics of... Agent: Qnx Software Systems Co.
20120078623 - Method and apparatus for communication between humans and devices: This invention relates to methods and apparatus for improving communications between humans and devices. The invention provides a method of modulating operation of a device, comprising: providing an attentive user interface for obtaining information about an attentive state of a user; and modulating operation of a device on the basis... Agent:
20120078621 - Sparse representation features for speech recognition: i
20120078622 - Spoken dialogue apparatus, spoken dialogue method and computer program product for spoken dialogue: According to one embodiment, a spoken dialogue apparatus includes a detection unit configured to detect speech of a user; a recognition unit configured to recognize the speech; an output unit configured to output a response voice corresponding to the result of speech recognition; an estimate unit configured to estimate probability... Agent: Kabushiki Kaisha Toshiba
20120078624 - Method for detecting voice section from time-space by using audio and video information and apparatus thereof: The present invention relates to a method for detecting a voice section in time-space by using audio and video information. According to an embodiment of the present invention, a method for detecting a voice section from time-space by using audio and video information comprises the steps of: detecting a voice... Agent: Korea University-industrial & Academic Collaboration Foundation
20120078625 - Waveform analysis of speech: A waveform analysis of speech is disclosed. Embodiments include methods for analyzing captured sounds produced by animals, such as human vowel sounds, and accurately determining the sound produced. Some embodiments utilize computer processing to identify the location of the sound within a waveform, select a particular time within the sound,... Agent: Waveform Communications, LLC
20120078627 - Electronic device with text error correction based on voice recognition data: During operation of an electronic device such as a cellular telephone with a touch screen display or other electronic equipment, a voice recognition engine may gather data on spoken words. Data on the spoken words that are recognized may be maintained in a spoken word database maintained by an input... Agent:
20120078628 - Head-mounted text display system and method for the hearing impaired: The head-mounted text display system for the hearing impaired is a speech-to-text system, in which spoken words are converted into a visual textual display and displayed to the user in passages containing a selected number of words. The system includes a head-mounted visual display, such as eyeglass-type dual liquid crystal... Agent:
20120078629 - Meeting support apparatus, method and program: According to one embodiment, a meeting support apparatus includes a storage unit, a determination unit, a generation unit. The storage unit is configured to store storage information for each of words, the storage information indicating a word of the words, pronunciation information on the word, and pronunciation recognition frequency. The... Agent: Kabushiki Kaisha Toshiba
20120078626 - Systems and methods for converting speech in multimedia content to text: Methods and systems for converting speech to text are disclosed. One method includes analyzing multimedia content to determine the presence of closed captioning data. The method includes, upon detecting closed captioning data, indexing the closed captioning data as associated with the multimedia content. The method also includes, upon failure to... Agent:
20120078631 - Recognition of target words using designated characteristic values: Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data being associated with the candidate word set; performing segmentation of the characteristic computation data to generate a plurality of text segments; combining the plurality of... Agent: Alibaba Group Holding Limited
20120078630 - Utterance verification and pronunciation scoring by lattice transduction: In the field of language learning systems, proper pronunciation of words and phrases is an integral aspect of language learning, determining the proximity of the language learner's pronunciation to a standardized, i.e. ‘perfect’, pronunciation is utilized to guide the learner from imperfect toward perfect pronunciation. In this regard, a phoneme... Agent:
20120078632 - Voice-band extending apparatus and voice-band extending method: An optical device includes a fast Fourier transform (FFT) unit, a signal noise ratio (SNR) calculation processing unit, a band selecting unit, an extension-signal creating unit, an addition unit, and an inverse fast Fourier transform (IFFT) unit. The FFT unit performs the Fourier transform on an input signal that is... Agent: Fujitsu Limited
20120078633 - Reading aloud support apparatus, method, and program: According to one embodiment, a reading aloud support apparatus includes a reception unit, a first extraction unit, a second extraction unit, an acquisition unit, a generation unit, a presentation unit. The reception unit is configured to receive an instruction. The first extraction unit is configured to extract, as a partial... Agent: Kabushiki Kaisha Toshiba
20120078634 - Voice dialogue system, method, and program: A voice dialogue system executing an operation by a voice dialogue with a user, includes a history storage unit storing an operation name of the operation executed by the voice dialogue system and an operation history corresponding to a number of execution times of the executed operation; a voice storage... Agent: Kabushiki Kaisha Toshiba
20120078636 - Evidence diffusion among candidate answers during question answering: Diffusing evidence among candidate answers during question answering may identify a relationship between a first candidate answer and a second candidate answer, wherein the candidate answers are generated by a question-answering computer process, the candidate answers have associated supporting evidence, and the candidate answers have associated confidence scores. All or... Agent: International Business Machines Corporation
20120078637 - Method and apparatus for performing and controlling speech recognition and enrollment: A method and an apparatus for performing and controlling speech recognition and enrolment are provided. The method for performing speech recognition and enrolment includes: receiving a Speech Enrolment Start Request and a Speech Recognition Request sent from a media gateway controller (MGC); performing speech recognition and enrolment according to the... Agent: Huawei Technologies Co., Ltd.
20120078635 - Voice control system: One embodiment of a voice control system includes a first electronic device communicatively coupled to a server and configured to receive a speech recognition file from the server. The speech recognition file may include a speech recognition algorithm for converting one or more voice commands into text and a database... Agent: Apple Inc.
20120078638 - Centralized biometric authentication: A communications system includes a receiver and at least one transmitter. The receiver receives, from different intermediate systems, biometric samples from parties attempting to obtain services from the intermediate systems and information characterizing the expected identifies of the parties. The at least one transmitter transmits, to the intermediate systems, verification... Agent: At&t Intellectual Property I, L.p.
20120078639 - System and method for voice authentication over a computer network: Systems, computer-implemented methods, and tangible computer-readable media are provided for voice authentication. The method includes receiving a speech sample from a user through an Internet browser for authentication as part of a request for a restricted-access resource, performing a comparison of the received speech sample to a previously established speech... Agent: At&t Intellectual Property I, L.p.
20120078640 - Audio encoding device, audio encoding method, and computer-readable medium storing audio-encoding computer program: An audio encoding device includes, a time-frequency transformer that transforms signals of channels, a first spatial-information determiner that generates a frequency signal of a third channel, a second spatial-information determiner that generates a frequency signal of the third channel, a similarity calculator that calculates a similarity between the frequency signal... Agent: Fujitsu Limited
20120078641 - Compression coding and decoding method, coder, decoder, and coding device: The embodiments of the present invention relate to a compression coding and decoding method, a coder, a decoder and a coding device. The compression coding method includes: extracting sign information of an input signal to obtain an absolute value signal of the input signal; obtaining a residual signal of the... Agent: Huawei Technologies Co., Ltd.
20120078642 - Encoding method and encoding device, decoding method and decoding device and transcoding method and transcoder for multi-object audio signals: A method of encoding a multi-object audio signal and an encoding apparatus, a decoding method and a decoding apparatus, and a transcoding method and a transcoder are provided. A multi-object audio signal encoding apparatus may encode object signals obtained by excluding ForeGround Objects (FGOs) from a plurality of input object... Agent:03/22/2012 > 26 patent applications in 15 patent subcategories. listing by industry category
20120072201 - Language translation reuse in different systems: An account system obtains a first translation file associated with it. The account system obtains a second translation file from a second account system, wherein the second account system is a data processing system. The account system determines whether a third account system has a third translation file with untranslated... Agent: International Business Machines Corporation
20120072202 - Sentence-based paragraphic online translating system and method thereof: A sentence-based paragraphic online translating system and the method thereof are described. The paragraphic online translating system and the method thereof establish at least one sentence item in the database of a server, wherein each of the sentence items contains a first language article and a correspondingly translated second language... Agent: Inventec Corporation
20120072203 - System and method for using first language input to instantly output second language: A system and a method for using first language input to instantly output second language are provided. The system and the method continuously receive input words in the first language and find the corresponding translation words in the second language according to the stored words, and then output the translation... Agent: Inventec Corporation
20120072205 - Handheld electronic device and method for disambiguation of compound text input and that employs n-gram data to limit generation of low-probability compound language solutions: A handheld electronic device includes a reduced QWERTY keyboard and is enabled with disambiguation software that is operable to disambiguate compound text input. The device is able to assemble language objects in the memory to generate compound language solutions. The device is able to analyze the combinations of language objects... Agent: Research In Motion Limited
20120072204 - Systems and methods for normalizing input media: A method and system for processing input media for provision to a text to speech engine comprising: a rules engine configured to maintain and update rules for processing the input media; a pre-parsing filter module configured to determine one or more metadata attributes using pre-parsing rules; a parsing filter module... Agent: Voice On The Go Inc.
20120072207 - Down-mixing device, encoder, and method therefor: Provided are a down-mixing method and an encoder, wherein a high quantization performance can be realized when a balance adjustment operation due to a balance weight coefficient and a removal operation of a main component are combined. In the encoder (100), a down-mixing unit (101) generates a mono signal by... Agent: Panasonic Corporation
20120072206 - Terminal apparatus and speech processing program: A terminal apparatus configured to obtain positional information indicating a position of another apparatus; to obtain positional information indicating a position of the terminal apparatus; to obtain a first direction, which is a direction to the obtained position of the another apparatus and calculated using the obtained position of the... Agent: Fujitsu Limited
20120072208 - Determining pitch cycle energy and scaling an excitation signal: An electronic device for determining a set of pitch cycle energy parameters is described. The electronic device includes a processor and executable instructions stored in memory. The electronic device obtains a frame, a set of filter coefficients and a residual signal based on the frame and the set of filter... Agent: Qualcomm Incorporated
20120072209 - Estimating a pitch lag: An electronic device for estimating a pitch lag is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current frame. The electronic device also obtains a residual signal based on the current frame.... Agent: Qualcomm Incorporated
20120072210 - Signal processing method, apparatus and program: In one embodiment, a signal processing method is disclosed. The method can perform filter processing of convoluting a tap coefficient in a first signal sequence to generate a second signal sequence. The method can subtract the second signal sequence from a third signal sequence to generate a fourth signal sequence.... Agent: Kabushiki Kaisha Toshiba
20120072213 - Speech sound intelligibility assessment system, and method and program therefor: The speech sound intelligibility assessment system includes: an output section for presenting a speech sound to a user; a biological signal measurement section for measuring an electroencephalogram signal of the user; a positive component determination section for determining presence/absence of a positive component of an event-related potential in the electroencephalogram... Agent: Panasonic Corporation
20120072212 - System and method for mobile automatic speech recognition: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when... Agent: At&t Intellectual Property Ii, L.p.
20120072211 - Using codec parameters for endpoint detection in speech recognition: Systems, methods and apparatus for determining an estimated endpoint of human speech in a sound wave received by a mobile device having a speech encoder for encoding the sound wave to produce an encoded representation of the sound wave. The estimated endpoint may be determined by analyzing information available from... Agent: Nuance Communications, Inc.
20120072214 - Frame erasure concealment technique for a bitstream-based feature extractor: A frame erasure concealment technique for a bitstream-based feature extractor in a speech recognition system particularly suited for use in a wireless communication system operates to “delete” each frame in which an erasure is declared. The deletions thus reduce the length of the observation sequence, but have been found to... Agent: At&t Intellectual Property Ii, L.p.
20120072216 - Age determination using speech: A method and device are configured to receive voice data from a user and perform speech recognition on the received voice data. A confidence score is calculated that represents the likelihood that received voice data has been accurately recognized. A likely age range is determined associated with the user based... Agent: Verizon Patent And Licensing Inc.
20120072215 - Full-sequence training of deep structures for speech recognition: A method is disclosed herein that include an act of causing a processor to access a deep-structured model retained in a computer-readable medium, wherein the deep-structured model comprises a plurality of layers with weights assigned thereto, transition probabilities between states, and language model scores. The method can further include the... Agent: Microsoft Corporation
20120072217 - System and method for using prosody for voice-enabled search: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates... Agent: At&t Intellectual Property I, L.p
20120072218 - System and method for tracking persons of interest via voiceprint: Disclosed are systems, methods, and computer readable media for tracking a person of interest. The method embodiment comprises identifying a person of interest, capturing a voiceprint of the person of interest, comparing a received voiceprint of a caller with the voiceprint of the person of interest, and tracking the caller... Agent: At&t Intellectual Property Ii, L.p.
20120072221 - Distributed voice user interface: A distributed voice user interface system includes a local device which receives speech input issued from a user. Such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to... Agent: Ben Franklin Patent Holding, LLC
20120072220 - Matching text sets: Matching text sets is disclosed, including: extracting a text set from data associated with a current period; storing the text set with a plurality of text sets; extracting a keyword from the text set; determining a weight value associated with the keyword associated with the text set; determining a degree... Agent: Alibaba Group Holding Limited
20120072219 - System and method for enhancing voice-enabled search based on automated demographic identification: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the... Agent: At & T Intellectual Property I, L.p.
20120072222 - Automatic detection, summarization and reporting of business intelligence highlights from automated dialog systems: A method and system for reporting data from a spoken dialog service is disclosed. The method comprises extracting data regarding user dialogs using a dialog logging module in the spoken dialog service, analyzing the data to identify trends and reporting the trends. The data may be presented in a visual... Agent: At&t Intellectual Property Ii, L.p.
20120072223 - System and method for configuring voice synthesis: Systems and methods for providing synthesized speech in a manner that takes into account the environment where the speech is presented. A method embodiment includes, based on a listening environment and at least one other parameter associated with at least one other parameter, selecting an approach from the plurality of... Agent: At&t Intellectual Property Ii, L.p.
20120072224 - Method of speech synthesis: The present invention relates to a method of text-based speech synthesis, wherein at least one portion of a text is specified; the intonation of each portion is determined; target speech sounds are associated with each portion; physical parameters of the target speech sounds are determined; speech sounds most similar in... Agent:
20120072226 - Parcor coefficient quantization method, parcor coefficient quantization apparatus, program and recording medium: On a criterion to minimize the entropy of the linear prediction residual of the input signal used for calculation of the input PARCOR coefficient sequence, PARCOR coefficients with larger absolute values are quantized with higher quantization precisions so as to reduce the increase of the code amount of the linear... Agent: Nippon Telegraph And Telephone Corporation
20120072225 - Systems and methods for encoding and decoding: Systems and methods for encoding and decoding are disclosed. The systems and methods include multimedia decoder instantiation systems and multimedia processing engines which are capable of being upgraded or reconfigured to support a new or previously-unsupported compression format, without the need for platform-specific software or hardware upgrades.... Agent: Onecodec, Ltd.03/15/2012 > 28 patent applications in 17 patent subcategories. listing by industry category
20120065957 - Interpersonal communications device and method: Device for converting a source language message into a target language message is disclosed. An embodiment of the device includes an input, a controller, and an output. The input receives separately entered source language phrases as the source language message for a user. The controller obtains, for each entered source... Agent:
20120065958 - Methods and systems for providing anonymous and traceable external access to internal linguistic assets: The present application is directed towards methods and systems for providing anonymous and traceable external access to internal linguistic assets. The methods and systems described allow users the freedom to use a linguistic resource with the security that their identities and interactions with the resource are shielded from other users.... Agent:
20120065960 - Generating parser combination by combining language processing parsers: A computer implemented method, a computer system, and a program for generating a parser combination. The method includes: generating a parser combination by combining parsers each associated with at least one grammar description, where the step is carried out using (i) at least one grammar description means and (ii) a... Agent: International Business Machines Corporation
20120065961 - Speech model generating apparatus, speech synthesis apparatus, speech model generating program product, speech synthesis program product, speech model generating method, and speech synthesis method: According to one embodiment, a speech model generating apparatus includes a spectrum analyzer, a chunker, a parameterizer, a clustering unit, and a model training unit. The spectrum analyzer acquires a speech signal corresponding to text information and calculates a set of spectral coefficients. The chunker acquires boundary information indicating a... Agent: Kabushiki Kaisha Toshiba
20120065963 - System and method of generating responses to text-based messages: In accordance with one aspect of the present invention, an automated method of and system for generating a response to a text-based natural language message is disclosed. The method includes identifying a first selected input clause in a sentence in the text-based natural language message. Also, assigning a semantic tag... Agent: At&t Intellectual Property I, Lp
20120065962 - Systems and methods of building and using custom word lists: Standard word lists that are often used for such operations as predictive text, spell checking, and word completion are based on general linguistic data that might not accurately reflect actual text usage patterns of particular users. Systems and methods of building and using a custom word list for use in... Agent:
20120065959 - Word graph: One example embodiment includes a method for constructing a word graph. The method includes obtaining a subject text and dividing the subject text into one or more units. The method also includes dividing the units into one or more sub-units and recording each of the one or more sub-units.... Agent:
20120065964 - Method and apparatus for introducing information into a data stream and method and apparatus for encoding an audio signal: Techniques for introducing information into a data stream first obtains the spectral values of the short-term spectrum of the audio signal. Separately, information to be introduced are combined with a spread sequence obtaining a spread information signal, whereupon a spectral representation of the spread information is generated, then weighted with... Agent:
20120065965 - Apparatus and method for encoding and decoding signal for high frequency bandwidth extension: An apparatus and method for encoding and decoding a signal for high frequency bandwidth extension are provided. An encoding apparatus may down-sample a time domain input signal, may core-encode the down-sampled time domain input signal, may transform the core-encoded time domain input signal to a frequency domain input signal, and... Agent: Samsung Electronics Co., Ltd.
20120065966 - Voice activity detection method and apparatus, and electronic device: A voice activity detection method and apparatus, and an electronic device are provided. The method includes: obtaining a time domain parameter and a frequency domain parameter from an audio frame; obtaining a first distance between the time domain parameter and a long-term slip mean of the time domain parameter in... Agent: Huawei Technologies Co., Ltd.
20120065967 - Communication device and signal processing method: Provided is a communication device which can easily provide a function to enable signal cross-reference among a plurality of voice signals having different frequency ranges and to enhance the quality of voice communications, at a low cost. In the communication device, a band expansion unit (104) expands a narrow band... Agent: Panasonic Corporation
20120065968 - Speech recognition method: In a speech recognition method, a number of audio signals are obtained from a voice input of a number of utterances of at least one speaker into a pickup system. The audio signals are examined using a speech recognition algorithm and a recognition result is obtained for each audio signal.... Agent: Siemens Aktiengesellschaft
20120065969 - System and method for contextual social network communications during phone conversation: An embodiment of the invention includes methods and systems for contextual social network communications during a phone conversation. A telephone conversation between a first user and at least one second user is monitored. More specifically, a monitor identifies terms spoken by the first user and the second user during the... Agent: International Business Machines Corporation
20120065970 - System and method for providing group discussions: A system and method for providing a discussion, including receiving by a processor text related to a discussion; converting by the processor the text to voice; storing by the processor in a memory the converted voice; receiving by the processor voice related to the discussion; storing by the processor in... Agent: Sequent, Inc.
20120065971 - Voice control of multimedia and communications devices: A method for operating a communications device can include receiving a plurality of spoken commands uttered by a user, the plurality of spoken commands comprising a custom written communication message to be displayed. The method can also include executing a speech recognition engine to recognize and convert each of the... Agent: Avon Associates, Inc.
20120065974 - Joint factor analysis scoring for speech processing systems: Method, system, and computer program product are provided for Joint Factor Analysis (JFA) scoring in speech processing systems. The method includes: carrying out an enrolment session offline to enrol a speaker model in a speech processing system using JFA, including: extracting speaker factors from the enrolment session; estimating first components... Agent: International Business Machines Corporation
20120065973 - Method and apparatus for performing microphone beamforming: A method and apparatus for performing microphone beamforming. The method includes recognizing a speech of a speaker, searching for a previously stored image associated with the speaker, searching for the speaker through a camera based on the image, recognizing a position of the speaker, and performing microphone beamforming according to... Agent: Samsung Electronics Co., Ltd.
20120065972 - Wireless voice recognition control system for controlling a welder power supply by voice commands: A wireless voice recognition control system for controlling the operation of an electric welder power supply by operator voice commands is disclosed. The system includes a remote module carried by the welder and a host module interfaced with the electric welder power supply. The remote module compares voice commands by... Agent: Var Systems Ltd.
20120065975 - System and method for pronunciation modeling: Systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to... Agent: At&t Intellectual Property I, L.p.
20120065976 - Deep belief network for large vocabulary continuous speech recognition: A method is disclosed herein that includes an act of causing a processor to receive a sample, wherein the sample is one of spoken utterance, an online handwriting sample, or a moving image sample. The method also comprises the act of causing the processor to decode the sample based at... Agent: Microsoft Corporation
20120065977 - System and method for teaching non-lexical speech effects: Herein, a method is disclosed, which may include delexicalizing a first speech segment to provide a first prosodic speech signal; storing data indicative of the first prosodic speech signal in a computer memory; audibly playing the first speech segment to a language student; prompting the student to recite the speech... Agent: Rosetta Stone, Ltd.
20120065978 - Voice processing device: In voice processing, a first distribution generation unit approximates a distribution of feature information representative of voice of a first speaker per a unit interval thereof as a mixed probability distribution which is a mixture of a plurality of first probability distributions corresponding to a plurality of different phones. A... Agent: Yamaha Corporation
20120065979 - Method and system for text to speech conversion: A system and method for text to speech conversion. The method of performing text to speech conversion on a portable device includes: identifying a portion of text for conversion to speech format, wherein the identifying includes performing a prediction based on information associated with a user. While the portable device... Agent: Sony Corporation
20120065980 - Coding and decoding a transient frame: An electronic device for coding a transient frame is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current transient frame. The electronic device also obtains a residual signal based on the current... Agent: Qualcomm Incorporated
20120065981 - Text presentation apparatus, text presentation method, and computer program product: According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit for presenting the first text; a determination unit for determining whether or not the first text needs to be... Agent: Kabushiki Kaisha Toshiba
20120065982 - Dynamically generating a vocal help prompt in a multimodal application: Dynamically generating a vocal help prompt in a multimodal application that include detecting a help-triggering event for an input element of a VoiceXML dialog, where the detecting is implemented with a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or... Agent: Nuance Communications, Inc.
20120065984 - Decoding device and decoding method: Provided is a decoding device that can reduce abrupt changes in the number of channels in a decoded signal when transmission errors occur as a result of lost frames in an encoding/decoding system for multichannel signals. Said decoding device is also capable of per-sample smoothing and can reduce degradation of... Agent: Panasonic Corporation
20120065983 - Efficient combined harmonic transposition: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular; a system configured to generate a high frequency... Agent: Dolby International Ab03/08/2012 > 16 patent applications in 11 patent subcategories. listing by industry category
20120059644 - Translation apparatus, translation method, computer program, and recording medium: It is possible to improve convenience for a user and obtain translation of an original sentence effectively in real time in accordance with display switching of a page. When a control portion detects a request for display switching of a page made by the user and the request for display... Agent: Sharp Kabushiki Kaisha
20120059645 - Methods and apparatus for teaching or learning vocabulary: Provided herein are methods and apparatus for teaching or learning vocabulary comprising providing a writing instrument and one or more vocabulary words and corresponding definitions displayed on the exterior surface of the writing instrument. Also provided herein are methods and apparatus for learning terms of art or formulas comprising a... Agent:
20120059646 - Script detection service: Script detection service techniques are described. In an implementation, values representing individual text characters in a string of one or more text characters are identified to determine which human writing system is associated with the individual text characters. The values are compared to a table that associates subsets of values... Agent: Microsoft Corporation
20120059647 - Touchless texting exercise: A method, system, and computer program product are provided for touchless texting that enhances user activity. A plurality of graphical images are displayed on a computer display. An exercise motion is detected using a camera, and the motion is resolved to a selected graphical image from the plurality of graphical... Agent: International Business Machines Corporation
20120059649 - Howling canceller: A howling canceller which suppresses occurrence of howling even when an open loop gain exceeds “1” in the whole reproduction band. In the howling canceller, an adaptive filter (107) operates a digital received voice signal with a tap coefficient to generate a pseudo echo; a subtractor (108) subtracts the pseudo... Agent: Yugengaisya Cepstrum
20120059650 - Method and device for the objective evaluation of the voice quality of a speech signal taking into account the classification of the background noise contained in the signal: A method and device are provided for the objective evaluation of voice quality of a speech signal. The device includes: a module for extracting a background noise signal, referred to as a noise signal, from the speech signal; a module for calculating the audio parameters of the noise signal; a... Agent: France Telecom
20120059648 - Voice activity detector (vad) -based multiple-microphone acoustic noise suppression: Acoustic noise suppression is provided in multiple-microphone systems using Voice Activity Detectors (VAD). A host system receives acoustic signals via multiple microphones. The system also receives information on the vibration of human tissue associated with human voicing activity via the VAD. In response, the system generates a transfer function representative... Agent:
20120059652 - Methods and systems for obtaining language models for transcribing communications: A method for transcribing a spoken communication includes acts of receiving a spoken first communication from a first sender to a first recipient, obtaining information relating to a second communication, which is different from the first communication, from a second sender to a second recipient, using the obtained information to... Agent:
20120059651 - Mobile communication device for transcribing a multi-party conversation: A mobile communications device includes a network interface for communicating over a wide-area network, an input/output interface for communicating over a PAN and a display. The communication device also includes one or more processors for executing machine-executable instructions and one or more machine-readable storage media for storing the machine-executable instructions.... Agent: Microsoft Corporation
20120059653 - Methods and systems for obtaining language models for transcribing communications: A method for producing speech recognition results on a device includes receiving first speech recognition results, obtaining a language model, wherein the language model represents information stored on the device, and using the first speech recognition results and the language model to generate second speech recognition results.... Agent:
20120059654 - Speaker-adaptive synthesized voice: An objective is to provide a technique for accurately reproducing features of a fundamental frequency of a target-speaker's voice on the basis of only a small amount of learning data. A learning apparatus learns shift amounts from a reference source F0 pattern to a target F0 pattern of a target-speaker's... Agent: International Business Machines Corporation
20120059655 - Methods and apparatus for providing input to a speech-enabled application program: Some embodiments are directed to allowing a user to provide speech input intended for a speech-enabled application program into a mobile communications device, such as a smartphone, that is not connected to the computer that executes the speech-enabled application program. The mobile communications device may provide the user's speech input... Agent: Nuance Communications, Inc.
20120059656 - Speech signal similarity: A method for determining a similarity between a first audio source and a second audio source includes: for the first audio source, determining a first frequency of occurrence for each of a plurality of phoneme sequences and determining a first weighted frequency for each of the plurality of phoneme sequences... Agent: Nexidia Inc.
20120059657 - Radar microphone speech recognition: A method for detecting and recognizing speech is provided that remotely detects body motions from a speaker during vocalization with one or more radar sensors. Specifically, the radar sensors include a transmit aperture that transmits one or more waveforms towards the speaker, and each of the waveforms has a distinct... Agent:
20120059658 - Methods and apparatus for performing an internet search: Embodiments of the present invention relate to searching for content on the Internet. A user may supply a search query to a device, and the device may issue the search query to a plurality of search engines, including at least one general purpose search engine and at least one site-specific... Agent: Nuance Communications, Inc.
20120059659 - Codebook segment merging: Provided are, among other things, systems, methods and techniques for merging entropy codebook application ranges within an audio signal. According to one embodiment, an audio signal is obtained, the audio signal including quantization indexes, identification of segments of said quantization indexes, and indexes of entropy codebooks that have been assigned... Agent:03/01/2012 > 25 patent applications in 13 patent subcategories. listing by industry category
20120053926 - Interactive input method: A user input is received by a computing device. An interactive input module determines whether the first user input is a first character of a script for a supported language. If the first user input is a first character, the first character is stored in an input buffer. A plurality... Agent: Red Hat, Inc.
20120053927 - Identifying topically-related phrases in a browsing sequence: Browsing sequence phrase identification technique embodiments are presented that generally extract topically-related phrases from the pages visited by a user in a browsing session. The topically-related phrases can be used for a variety of purposes, including aiding a user in re-finding previously visited sites. This phrase identification task is performed... Agent: Microsoft Corporation
20120053929 - Method and mobile device for awareness of language ability: A method and mobile device for awareness of language ability are provided. “Repeated pattern index”-related properties, such as, a vocabulary usage amount, a vocabulary type, or a ratio, a time point, a time length or repeated contents of a repeated voice segment, and “community interaction index”-related properties, such as, a... Agent: Industrial Technology Research Institute
20120053928 - System and method for dynamically applying line breaks in text: A system and method for analyzing a text, and automatically applying line breaks according to set page, text and reader parameters in order to optimize reading speed and comprehension. Line breaks in the text are determined in part using semantic principles, i.e., the strength of the semantic relation between the... Agent:
20120053930 - System and method of providing a spoken dialog interface to a website: Disclosed is a system and method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes converting data from a... Agent: At&t Intellectual Property Ii, L.p.
20120053931 - Speech masking and cancelling and voice obscuration: A non-acoustic sensor is used to measure a user's speech and then broadcasts an obscuring acoustic signal diminishing the user's vocal acoustic output intensity and/or distorting the voice sounds making them unintelligible to persons nearby.... Agent: Lawrence Livermore National Security, LLC
20120053932 - Method and system for automatic transmission of status information: A method for automatic transmission of status information from a first communications terminal set up for speech communication to a second communications terminal set up for text communication is provided. The speech communication between communications terminals is processed over a speech communications server and the text communication between communications terminals... Agent:
20120053933 - Speech synthesizer, speech synthesis method and computer program product: According to one embodiment, a first storage unit stores n band noise signals obtained by applying n band-pass filters to a noise signal. A second storage unit stores n band pulse signals. A parameter input unit inputs a fundamental frequency, n band noise intensities, and a spectrum parameter. A extraction... Agent: Kabushiki Kaisha Toshiba
20120053934 - Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine... Agent: Nuance Communications. Inc.
20120053938 - Advanced voicemail features without carrier voicemail support: In one embodiment, a communication request from a remote requester is intercepted at the computing device. Based on the intercepted communication request, one or more voicemail features are enabled at the computing device, independent of carrier voicemail support. The remote requester may be, for example, a caller or a voicemail... Agent: Google Inc.
20120053937 - Generalizing text content summary from speech content: A text content summary is created from speech content. A focus more signal is issued by a user while receiving the speech content. The focus more signal is associated with a time window, and the time window is associated with a part of the speech content. It is determined whether... Agent: International Business Machines Corporation
20120053935 - Speech recognition model: In one implementation, speech or audio is converted to a searchable format by a speech recognition system. The speech recognition system uses a language model including probabilities of certain words occurring, which may depend on the occurrence of other words or sequences of words. The language model is partially built... Agent: Cisco Technology, Inc.
20120053936 - System and method for generating videoconference transcriptions: A method for generating a transcription of a videoconference includes matching human speech of a videoconference to writable symbols. The human speech is encoded in audio data of the videoconference. The writable symbols are parsed into a plurality of statements. For each statement of the plurality of statements, user profile... Agent: Fujitsu Limited
20120053939 - Speaker verification-based fraud system for combined automated risk score with agent review and associated user interface: Disclosed is method for screening an audio for fraud detection, the method comprising: providing a User Interface (UI) control capable of: a) receiving an audio; b) comparing the audio with a list of fraud audios; c) assigning a risk score to the audio based on the comparison with a potentially... Agent: Victrio
20120053940 - System and operation method for processing door-to-door parcel acceptance information based on voice recognition: Disclosed are a system for processing a door-to-door parcel acceptance and a method thereof. The post office server includes: a voice input block that receives acceptance application and collecting location information for the postal matter by voice and creates corresponding voice data; a door-to-door parcel information process block that acquires... Agent: Electronics And Telecommunications Research Institute
20120053941 - Wireless voice activation apparatus for surgical lasers: A wireless voice activation surgical laser system utilizing wireless transmitter receivers. The present invention integrates wireless communication between a surgical laser, a voice recognition device, and a microphone to allow surgeons to verbally activate or deactivate a surgical laser. The voice recognition device is able to recognize a surgeons commands... Agent:
20120053942 - Information processing apparatus, information processing method, and program: Provided is an information processing apparatus including a pre-score adjustment portion which calculates a pre-score based on context information obtained as observation information, for an intention model as a unit corresponding to each of a plurality of types of intention information registered in advance; a multi-matching portion which determines the... Agent:
20120053943 - Voice dialing using a rejection reference: A voice dialing method includes the steps of receiving an utterance from a user, decoding the utterance to identify a recognition result for the utterance, and communicating to the user the recognition result. If an indication is received from the user that the communicated recognition result is incorrect, then it... Agent: General Motors LLC
20120053945 - Belief tracking and action selection in spoken dialog systems: An action is performed in a spoken dialog system in response to a user's spoken utterance. A policy which maps belief states of user intent to actions is retrieved or created. A belief state is determined based on the spoken utterance, and an action is selected based on the determined... Agent: Honda Motor Co., Ltd.
20120053944 - Method for determining compressed state sequences: A compressed state sequence s is determined directly from the input sequence of data x. A deterministic function ƒ(x) only tracks unique state transitions, and not the dwell times in each state. A polynomial time compressed state sequence inference method outperforms conventional compressed state sequence inference techniques.... Agent:
20120053946 - Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis: In response to a word of a text sequence, a first part-of-speech (POS) tag is generated using a statistical part-of-speech (POS) tagger based on a corpus of trained text sequences, each representing a likely POS of a word for a given text sequence. A second POS tag is generated using... Agent: Apple Inc.
20120053947 - Web browser implementation of interactive voice response instructions: Web browser implementable instructions are generated from interactive voice instructions that are not natively interpreted by web browsers. Generating web browser implementable instructions in this manner allows for faster and cheaper deployment of voice, video, and/or data services by allowing legacy services based on interactive voice instructions to function seamlessly... Agent: Openwave Systems Inc.
20120053950 - Encoding device, decoding device, and methods therein: Disclosed are an encoding device, a decoding device, and methods therein which eliminate at an early stage the loss of synchronization of the adaptive filters of a terminal at the encoding end and a terminal at the decoding end caused by transmission errors such as packet losses, and suppress deterioration... Agent: Panasonic Corporation
20120053949 - Encoding device, decoding device, encoding method, decoding method and program therefor: There is provided a coding technique capable of reducing the amount of computation in coding while maintaining the efficiency of the coding. The technique uses an input signal and one of a decoded signal decoded from a first code obtained by encoding the input signal and a decoded signal obtained... Agent: Nippon Telegraph And Telephone Corp.
20120053948 - Sparse data compression: The invention relates to compressing of sparse data sets contains sequences of data values and position information therefor. The position information may be in the form of position indices defining active positions of the data values in a sparse vector of length N. The position information is encoded into the... Agent:Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130509:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 1.0834 seconds