|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
03/2011 | Recent | 13: May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression March listing by industry category 03/11Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 03/31/2011 > 16 patent applications in 15 patent subcategories. listing by industry category
20110077933 - Multiple language/media translation optimization: A mechanism is provided for optimizing a language/media translation map. A user input is received comprising an input language/media selection, one or more output languages/medias selections, and a threshold for at least one of accuracy or throughput of one or more requested language/media translations. For each of the one or... Agent: International Business Machines Corporation
20110077934 - Language translation in an environment associated with a virtual application: Methods and apparatus for language translation in a computing environment associated with a virtual application are presented. For example, a method for providing language translation includes determining languages of a user and a correspondent; determining one or more sequences of translators; determining a selected sequence of selected translators from the... Agent: International Business Machines Corporation
20110077935 - Apparatus and methods for user generated translation: Disclosed are methods and apparatus for enabling user communities around the world to engage in translation of web properties while using such web properties. In certain embodiments, a translation interface is provided with a served web property to allow users to submit translations for user interface (UI) strings from a... Agent: Yahoo! Inc.
20110077936 - System and method for generating vocabulary from network data: A method is provided in one example and includes receiving data propagating in a network environment and separating the data into one or more fields. At least some of the fields are evaluated in order to identify nouns and noun phrases within the fields. The method also includes identifying selected... Agent: Cisco Technology, Inc.
20110077937 - Electronic apparatus with dictionary function and computer-readable medium: An electronic apparatus includes a storage which includes dictionary information, a conjugation chart database which stores conjugation charts for a language stored in the dictionary information so as to cause the charts to correspond to conjugation chart numbers, and a verb-verb conjugation chart correspondence table which stores the conjugation chart... Agent: Casio Computer Co., Ltd.
20110077938 - Data reproduction method and data reproduction apparatus: A reproduction apparatus that reproduces compressed audio data recorded in a recording medium inserts dummy data between data to be concatenated and reproduces the data when performing a specific reproduction of the data obtained by concatenating data which are discontinuously read from the recording medium.... Agent: Panasonic Corporation
20110077939 - Model-based distortion compensating noise reduction apparatus and method for speech recognition: A model-based distortion compensating noise reduction apparatus for speech recognition, includes: a speech absence probability calculator for calculating the probability distribution for absence and existence of a speech using the sound absence and existence information for the frames; a noise estimation updater for estimating a more accurate noise component by... Agent: Electronics And Telecommunications Research Institute
20110077940 - Speech encoding: A method system and program for encoding and decoding a speech signal including error correction data. The method comprises: receiving a speech signal comprising successive frames, for each of a plurality of frames of the speech signal, analysing the speech signal to determine side information and a residual signal, encoding... Agent:
20110077941 - Enabling spoken tags: Techniques for assigning a spoken tag in a telecom web platform are provided. The techniques include receiving a spoken tag, comparing the spoken tag to a set of one or more template tags, if the spoken tag is a match to a template tag, assigning the spoken tag and updating... Agent: International Business Machines Corporation
20110077942 - System and method for handling repeat queries due to wrong asr output: Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for handling expected repeat speech queries or other inputs. The method causes a computing device to detect a misrecognized speech query from a user, determine a tendency of the user to repeat speech queries based on previous user interactions, and... Agent: At&t Intellectual Property I, L.p.
20110077943 - System for generating language model, method of generating language model, and program for language model generation: A first system for generating a language model is a system for generating a language model including: a topic history dependent language model storing unit; a topic history accumulation unit; and a language score calculation unit. In the system for generating the language model, a language score corresponding to history... Agent: Nec Corporation
20110077944 - Speech recognition module and applications thereof: A speech recognition module includes an acoustic front-end module, a sound detection module, and a word detection module. The acoustic front-end module generates a plurality of representations of frames from a digital audio signal and generates speech characteristic probabilities for the plurality of frames. The sound detection module determines a... Agent: Broadcom Corporation
20110077945 - Flexible parameter update in audio/speech coded signals: This invention relates to a method, a computer program product, apparatuses and a system for extracting coded parameter set from an encoded audio/speech stream, said audio/speech stream being distributed to a sequence of packets, and generating a time scaled encoded audio/speech stream in the parameter coded domain using said extracted... Agent: Nokia Corporation
20110077946 - Deriving geographic distribution of physiological or psychological conditions of human speakers while preserving personal privacy: A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices;... Agent: International Business Machines Corporation
20110077947 - Conference bridge software agents: Systems and methods are provided to generate a software agent that is initiated to continue the business process flow during a conference. Upon initiating a teleconference in response to a selection associated with the business process or predefined rule associated with the business process that requires a conference, an instance... Agent: Avaya, Inc.
20110077948 - Method and system for containment of usage of language interfaces: Client software is modified by a translator to use unique variant of linguistic interface of a service. An interceptor pre-processes subsequent client service requests from translated unique linguistic interface to standard linguistic interface implemented by service. Usage of linguistic interfaces of service is contained, rendering service incapable of executing arbitrary... Agent: Mcafee, Inc. A Delaware Corporation03/24/2011 > 23 patent applications in 16 patent subcategories. listing by industry category
20110071818 - Man-machine interface for real-time forecasting user's input: A man-machine interface is disclosed. The circle is divided into several angle cells. The required inputted content is placed in the cells, the inputted option direction of motion is detected in real time, the content which the user want to input is forecasted and inputted according to the content in... Agent:
20110071817 - System and method for language identification: A system and method for training a language classifier are disclosed that may include obtaining an initial dictionary-based classifier model, stored in a computer memory, the model including a plurality of classifier n-grams; pruning away selected ones of the n-grams that do not significantly affect a performance of the classifier... Agent:
20110071819 - Apparatus, system, and method for natural language processing: Various embodiments are described for searching and retrieving documents based on a natural language input. A computer-implemented natural language processor electronically receives a natural language input phrase from an interface device. The natural language processor attributes a concept to the phrase with the natural language processor. The natural language processor... Agent:
20110071820 - Voice-quality evaluating system, communication system, test management apparatus, and test communication apparatus: A voice-quality evaluating system, in a secure network that allows a voice packet to pass, transmits and receives communication information for a voice quality testing between a test management apparatus and a test communication apparatus connected to the network and between the test communication apparatuses, for the voice quality testing... Agent: Fujitsu Limited
20110071821 - Receiver intelligibility enhancement system: Embodiments of the invention provide a communication device and methods for enhancing audio signals. A first audio signal buffer and a second audio signal buffer are acquired. Thereafter, the second audio signal is processed based on the linear predictive coding coefficients and gains based on noise power of the first... Agent:
20110071822 - Selective audio/sound aspects: Certain aspects relate to providing an at least one audio source to at least one user. Certain aspects relate to selectively modifying an at least one first sound source to be provided to the at least one user, wherein the at least one first sound source is combined with an... Agent: Searete LLC, A Limited Liability Corporation Of The State Of Delaware
20110071823 - Speech recognition system, speech recognition method, and storage medium storing program for speech recognition: A purpose is to suppress recognition process delay generated due to load in signal processing. Included is a speech input means 10 that inputs a speech signal, an output evaluation means 20 that evaluates whether or not the speech signal input by the speech input means 10 is the speech... Agent:
20110071825 - Device, method and program for voice detection and recording medium: To this end, a voice detection device includes a band-based power calculation unit that calculates a total of signal power values (sub-band power) of signals entered from the microphones from one preset frequency width (sub-band) to another. The voice detection device also includes a band-based noise estimation unit that estimates... Agent:
20110071824 - Systems and methods for multiple pitch tracking: An apparatus includes a function module, a strength module, and a filter module. The function module compares an input signal, which has a component, to a first delayed version of the input signal and a second delayed version of the input signal to produce a multi-dimensional model. The strength module... Agent:
20110071827 - Generation and selection of speech recognition grammars for conducting searches: Various processes are disclosed for generating and selecting speech recognition grammars for conducting searches by voice. In one such process, search queries are selected from a search query log for incorporation into speech recognition grammar. The search query log may include or consist of search queries specified by users without... Agent:
20110071826 - Method and apparatus for ordering results of a query: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from... Agent: Motorola, Inc.
20110071828 - System and method of speech discriminability assessment, and computer program thereof: A speech discriminability assessment system includes: a biological signal measurement section for measuring an electroencephalogram signal of a user; a presented-speech sound control section for determining a speech sound to be presented to the user by referring to a speech sound database retaining a plurality of monosyllabic sound data; an... Agent:
20110071829 - Image processing apparatus, speech recognition processing apparatus, control method for speech recognition processing apparatus, and computer-readable storage medium for computer program: An image processing apparatus includes a speech input portion that receives an input of speech from a user, a dictionary storage portion that stores a dictionary configured by phrase information pieces for recognizing the speech, a compound phrase generation portion that generates a plurality of compound phrases formed by all... Agent: Konica Minolta Business Technologies, Inc.
20110071830 - Combined lip reading and voice recognition multimodal interface system: The present invention provides a combined lip reading and voice recognition multimodal interface system, which can issue a navigation operation instruction only by voice and lip movements, thus allowing a driver to look ahead during a navigation operation and reducing vehicle accidents related to navigation operations during driving. The combined... Agent: Kia Motors Corporation
20110071832 - Image display device, method, and program: It is an object of the present invention to make an act of viewing an image interactive and further enriched. A microphone 18 inputs a voice signal of a voice uttered by a viewer who is viewing a display image displayed on a display portion 17, and causes the voice... Agent: Casio Computer Co., Ltd.
20110071831 - Method and system for localizing and authenticating a person: The present invention refers to a method for localizing a person comprising the steps carried out in a computing system (1): determining (20) the localization of a telecommunication means (3, 6, 8) or determining a telecommunication means (3, 6, 8) at a specific location; this can be implemented using ANI... Agent: Agnitio, S.l.
20110071833 - Speech retrieval apparatus and speech retrieval method: Disclosed are a speech retrieval apparatus and a speech retrieval method for searching, in a speech database, for an audio file matching an input search term by using an acoustic model serialization code, a phonemic code, a sub-word unit, and a speech recognition result of speech. The speech retrieval apparatus... Agent:
20110071834 - System and method for improving text input in a shorthand-on-keyboard interface: A word pattern recognition system improves text input entered via a shorthand-on-keyboard interface. A core lexicon comprises commonly used words in a language; an extended lexicon comprises words not included in the core lexicon. The system only directly outputs words from the core lexicon. Candidate words from the extended lexicon... Agent:
20110071835 - Small footprint text-to-speech engine: Embodiments of small footprint text-to-speech engine are disclosed. In operation, the small footprint text-to-speech engine generates a set of feature parameters for an input text. The set of feature parameters includes static feature parameters and delta feature parameters. The small footprint text-to-speech engine then derives a saw-tooth stochastic trajectory that... Agent: Microsoft Corporation
20110071836 - System and method for generalized preselection for unit selection synthesis: Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for unit selection synthesis. The method causes a computing device to add a supplemental phoneset to a speech synthesizer front end having an existing phoneset, modify a unit preselection process based on the supplemental phoneset, preselect units from the supplemental... Agent: At&t Intellectual Property I, L.p.
20110071837 - Audio signal correction apparatus and audio signal correction method: According to one embodiment, an audio signal correction apparatus has a characteristic extraction module configured to determine whether an input audio signal is a monaural signal or a stereo signal, on the basis of channel information, and to extract a plurality of characteristic parameters for determining whether the input audio... Agent:
20110071838 - System and methods for recognizing sound and music signals in high noise and distortion: A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at... Agent:
20110071839 - Method and apparatus for encoding audio data: A method for processing audio data includes determining a first common scalefactor value for representing quantized audio data in a frame. A second common scalefactor value is determined for representing the quantized audio data in the frame. A line equation common scalefactor value is determined from the first and second... Agent:03/17/2011 > 20 patent applications in 17 patent subcategories. listing by industry category
20110066421 - User-interactive automatic translation device and method for mobile device: A user-interactive automatic translation device for a mobile device, includes: a camera image controller for converting an image captured by a camera into a digital image; an image character recognition controller for user-interactively selecting a character string region to be translated from the digital image, performing a character recognition function... Agent: Electronics And Telecommunications Research Institute
20110066422 - Apparatus and method for realtime remote interpretation: The most preferred embodiments of the present invention are configured to allow a foreign language interpreter to remotely monitor, control, and interpret various legal proceedings for one or more remote locations, such as courtrooms. The interpreter will use a computer-based system to monitor and control the audio-video functions and communications... Agent:
20110066423 - Speech-recognition system for location-aware applications: An apparatus and associated methods are disclosed that enable a speech-recognition system to perform functions related to the geo-locations of wireless telecommunications terminal users. In accordance with the illustrative embodiment, a geo-spatial grammar is employed that comprises rules concerning the geo-locations of users, and a speech-recognition system uses the geo-spatial... Agent: Avaya Inc.
20110066424 - Text stitching from multiple images: A reading machine has processing for detecting common text between a pair of individual images. The reading machine combines the text from the pair of images into a file or data structure if common text is detected, and determines if incomplete text phrases are present in the common text. If... Agent: K-nfb Reading Technology, Inc.
20110066425 - Systems, methods, and apparatus for automated mapping and integrated workflow of a controlled medical vocabulary: Systems, methods, and apparatus provide clinical terminology services including a controlled medical vocabulary supplemented by local clinical content. An example method includes accessing an initial controlled medical vocabulary including at least one external terminology via a vocabulary management server; processing local clinical content including unstructured local clinical content provided via... Agent:
20110066426 - Real-time speaker-adaptive speech recognition apparatus and method: A speech recognition apparatus and method for real-time speaker adaptation are provided. The speech recognition apparatus may estimate a pitch of a speech section from an inputted speech signal, extract a speech feature for speech recognition based on the estimated pitch, and perform speech recognition with respect to the speech... Agent: Samsung Electronics Co., Ltd.
20110066427 - Receiver intelligibility enhancement system: Embodiments of the invention provide a communication device and methods for enhancing audio signals. A first audio signal buffer and a second audio signal buffer are acquired. Thereafter, the magnitude spectrum calculated from the Fast Fourier Transform (FFT) of the second audio signal is processed based on the Linear Predictive... Agent:
20110066428 - System for adaptive voice intelligibility processing: An adaptive audio system can be implemented in a communication device. The adaptive audio system can enhance voice in an audio signal received by the communication device to increase intelligibility of the voice. The audio system can adapt the audio enhancement based at least in part on levels of environmental... Agent: Srs Labs, Inc.
20110066430 - Robust noise estimation: An enhancement system improves the estimate of noise from a received signal. The system includes a spectrum monitor that divides a portion of the signal at more than one frequency resolution. Adaptation logic derives a noise adaptation factor of the received signal. A plurality of devices tracks the characteristics of... Agent: Qnx Software Systems Co.
20110066429 - Voice activity detector and a method of operation: A voice activity detector (100) includes a frame divider (201) for dividing frames of an input signal into consecutive sub-frames, an energy level estimator (202) for estimating an energy level of the input signal in each of the consecutive sub-frames, a noise eliminator (203) for analyzing the estimated energy levels... Agent: Motorola, Inc.
20110066432 - Content filtering for a digital audio signal: According to some embodiments, content filtering is provided for a digital audio signal.... Agent:
20110066431 - Hand-held input apparatus and input method for inputting data to a remote receiving device: A hand-held input apparatus includes an input unit, a translator and a wireless transmitter. The input unit generates an input signal. The translator receives the input signal from the input unit, converts the input signal to a meaningful text and translates the meaningful text to a translated signal according to... Agent: Mediatek Inc.
20110066433 - System and method for personalization of acoustic models for automatic speech recognition: Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and... Agent: At&t Intellectual Property I, L.p.
20110066434 - Method for speech recognition on all languages and for inputing words using speech recognition: The invention can recognize all languages and input words. It needs m unknown voices to represent m categories of known words with similar pronunciations. Words can be pronounced in any languages, dialects or accents. Each will be classified into one of m categories represented by its most similar unknown voice.... Agent:
20110066435 - Image transmitting apparatus, image transmitting method, and image transmitting program embodied on computer readable medium: An MFP includes an accepting portion to accept an image and a speech, a speech recognition portion to recognize the accepted speech, a display screen generating portion, in response to an event that a keyword included in a predetermined output setting is recognized by the speech recognition portion, to generate... Agent: Konica Minolta Business Technologies, Inc.
20110066436 - Speaker intent analysis system: A speaker intent analysis system and method for validating the truthfulness and intent of a plurality of participants' responses to questions. A computer stores, retrieves, and transmits a series of questions to be answered audibly by participants. The participants' answers are received by a data processor. The data processor analyzes... Agent: The Bezar Family Irrevocable Trust
20110066437 - Methods and apparatus to monitor media exposure using content-aware watermarks: Methods and apparatus to construct and transmit content-aware watermarks are disclosed herein. An example method of creating a content-aware watermark includes selecting at least one word associated with a media composition; representing the word with at least one phonetic notation; obtaining a proxy code for each phonetic notation; and locating... Agent:
20110066438 - Contextual voiceover: A method for providing voice feedback with playback of media on an electronic device is provided. In one embodiment, the method may include determining one or more characteristics of the media with which the voice feedback is associated. For instance, the media may include a song, and the determined characteristics... Agent: Apple Inc.
20110066439 - Dimension measurement system: A dimension measurement system is provided. The dimension measurement system includes a speech I/O device fit in an ear canal of a worker, generating a voice signal from vibration in the air emitted from an eardrum of the worker and propagated inside the ear canal, and outputting the voice signal... Agent:
20110066440 - Audio signal encoding employing interchannel and temporal redundancy reduction: A method of encoding a time-domain audio signal is presented. A device transforms the time-domain signal into a frequency-domain signal including a sequence of sample blocks, wherein each block includes a coefficient for each of multiple frequencies. The coefficients of each block are grouped into frequency bands. For each frequency... Agent: Sling Media Pvt Ltd03/10/2011 > 17 patent applications in 11 patent subcategories. listing by industry category
20110060583 - Automatic translation system based on structured translation memory and automatic translation method using the same: Provided are an automatic translation system based on structured translation memory and an automatic translation method using the same. In the automatic translation system, a translation memory establishment module changes a predetermined language pattern into a part translation pattern and registers the changed part translation pattern in a structured translation... Agent: Electronics And Telecommunications Research Institute
20110060584 - Error correction using fact repositories: The disclosed system and method apply stores of factual information to correct errors in digital text, for example, generated from OCR, speech and/or handwriting recognition devices, and other automatic recognition devices. A text produced by OCR, speech recognition, handwriting recognition, and others may be processed to extract discussed facts. Databases... Agent: International Business Machines Corporation
20110060585 - Inputting method by predicting character sequence and electronic device for practicing the method: The present invention relates to a method of predicting and entering a character string and an electronic device in which the method is implemented. The method of predicting and entering a character string includes a step (S10) of entering a first letter and a second letter in an entry device... Agent:
20110060586 - Voice application network platform: A distributed voice applications system includes a voice applications rendering agent and at least one voice applications agent that is configured to provide voice applications to an individual user. A management system may control and direct the voice applications rendering agent to create voice applications that are personalized for individual... Agent:
20110060587 - Command and control utilizing ancillary information in a mobile voice-to-speech application: In embodiments of the present invention improved capabilities are described for controlling a mobile communication facility utilizing ancillary information comprising accepting speech presented by a user using a resident capture facility on the mobile communication facility while the user engages an interface that enables a command mode for the mobile... Agent:
20110060588 - Method and system for automatic speech recognition with multiple contexts: A method and a system for activating functions including a first function and a second function, wherein the system is embedded in an apparatus, are disclosed. The system includes a control configured to be activated by a plurality of activation styles, wherein the control generates a signal indicative of a... Agent:
20110060589 - Multi-purpose contextual control: A method and a system for activating functions including a first function and a second function, wherein the system is embedded in an apparatus, are disclosed. The system includes a control configured to be activated by a plurality of activation styles, wherein the control generates a signal indicative of a... Agent:
20110060590 - Synthetic speech text-input device and program: A synthetic speech text-input device is provided that allows a user to intuitively know an amount of an input text that can be fit in a desired duration. A synthetic speech text-input device 1 includes: an input unit that receives a set duration in which a speech to be synthesized... Agent: Jujitsu Limited
20110060591 - Issuing alerts to contents of interest of a conference: A method, system, and computer program product for issuing an alert in response to detecting a content of interest in a conference. A listening logic comprising multiple conference engines monitors speakers, topics, and words spoken during a conference. A speech-to-text engine monitors the conference and records a transcription. A word... Agent: International Business Machines Corporation
20110060592 - Iptv system and service method using voice interface: Provided is an IPTV system using voice interface which includes a voice input device, a voice processing device, a query processing and content search device, and a content providing device. The voice processing device performs voice recognition to convert voice into a text. The voice processing device includes a voice... Agent:
20110060594 - Apparatus and method for adaptive audio coding: An audio encoder capable of implementing a plurality of encoding functions, wherein an adaptation controller adjusts the implementation of the encoding functions in response to feedback received by the adaptation controller during use. The adjustment may involve adapting encoding algorithms or selecting alternative encoding algorithms. The encoder may also include... Agent: Apt Licensing Limited
20110060595 - Apparatus and method for adaptive audio coding: An audio encoder capable of implementing a plurality of encoding functions, wherein an adaptation controller adjusts the implementation of the encoding functions in response to feedback received by the adaptation controller during use. The adjustment may involve adapting encoding algorithms or selecting alternative encoding algorithms. The encoder may also include... Agent: Apt Licensing Limited
20110060596 - Method for decoding an audio signal that has a base layer and an enhancement layer: An audio signal may have a BL and an EL, wherein the EL represents additional information for enhancing the quality of the BL audio content. Decoding of such dual-layer signals usually comprises partial decoding of the BL data, wherein frequency bins of the BL are restored, mapping the restored frequency... Agent: Thomson Licensing
20110060597 - Multi-channel audio encoding and decoding: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying... Agent: Microsoft Corporation
20110060593 - Output circuit for audio codec chip: An output circuit for audio codec chip includes a noise eliminating circuit electrically coupled to the audio codec chip for eliminating noise signals. The noise eliminating circuit includes a first switch and a second switch. When the audio codec chip output signals jump from low voltage level to high voltage... Agent: Hon Hai Precision Industry Co., Ltd.
20110060598 - Adaptive grouping of parameters for enhanced coding efficiency: The present invention is based on the finding that parameters including: a first set of parameters of a representation of a first portion of an original signal and a second set of parameters of a representation of a second portion of the original signal can be efficiently encoded when the... Agent: Fraunhofer-gesellschaft Zur Forderung Der Angewandten Forschung E.v.
20110060599 - Method and apparatus for processing audio signals: Methods and apparatuses for encoding and decoding an audio signal are provided, a method of encoding an audio signal including: receiving the audio signal including information about a moving sound source; receiving position information about the moving sound source; generating dynamic track information indicating motion of the moving sound source... Agent: Samsung Electronics Co., Ltd.03/03/2011 > 38 patent applications in 18 patent subcategories. listing by industry category
20110054880 - External content transformation: Techniques and systems for content transformation between devices are disclosed. In one aspect, a system includes a host device that sends content to client devices, and client devices that receive content from the host device in one format and transform the content into a different format. The client devices present... Agent:
20110054881 - Mechanism for local language numeral conversion in dynamic numeric computing: A mechanism for local language numeral conversion in dynamic numeric computing is disclosed. A method of embodiments of the invention includes receiving a string array of numeric data in a local language, wherein the numeric data used in dynamic calculations performed by the application, converting characters of the string array... Agent:
20110054882 - Mechanism for identifying invalid syllables in devanagari script: A mechanism for identifying invalid syllables in Devanagari script is disclosed. A method of embodiments of the invention includes receiving Devanagari text from an application of a computing device for parsing, determining a character type for a character of the Devanagari text, determining a new state associated with the character... Agent:
20110054883 - Speech understanding system using an example-based semantic representation pattern: A speech understanding apparatus includes: a speech recognition unit for recognizing an input speech to produce a speech recognition result; a sentence analysis unit for performing morpheme analysis on a sentence corresponding to the speech recognition result, extracting additional information, and performing syntax analysis; a hierarchy describing unit for describing... Agent:
20110054884 - System for assisting in drafting applications: This invention relates to a Method and System for assisting in drafting applications comprising a server (1), with a processing device (2), a memory device (3) either directly or indirectly connected to said server (1) and software (4) installed on said server (1), wherein said memory device (3) includes information... Agent:
20110054885 - Device and method for a bandwidth extension of an audio signal: For a bandwidth extension of an audio signal, in a signal spreader the audio signal is temporally spread by a spread factor greater than 1. The temporally spread audio signal is then supplied to a demicator to decimate the temporally spread version by a decimation factor matched to the spread... Agent:
20110054886 - Effect device: An effect device may be configured such that when an input audio signal switches from a consonant to a vowel and an input level of the switched vowel is greater than a threshold value Lc (and a variable t is greater than time Ts), an audio effect signal A may... Agent:
20110054887 - Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience: In one embodiment the present invention includes a method of improving audibility of speech in a multi-channel audio signal. The method includes comparing a first characteristic and a second characteristic of the multi-channel audio signal to generate an attenuation factor. The first characteristic corresponds to a first channel of the... Agent:
20110054888 - Device, method and system for detecting unwanted conversational media session: Some embodiments of the invention relate to a method and a system for detecting unwanted conversational media session data. In accordance with one aspect of the invention, a method of detecting unwanted conversation media session data according to some embodiments of the invention may include calculating two or more progressive... Agent:
20110054889 - Enhancing receiver intelligibility in voice communication devices: The intelligibility of speech signals is improved in the many situations where a voice signal is communicated or stored. Means and methods are disclosed for developing a scheme with high voice signal intelligibility without sacrifice of voice quality. The disclosed method comprises certain steps, including, but not limited to: Learning... Agent:
20110054890 - Apparatus and method for audio mapping: A mobile phone, and corresponding method, which is arranged to detect sounds of different types and to indicate to a user the direction from which those sounds are coming from. The mobile phone includes a microphone for recording sound and a display for providing feedback to the user. The phone... Agent:
20110054892 - System for detecting speech interval and recognizing continuous speech in a noisy environment through real-time recognition of call commands: The present invention relates to a continuous speech recognition system that is very robust in a noisy environment. In order to recognize continuous speech smoothly in a noisy environment, the system selects call commands, configures a minimum recognition network in token, which consists of the call commands and mute intervals... Agent:
20110054899 - Command and control utilizing content information in a mobile voice-to-speech application: In embodiments of the present invention improved capabilities are described for controlling a mobile communication facility utilizing content information comprising accepting speech presented by a user using a resident capture facility on the mobile communication facility while the user engages an interface that enables a command mode for the mobile... Agent:
20110054900 - Hybrid command and control between resident and remote speech recognition facilities in a mobile voice-to-speech application: In embodiments of the present invention improved capabilities are described for hybrid command and control between resident and remote speech recognition facilities in controlling a mobile communication facility comprising accepting speech presented by a user using a resident capture facility on the mobile communication facility while the user engages an... Agent:
20110054898 - Multiple web-based content search user interface in mobile search application: In embodiments of the present invention improved capabilities are described for a multiple web-based content search user interface in searching for web content on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion... Agent:
20110054896 - Sending a communications header with voice recording to send metadata for use in speech recognition and formatting in mobile dictation application: In embodiments of the present invention improved capabilities are described for sending a communications header with voice recording to send metadata for use in speech recognition and formatting when converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility... Agent:
20110054894 - Speech recognition through the collection of contact information in mobile dictation application: In embodiments of the present invention improved capabilities are described for improving speech recognition through the collection of contact information on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion of the captured... Agent:
20110054893 - System and method for generating user models from transcribed dialogs: Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for generating personalized user models. The method includes receiving automatic speech recognition (ASR) output of speech interactions with a user, receiving an ASR transcription error model characterizing how ASR transcription errors are made, generating guesses of a true transcription and... Agent:
20110054897 - Transmitting signal quality information in mobile dictation application: In embodiments of the present invention improved capabilities are described for transmitting signal quality information when converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion of the captured... Agent:
20110054895 - Utilizing user transmitted text to improve language model in mobile dictation application: In embodiments of the present invention improved capabilities are described for utilizing user transmitted text to improve language modeling in converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a... Agent:
20110054901 - Method and apparatus for aligning texts: A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and a method of automatically... Agent:
20110054902 - Singing voice synthesis system, method, and apparatus: A singing voice synthesis system is provided. The storage unit stores at least one tune. The tempo unit provides a set of tempo cues in accordance with a selected tune from the at least one tune. The input unit receives a plurality of original voice signals corresponding to the selected... Agent:
20110054903 - Rich context modeling for text-to-speech engines: Embodiments of rich text modeling for speech synthesis are disclosed. In operation, a text-to-speech engine refines a plurality of rich context models based on decision tree-tied Hidden Markov Models (HMMs) to produce a plurality of refined rich context models. The text-to-speech engine then generates synthesized speech for an input text... Agent:
20110054904 - Electronic shopping assistant with subvocal capability: A mobile device suitable for use by a user in a store includes a subvocal message (SVM) module to detect an SVM from the user. The SVM includes data that indicates an item in the store. A transmitter transmits a request after detecting the SVM. The request includes information indicating... Agent:
20110054905 - Voice interactive service system and method for providing different speech-based services: A voice interactive service system provides different speech-based services to a plurality of users. Using a communication terminal, the services are accessed via a telecommunication network through service-specific connectivity ports. The system comprises processing cores which have different configurations of speech processing resources for performing different services. For performing a... Agent:
20110054906 - Multimedia keepsake with customizable content: a multimedia playback keepsake and a method for its production are provided. The keepsake includes a control processor and a playback processor. The control processor is associated with storage containing a content program to be presented and contains interrupts corresponding to points in the content program where custom content information... Agent:
20110054907 - Audio interface unit for supporting network services: Techniques for providing network services at an audio interface unit include determining, based on spoken sounds of a user of an apparatus received at a microphone of the apparatus, whether to present audio data received from a different apparatus. If it is determined to present the received audio data, then... Agent:
20110054908 - Image processing system, image processing apparatus and information processing apparatus: An image processing system includes an information processing apparatus and an image processing apparatus connected to each other via a network. The information processing apparatus has an application installed thereon to give a new function to the image processing apparatus. The image processing apparatus transmits to the information processing apparatus,... Agent:
20110054909 - Localizing the position of a source of a voice signal: The invention relates to localizing the position of a person speaking by using pictures of a pattern (21) on an object (20) worn by the person. The object (20) carries a complex pattern (21) that is optimized for determining the orientation of the object (20), the distance from the object... Agent:
20110054910 - System and method for automatic temporal adjustment between music audio signal and lyrics: A system provided herein may perform automatic temporal alignment between music audio signal and lyrics with higher accuracy than ever. A non-fricative section extracting 4 extracts non-fricative sound sections, where no fricative sounds exist, from the music audio signal. An alignment portion 17 includes a phone model 15 for singing... Agent:
20110054913 - Asynchronous sampling rate converter for audio applications: In recent years, it has become commonplace for portable devices to generate analog audio signals from numerous sources, meaning that the codecs employed in these portable devices need to be able to utilize various digital bit streams at different sampling rates. To date, however, the circuitry for asynchronous sampling rate... Agent:
20110054915 - Computing circuits and method for running an mpeg-2 aac or mpeg-4 aac audio decoding algorithm on programmable processors: The present invention relates to computing circuits and method for running an MPEG-2 AAC or MPEG-4 AAC algorithm efficiently, which is used as an audio compression algorithm in multi-channel high-quality audio systems, on programmable processors. In accordance with the present invention, the IMDCT process which takes large part of the... Agent:
20110054911 - Enhanced audio decoder: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such... Agent:
20110054914 - Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels... Agent:
20110054916 - Multi-channel audio encoding and decoding: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying... Agent:
20110054912 - System and method of storing telephone conversations: A method and system of storing telephone conversation data to a third party database storage unit is disclosed. The method includes detecting a telephone call initiated in a mobile telephone, recording telephone conversation data, detecting a termination of the telephone call, and transferring the recorded telephone conversation data to a... Agent:
20110054917 - Apparatus and method for structuring bitstream for object-based audio service, and apparatus for encoding the bitstream: Provided are a method and apparatus for structuring a bitstream for an object-based audio service, and an apparatus for encoding the bitstream. A method of structuring a bitstream, may include: configuring the bitstream by separating the bitstream into a file header and frames of audio objects that are separated using... Agent:Previous industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130516:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 0.88423 seconds