|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
01/2012 | Recent | 13: May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression January recently filed with US Patent Office 01/12Below are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 01/26/2012 > 32 patent applications in 18 patent subcategories. recently filed with US Patent Office
20120022850 - Statistical machine translation processing: A method of statistical machine translation (SMT) is provided. The method comprises generating reordering knowledge based on the syntax of a source language (SL) and a number of alignment matrices that map sample SL sentences with sample target language (TL) sentences. The method further comprises receiving a SL word string... Agent: Microsoft Corporation
20120022852 - Apparatus, system, and method for computer aided translation: An apparatus for assisting a human translator includes a source text module, a translator workspace module, a parsing module, a selection module, and a glossary module. The source text module receives source text in a source language. The translator workspace module displays a translator workspace field that is editable by... Agent:
20120022851 - On-demand translation of application text: Embodiments of the present invention provide a method, system and computer program product for on-demand translation of text. In an embodiment of the invention, a method for on-demand translation of text can include receiving in a dynamic translation module executing in memory by at least one processor of a host... Agent: International Business Machines Corporation
20120022853 - Multi-modal input on an electronic device: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken... Agent:
20120022856 - Browsing of contextual information: Systems and methods for searching and browsing a data store of contextually related data objects. The system includes a search/browse module that receives a search query. The search/browse module identifies data objects that match the search query and generates sentences from data objects that are contextually related to the matching... Agent: Radiant Logic, Inc.
20120022858 - Handheld electronic device and associated method employing a multiple-axis input device and providing a learning function in a text disambiguation environment: A handheld electronic device includes a reduced QWERTY keyboard and is enabled with disambiguation software. The device provides output in the form of a default output and a number of variants. The output is based largely upon the frequency, i.e., the likelihood that a user intended a particular output, but... Agent: Research In Motion Limited
20120022854 - Information processing device, information processing method, and information processing program: An apparatus and method provide logic for processing information. In one implementation, an apparatus includes a receiving unit configured to receive a selection of displayed content from a user. An obtaining unit is configured to obtain data corresponding to the selection. The data includes text data. An identification unit is... Agent:
20120022855 - Searching and browsing of contextual information: Systems and methods for searching and browsing a data store of contextually related data objects. The system includes a search/browse module that receives a search query. The search/browse module identifies data objects that match the search query and generates sentences from data objects that are contextually related to the matching... Agent: Radiant Logic, Inc.
20120022857 - System and method for a cooperative conversational voice user interface: A cooperative conversational voice user interface is provided. The cooperative conversational voice user interface may build upon short-term and long-term shared knowledge to generate one or more explicit and/or implicit hypotheses about an intent of a user utterance. The hypotheses may be ranked based on varying degrees of certainty, and... Agent: Voicebox Technologies, Inc.
20120022859 - Automatic marking method for karaoke vocal accompaniment: An automatic marking method for Karaoke vocal accompaniment is provided. In the method, pitch, beat position and volume of a singer are compared with the original pitch, beat position and volume of the theme of a song to generate a score of pitch, a score of beat and a score... Agent:
20120022860 - Speech and noise models for speech recognition: An audio signal generated by a device based on audio input from a user may be received. The audio signal may include at least a user audio portion that corresponds to one or more user utterances recorded by the device. A user speech model associated with the user may be... Agent: Google Inc.
20120022861 - Parallel entropy encoder and parallel entropy decoder: An entropy encoder block for use in a context adaptive encoder and an entropy decoder block for use in a context adaptive decoder is presented. The encoder block includes a plurality of encoding elements, for processing encoding search tree look tables corresponding to encoding probabilities used by the context adaptive... Agent: Certicom Corp.
20120022862 - Speech recognition circuit and method: A speech recognition circuit comprising a circuit for providing state identifiers which identify states corresponding to nodes or groups of adjacent nodes in a lexical tree, and for providing scores corresponding to said state identifiers, the lexical tree comprising a model of words; a memory structure for receiving and storing... Agent:
20120022863 - Method and apparatus for voice activity detection: s
20120022864 - Method and device for classifying background noise contained in an audio signal: Embodiments of methods and devices for classifying background noise contained in an audio signal are disclosed. In one embodiment, the device includes a module for extracting from the audio signal a background noise signal, termed the noise signal. Also included is a second that calculates a first parameter, termed the... Agent: France Telecom
20120022866 - Language model selection for speech-to-text conversion: Methods, computer program products and systems are described for converting speech to text. Sound information is received at a computer server system from an electronic device, where the sound information is from a user of the electronic device. A context identifier indicates a context within which the user provided the... Agent:
20120022867 - Speech to text conversion: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are... Agent:
20120022865 - System and method for efficiently reducing transcription error using hybrid voice transcription: A system and method for efficiently reducing transcription error using hybrid voice transcription is provided. A voice stream is parsed from a call into utterances. An initial transcribed value and corresponding recognition score are assigned to each utterance. A transcribed message is generated for the call and includes the initial... Agent:
20120022868 - Word-level correction of speech input: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word... Agent: Google Inc.
20120022869 - Acoustic model adaptation using geographic information: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, adapting one or... Agent: Google, Inc.
20120022870 - Geotagged environmental audio for enhanced speech recognition accuracy: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an... Agent: Google, Inc.
20120022871 - Speech recognition circuit using parallel processors: A speech recognition circuit comprises a memory containing lexical data for word recognition, the lexical data comprising a plurality of lexical data structures stored in each of a plurality of parts of the memory; and a parallel processor structure connected to the memory to process speech parameters by performing parallel... Agent:
20120022872 - Automatically adapting user interfaces for hands-free interaction: A user interface for a system such as a virtual assistant is automatically adapted for hands-free use. A hands-free context is detected via automatic or manual means, and the system adapts various stages of a complex interactive system to modify the user experience to reflect the particular limitations of such... Agent: Apple Inc.
20120022874 - Disambiguation of contact information using historical data: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for disambiguating contact information. A method includes receiving an audio signal, generating an affinity score based on a frequency with which a user has previously communicated with a contact associated with an item of contact information, and... Agent: Google Inc.
20120022873 - Speech recognition language models: Methods, computer program products and systems are described for forming a speech recognition language model. Multiple query-website relationships are determined by identifying websites that are determined to be relevant to queries using one or more search engines. Clusters are identified in the query-website relationships by connecting common queries and connecting... Agent:
20120022875 - Synchronizing visual and speech events in a multimodal application: Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon... Agent: Nuance Communications, Inc.
20120022876 - Voice actions on computing devices: A computer-implemented method includes receiving spoken input at a computing device from a user of the computing device, the spoken input including a carrier phrase and a subject to which the carrier phrase is directed, providing at least a portion of the spoken input to a server system in audio... Agent: Google Inc.
20120022877 - Dynamic range improvement technique: Apparatus and methods are disclosed for detecting and progressively attenuating specific frequencies prevalent in an audio signal. In contrast to conventional wide-band enhancement techniques over long time frames, narrow bandwidths and short attenuation times employed are commensurate with resonances and timing typical of speech. Apparent dynamic range is therefore increased... Agent:
20120022879 - Methods and apparatus for embedding codes in compressed audio data streams: Methods and apparatus for embedding codes in compressed audio data streams are disclosed. An example apparatus disclosed herein to embed a code in a compressed audio data stream comprises an unpacking unit to determine a plurality of transform coefficients associated with the compressed audio data stream, the plurality of transform... Agent:
20120022878 - Signal de-noising method, signal de-noising apparatus, and audio decoding system: In the field of audio encoding/decoding technologies, a signal de-noising method is provided. The method includes: selecting, according to a degree of inter-frame correlation of a frame where a spectral coefficient to be adjusted resides, at least two spectral coefficients having high correlation with the spectral coefficient to be adjusted;... Agent: Huawei Technologies Co., Ltd.
20120022880 - Forward time-domain aliasing cancellation using linear-predictive filtering: In a coder, a method for producing forward aliasing cancellation (FAC) parameters for cancelling time-domain aliasing caused to a coded audio signal in a first transform-coded frame by a transition between the first transform-coded frame using a first coding mode with overlapping window and a second frame using a second... Agent:
20120022881 - Audio encoder, audio decoder, encoded audio information, methods for encoding and decoding an audio signal and computer program: An audio decoder for providing a decoded audio information on the basis of an encoded audio information includes a window-based signal transformer configured to map a time-frequency representation, which is described by the encoded audio information, to a time-domain representation. The window-based signal transformer is configured to select a window,... Agent:01/19/2012 > 26 patent applications in 17 patent subcategories. recently filed with US Patent Office
20120016655 - Dynamic language translation of web site content: Methods, systems, and computer readable medium for providing translated web content. A request is received from a user for content in a second language translated from content in a first language from a first Internet source. The content in the first language is obtained and divided into one or more... Agent:
20120016656 - Dynamic language translation of web site content: Methods, systems, and computer readable medium for providing a translated message in telecommunications. A request is received for translating a current message in a first language into a current message in a second language destined for a user. Whether the current message in the first language has been previously translated... Agent:
20120016657 - Method of and a system for translation: A translation system for translating source text from a first language to target text in a second language. The system comprises a translation memory (TM) module that stores translation segments. The TM module is operable to generate a TM target text output in response to source text. A statistical translation... Agent: Dublin City University
20120016658 - Input method editor: Methods, systems, and apparatus, including computer program products, in which an input method editor receives graphemes in a first writing system and identifies lexical items in a second writing system based on the graphemes in the first writing system. In one implementation, a method is provided. The method includes receiving... Agent: Google Inc.
20120016659 - Display apparatus for work machine and language rewriting system for display apparatus: The invention provides for making it possible to readily display information relating to a work machine, even when the mode of national language differs between the user and the vendor. The invention is characterized by a user information displaying means for displaying user information relating to the work machine to... Agent: Kubota Corporation
20120016660 - Parsing culturally diverse names: Provided are techniques for parsing a name. A name to be parsed is received. A culture of the name is identified. One or more name phrases from the name are identified. Statistics for the one or more name phrases are identified. It is determined whether to perform a first parsing... Agent: International Business Machines Corporation
20120016663 - Identifying related names: Provided are techniques for identifying related names. A collection of names from different languages is stored, wherein each of the names has a native orthographic form and a romanized form. An input name is received in a known encoding scheme. An alphabet of the input name is determined based on... Agent: International Business Machines Corporation
20120016664 - Language analysis apparatus, language analysis method, and language analysis program: A language analysis apparatus of the invention includes division rules, each of which is classified into one of levels according to the degree of risk of causing analysis accuracy problems when applied; a division point candidate generation unit 21 which, when a character string whose length is greater than the... Agent: Nec Corporation
20120016662 - Method and apparatus for processing biometric information using distributed computation: An approach is provided for providing biometric information processing using distributed computation. A biometric information processing infrastructure determines to receive an input including, at least in part, biometric information. The biometric information processing infrastructure selects one or more analyses for processing the input. The biometric information processing infrastructure also determines... Agent: Nokia Corporation
20120016661 - System, method and device for intelligent textual conversation system: A method of intelligent textual markup in an information exchange includes: determining semantic elements in said information exchange; determining relations between said semantic elements; representing said semantic elements as nodes in a directed graph; and representing said relations as edges connecting said nodes. A data processing system for enabling a... Agent:
20120016665 - Sound masking system and masking sound generation method: In a masking sound generation apparatus, a CPU analyzes a speech utterance speed of a received sound signal. Then, the CPU copies the received sound signal into a plurality of sound signals and performs the following processing on each of the sound signals. Namely, the CPU divides each of the... Agent: Yamaha Corporation
20120016666 - Audiovisual (av) device and control method thereof: According to one embodiment, an AV device comprises a receiving section, a processing section, a storage section and a control section. The receiving section receives a digital voice signal. The processing section applies a predetermined signal processing operation to the digital voice signal received by the receiving section. The storage... Agent:
20120016668 - Energy envelope perceptual correction for high band coding: In accordance with an embodiment, A method of encoding an audio bitstream at an encoder includes encoding an original low band signal at the encoder by using a closed loop analysis-by-synthesis approach to obtain a coded low band signal, encoding an original high band signal at the encoder by using... Agent: Futurewei Technologies, Inc.
20120016667 - Spectrum flatness control for bandwidth extension: In accordance with an embodiment, a method of decoding an encoded audio bitstream at a decoder includes receiving the audio bitstream, decoding a low band bitstream of the audio bitstream to get low band coefficients in a frequency domain, and copying a plurality of the low band coefficients to a... Agent: Futurewei Technologies, Inc.
20120016669 - Apparatus and method for voice processing and telephone apparatus: A voice processing apparatus includes a voice signal acquiring unit that acquires a voice signal converted to plural frequency bands from an input signal having a narrowed band; an expanding unit that generates based on a narrowband component of the voice signal acquired by the voice signal acquiring unit, an... Agent: Fujitsu Limited
20120016670 - Methods and apparatuses for identifying audible samples for use in a speech recognition capability of a mobile device: Techniques for provided which may be implemented using various methods and/or apparatuses in a mobile device to allow for speech recognition based, at least in part, on context information associated with at least a portion of at least one navigational region, e.g., associated with a location of the mobile device.... Agent: Qualcomm Incorporated
20120016671 - Tool and method for enhanced human machine collaboration for rapid and accurate transcriptions: A system and methods for transcribing text from audio and video files including a set of transcription hosts and an automatic speech recognition system. ASR word-lattices are dynamically selected from either a text box or word-lattice graph wherein the most probable text sequences are presented to the transcriptionist. Secure transcriptions... Agent:
20120016672 - Systems and methods for assessment of non-native speech using vowel space characteristics: Computer-implemented systems and methods are provided for assessing non-native speech proficiency. A non-native speech sample is processed to identify a plurality of vowel sound boundaries in the non-native speech sample. Portions of the non-native speech sample are analyzed within the vowel sound boundaries to extract vowel characteristics. The vowel characteristics... Agent:
20120016673 - Speaker recognition via voice sample based on multiple nearest neighbor classifiers: A speaker recognition system generates a codebook store with codebooks representing voice samples of speaker, referred to as trainers. The speaker recognition system may use multiple classifiers and generate a codebook store for each classifier. Each classifier uses a different set of features of a voice sample as its features.... Agent: Microsoft Corporation
20120016674 - Modification of speech quality in conversations over voice channels: Techniques are disclosed for modifying speech quality in a conversation over a voice channel. For example, a method for modifying a speech quality associated with a spoken utterance transmittable over a voice channel comprises the following steps. The spoken utterance is obtained prior to an intended recipient of the spoken... Agent: International Business Machines Corporation
20120016675 - Broadcast system using text to speech conversion: A broadcast signal receiver comprises a text data receiver for receiving broadcast text data for display to a user in relation to a user interface; a text-to-speech (TTS) converter for converting received text data into an audio speech signal, the TTS converter being operable to detect whether a word for... Agent: Sony Europe Limited
20120016677 - Method and device for audio signal classification: The present invention discloses a method and a device for audio signal classification, and relates to the field of communications technologies, which solve a problem of high complexity of type classification of audio signals in the prior art. In the present invention, after an audio signal to be classified is... Agent: Huawei Technologies Co., Ltd.
20120016676 - System and method for writing digits in words and pronunciation of numbers, fractions, and units: Disclosed is a system and method for converting a digital number to text and for pronouncing the digital number. The system includes a filtration system for determining whether the digital number has nonnumeric symbols and for generating a filtrated number, an analyzing system for analyzing the filtrated number, a composition... Agent: King Abdulaziz City For Science And Technology
20120016678 - Intelligent automated assistant: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone,... Agent: Apple Inc.
20120016679 - Adapting masking thresholds for encoding audio data: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated... Agent:
20120016680 - Audio decoder and decoding method using efficient downmixing: A method, an apparatus, a computer readable storage medium configured with instructions for carrying out a method, and logic encoded in one or more computer-readable tangible medium to carry out actions. The method is to decode audio data that includes N.n channels to M.m decoded audio channels, including unpacking metadata... Agent:01/12/2012 > 23 patent applications in 16 patent subcategories. recently filed with US Patent Office
20120010870 - Electronic dictionary and dictionary writing system: Described herein is a computer implemented method for creating content for electronic dictionaries. An exemplary system includes a user interface, entry filtration system, and interface tools for dictionary entry comparison, entry merge, and visual markup of changes. Many dictionaries may be accessed and used in one user interface window. A... Agent:
20120010869 - Visualizing automatic speech recognition and machine: An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a source language, and a machine translation (MT) engine is used for translating this text data to text data in... Agent: International Business Machines Corporation
20120010871 - Information processing apparatus, method of controlling the same, and program: An information processing apparatus includes a display apparatus that is provided with a button that can be used by users of different native language-types with names that are registered as character strings in language-types displayable on the display apparatus. Accordingly, when a user switches language-types to be displayed, the button... Agent: Canon Kabushiki Kaisha
20120010875 - Classifying text via topical analysis, for applications to speech recognition: An assignment device (1) assigns word class information (WKI) to one or more words of text information (ETI). Based on word-class sequence information (WK-AI) formed from this assigned word class information (WKI), actions (A) are executed in order to notify the user of conflicts or to provide the user with... Agent: Nuance Communications Austria Gmbh
20120010874 - Method and system for providing a representative phrase based on keyword searches: Provided is a method and system for providing a representative phrase with respect to a real time popular keyword, which may determine programs including a popular keyword from broadcast information, and may generate a representative phrase with respect to the popular keyword using the determined programs, thereby providing the representative... Agent: Nhn Corporation
20120010872 - Method and system for semantic searching: In one embodiment, there is provided a computer-implemented method and system for implementing the method. The method comprises: preliminarily analyzing at least one corpus of natural language text comprising for each sentence of each natural language text of the corpus, performing syntactic analysis using linguistic descriptions to generate at least... Agent: Abbyy Software Ltd
20120010873 - Sentence translation apparatus and method: Disclosed herein are a sentence translation apparatus and method. The sentence translation apparatus includes a voice recognition unit, a morphemic part-of-speech tagging unit, a pause extraction unit, and a sentence separation unit. The voice recognition unit creates a sentence in a first language based on results of recognition of a... Agent: Electronics And Telecommunications Research Institute
20120010876 - Voice integration platform: A voice integration platform and method provide for integration of a voice interface with a data system that includes stored data. The voice integration platform comprises one or more generic software components, the generic software components being configured to enable development of a specific voice user interface that is designed... Agent: Ben Franklin Patent Holding LLC
20120010877 - System and method for performing speech synthesis with a cache of phoneme sequences: Disclosed are systems, methods, and computer readable media for performing speech synthesis. The method embodiment comprises applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences, for each of... Agent: At&t Intellectual Property Ii, L.p.
20120010878 - Communication apparatus: Provided is a communication apparatus for direct communication between networks of different types. The communication apparatus includes a transmission data selector determining whether or not data input from a first communication network is speech data, a data processor digitizing and packetizing the data transferred from the transmission data selector, and... Agent: Electronics And Telecommunications Research Institute
20120010879 - Speech encoding/decoding device: A linear prediction coefficient of a signal represented in a frequency domain is obtained by performing linear prediction analysis in a frequency direction by using a covariance method or an autocorrelation method. After the filter strength of the obtained linear prediction coefficient is adjusted, filtering may be performed in the... Agent: Ntt Docomo, Inc.
20120010880 - Apparatus, method and computer program for generating a representation of a bandwidth-extended signal on the basis of an input signal representation using a combination of a harmonic bandwidth-extension and a non-harmonic bandwidth-extension: An apparatus for generating a representation of a bandwidth-extended signal on the basis of an input signal representation includes a phase vocoder configured to obtain values of a spectral domain representation of a first patch of the bandwidth-extended signal on the basis of the input signal representation. The apparatus also... Agent:
20120010882 - Constrained and controlled decoding after packet loss: A technique is described herein for reducing audible artifacts in an audio output signal generated by decoding a received frame in a series of frames representing an encoded audio signal in a predictive coding system. In accordance with the technique, it is determined if the received frame is one of... Agent: Broadcom Corporation
20120010881 - Monaural noise suppression based on computational auditory scene analysis: The present technology provides a robust noise suppression system which may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. An acoustic signal may be received and transformed to cochlear domain sub-band signals. Features such as pitch may be identified and tracked... Agent:
20120010883 - Transcription data extraction: A computer program product, for performing data determination from medical record transcriptions, resides on a computer-readable medium and includes computer-readable instructions for causing a computer to obtain a medical transcription of a dictation, the dictation being from medical personnel and concerning a patient, analyze the transcription for an indicating phrase... Agent: Escription, Inc.
20120010884 - Systems and methods for manipulating electronic content based on speech recognition: Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network;... Agent: Aol, Inc.
20120010885 - System and method for unsupervised and active learning for automatic speech recognition: A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for training acoustic and language models and an increase in the performance given the transcribed and un-transcribed data.... Agent: At&t Intellectual Property Ii, L.p.
20120010886 - Language identification: A language identification system suitable for use with voice data transmitted through either a telephonic or computer network systems is presented. Embodiments that automatically select the language to be used based upon the content of the audio data stream are presented. In one embodiment the content of the data stream... Agent:
20120010887 - Speech recognition and voice training data storage and access methods and apparatus: Embodiments include a speech recognition system and a personal speech profile data (PSPD) storage device that is physically distinct from the speech recognition system. In the speech recognition system, a PSPD interface receives voice training data, which is associated with an individual, from the PSPD storage device. A speech input... Agent: Honeywell International Inc.
20120010888 - Method and system for speech synthesis and advertising service: Methods and systems for providing a network-accessible text-to-speech synthesis service are provided. The service accepts content as input. After extracting textual content from the input content, the service transforms the content into a format suitable for high-quality speech synthesis. Additionally, the service produces audible advertisements, which are combined with the... Agent:
20120010889 - Voice interaction method of mobile terminal based on voicexml and mobile terminal: The present invention discloses a voice interaction method of a mobile terminal based on VoiceXML and a mobile terminal, which comprises: converting received voice information into a VoiceXML document, parsing the VoiceXML document according to a preset VoiceXML document framework, searching the information of the function which needs to be... Agent:
20120010890 - Power-optimized wireless communications device: The present invention is an Always On, Hands-free, Speech Activated, Power-optimized Wireless Communications Device with associated base. The unique value of the device is that a person can use the device at any time, 24×7, with hands-free operation. People can wear it 24×7 on their body either around their neck... Agent:
20120010891 - Apparatus and method for encoding/decoding multichannel signal: An apparatus and method for encoding/decoding a multi-channel signal may be provided. The apparatus of encoding a multi-channel signal may insert information about whether to encode a phase parameter indicating phase information of a plurality of channels, included in the multi-channel signal, in a bitstream of the multi-channel signal. The... Agent: Samsung Electronics Co., Ltd.01/05/2012 > 24 patent applications in 12 patent subcategories. recently filed with US Patent Office
20120004898 - Contextual input method: A input method selects a character from a plurality of characters of a logographic script, and identifies characters proximate the selected character. One or more candidate characters are then selected based on a composition input and the proximate characters.... Agent: Google Inc.
20120004899 - Dynamic ad selection for ad delivery systems: Systems and systems are disclosed for a portable device that employs voice recognition and/or encoding/decoding techniques which may be employed to gather, analyze and identify the media's content class, language being spoken, topic of conversation and/or other information which may be useful in selecting targeted advertisements. The portable device uses... Agent:
20120004900 - Method for automatically setting language types of push-based services, client, and server thereof: A method for automatically setting language types of push-based services is applied to a client, and includes the steps of: receiving a beacon signal which indicating a push-based service; reading a language setting of the client; generating a language code according to the language setting; transmitting the language code; and... Agent:
20120004902 - Computerized selection for healthcare services: A method for producing healthcare data records from graphical inputs by computer users. Includes generating a plurality of user input categories, displaying on a graphical display icons that correspond to a first of the user input categories and receiving a first user selection of a first icon of the plurality... Agent: Zeus Data Solutions
20120004904 - Method and system for providing representative phrase: A method and system for providing a representative phrase corresponding to a real time (current time) popular keyword. The method and system may extend a representative criterion word, determined by analyzing morphemes of words in documents grouped into a cluster, and may combine the extended representative criterion word and the... Agent: Nhn Corporation
20120004901 - Phonetic keys for the japanese language: Various embodiments of phonetic keys for the Japanese language are described herein. A Kana rule set is applied to Kana characters provided by a user. The Kana characters are defined in an alphabetic language based on the sound of the Kana characters. A full phonetic key is then generated based... Agent:
20120004903 - Rule generation: A method for implementing at least one rule for an application is described. The method includes receiving an input rule. Based on the input rule, a program executable code is generated. The generated program executable code can then be associated with the application.... Agent: Tata Consultancy Services Limited
20120004905 - Techniques for creating computer generated notes: Text is extracted from and information resource such as documents, emails, relational database tables and other digitized information sources. The extracted text is processed using a decomposition function to create. Nodes are a particular data structure that stores elemental units of information. The nodes can convey meaning because they relate... Agent: Make Sence, Inc.
20120004906 - Method for separating signal paths and use for improving speech using electric larynx: In order to improve the speech quality of an electric larynx (EL) speaker, the speech signal of which is digitized by suitable means, the following steps are carried out: a) dividing a single-channel speech signal into a series of frequency channels by transferring it from a time domain into a... Agent:
20120004907 - System and method for biometric acoustic noise reduction: Embodiments of the invention provide a communication device and methods for generating enhanced audio signals. An audio signal comprising a speech signal and a noise signals is acquired at the communication device. A noise processor of the communication device detects a pitch estimation of the audio signal. Thereafter, the audio... Agent:
20120004908 - Voice recognition terminal: A voice recognition terminal executes a local voice recognition process and utilizes an external center voice recognition process. The terminal includes: a voice message synthesizing element for synthesizing at least one of a voice message to be output from a speaker according to the external center voice recognition process and... Agent: Denso Corporation
20120004909 - Speech audio processing: A speech processing engine is provided that in some embodiments, employs Kalman filtering with a particular speaker's glottal information to clean up an audio speech signal for more efficient automatic speech recognition.... Agent:
20120004911 - Method and apparatus for identifying video program material or content via nonlinear transformations: A system for identification of video content in a video signal is provided via a sound track audio signal. The audio signal is processed with filtering and non linear transformations to extract voice signals from the sound track channel. The extracted voice signals are coupled to a speech recognition system... Agent: Rovi Technologies Corporation
20120004910 - System and method for speech processing and speech to text: Systems and method for processing speech from a user is disclosed. In the system of the present invention, the user's speech is received as input audio stream. The input audio stream is converted text that corresponds to the input audio stream. The converted text is converted to an echo audio... Agent:
20120004912 - Method and system for using input signal quality in speech recognition: A method and system for using input signal quality in an automatic speech recognition system. The method includes measuring the quality of an input signal into a speech recognition system and varying a rejection threshold of the speech recognition system at runtime in dependence on the measurement of the input... Agent: Nuance Communications, Inc.
20120004914 - Audio human verification: A system generates an audio challenge that includes a first voice and one or more second voices, the first voice being audibly distinguishable, by a human, from the one or more second voices. The first voice conveys first information and the second voice conveys second information. The system provides the... Agent: Tell Me Networks C/o Microsoft Corporation
20120004913 - Method and apparatus for controlling operation of portable terminal using microphone: A method for controlling an operation of a portable terminal using a microphone includes detecting an operation mode of the portable terminal and driving an audio recognition mode according to the detected operation mode to activate the microphone, converting a signal, inputted through the microphone, into digital data and detecting... Agent: Samsung Electronics Co., Ltd.
20120004915 - Conversational speech analysis method, and conversational speech analyzer: The invention provides a conversational speech analyzer which analyzes whether utterances in a meeting are of interest or concern. Frames are calculated using sound signals obtained from a microphone and a sensor, sensor signals are cut out for each frame, and by calculating the correlation between sensor signals for each... Agent:
20120004916 - Speech signal processing device: A speech signal processing device is equipped with a power acquisition unit, a probability distribution acquisition unit, and a correspondence degree determination unit. The power acquisition unit accepts an inputted speech signal and, based on the accepted speech signal, acquires power representing the intensity of a speech sound represented by... Agent: Nec Corporation
20120004917 - Audible post-it system: An audible post-it system includes a post-it note printed with an index and an optical reading and recording device having an optical module, a switch, a storage device, an audio recording device, an audio playing device and a processor. The optical reading and recording device reads an image of the... Agent: Generalplus Technology Inc.
20120004920 - Data embedding system: A data hiding system is described for hiding data within an audio signal. The system can be used for watermarking, data communications, audience surveying etc. The system hides data in an audio signal by adding artificial echoes whose polarity varies with the data to be hidden. In one embodiment, each... Agent: Intrasonics S.a.r.l.
20120004918 - Full-band scalable audio codec: A scalable audio codec for a processing device determines first and second bit allocations for each frame of input audio. First bits are allocated for a first frequency band, and second bits are allocated for a second frequency band. The allocations are made on a frame-by-frame basis based on the... Agent: Plycom, Inc.
20120004921 - Method for retrieving audio signal stored on photograph: A method of decoding coded data provided on a photograph using a reader includes steps of irradiating an image-side of the photograph from a first end thereof to an opposing second end thereof with infra-red illumination; receiving infra-red illumination reflected from the image; processing the reflected infra-red illumination to locate... Agent: Silverbrook Research Pty Ltd.
20120004919 - Three-dimensional glasses with bluetooth audio decode: Audio associated with three-dimensional image content is enabled to be heard by a user without interfering with other users. A device network includes a display system (master) and a wearable device (slave). The wearable device includes a glasses frame, earphones, and left and right eye shuttering lenses. The wearable device... Agent: Broadcom CorporationPrevious industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130516:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 0.687 seconds