|Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents - Monitor Patents|
USPTO Class 704 | Browse by Industry: Previous - Next | All
Recent | 13: Jun | May | Apr | Mar | Feb | Jan | 12: Dec | Nov | Oct | Sep | Aug | July | June | May | April | Mar | Feb | Jan | 11: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | 10: Dec | Nov | Oct | Sep | Aug | Jul | Jun | May | Apr | Mar | Feb | Jan | | 09: Dec | Nov | Oct | Sep | Aug | Jl | Jn | May | Apr | Mar | Fb | Jn | | 2008 | 2007 |
Data processing: speech signal processing, linguistics, language translation, and audio compression/decompressionBelow are recently published patent applications awaiting approval from the USPTO. Recent week's RSS XML file available below.
Listing for abstract view: USPTO application #, Title, Abstract excerpt,Patent Agent. Listing format for list view: USPTO National Class full category number, title of the patent application. 06/13/2013 > 34 patent applications in 19 patent subcategories.
20130151231 - Multi-lingual knowledge base: Mechanisms and methods for enabling customers to manage multi-lingual knowledge bases, so that end users can access articles based on a language the end user chooses, while also providing publishers with tools to manage articles in different languages and to translate them, either using an external vendor or leveraging in... Agent: Salesforce.com Inc.
20130151232 - System and method for enriching spoken language translation with dialog acts: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for enriching spoken language translation with dialog acts. The method includes receiving a source speech signal, tagging dialog acts associated with the received source speech signal using a classification model, dialog acts being domain independent descriptions of an intended action... Agent: At&t Intellectual Property I, L.p.
20130151230 - Techniques for assisting a human translator in translating a document including at least one tag: A computer-implemented method technique includes receiving, at a server, a document including at least one tag. The technique replaces each tag of the document with a placeholder to obtain a modified document. The technique obtains a machine translation of the modified document to obtain a first translated document. The technique... Agent: Google Inc.
20130151233 - Automatic language sensitive, event based activity feeds: A post is generated that identifies different types of activity in a computer system, such as changes to the data in the computer system. The post is generated in a language-neutral way. An activity feed generator generates a language-specific post and distributes it, in an activity feed, to a set... Agent: Microsoft Corporation
20130151234 - Techniques for input of a multi-character compound consonant or vowel and transliteration to another language using a touch computing device: A technique is presented for fast input of multi-character compound consonants and vowels on a touch computing device. The technique provides for fast input of multi-character compound consonants and vowels by enabling a user to touch an initial character on a first layout of characters, then slide his/her finger in... Agent: Google Inc.
20130151236 - Computer implemented semantic search methodology, system and computer program product for determining information density in text: A method, computer program product and system are disclosed for determining the semantic density of textualized digital media is (a measure of how much information is conveyed in a sentence or clause relative to its length). The more semantically dense text is, the more information it conveys in a given... Agent:
20130151237 - Dynamic method for emoticon translation: A vehicle communication system is provided and may include at least one communication device that audibly communicates information within the vehicle. A controller may receive a character string from an external device and may determine if the character string represents an emoticon. The controller may translate the character string into... Agent: Chrysler Group LLC
20130151238 - Generation of natural language processing model for an information domain: Embodiments relate to a method, apparatus and program product and for generating a natural language processing model for an information domain. The method derives a skeleton of a natural language lexicon from a source model and uses it to form a dictionary. It also applies a set of syntactical rules... Agent: International Business Machines Corporation
20130151240 - Interactive fact checking system: A fact checking system is able to verify the correctness of information and/or characterize information by comparing the information with one or more sources. The fact checking system automatically monitors, processes, fact checks information and indicates a status of the information. The fact checking system is able to be interactive... Agent:
20130151235 - Linguistic key normalization: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic... Agent: Google Inc.
20130151239 - Orthographical variant detection apparatus and orthographical variant detection program: Provided is an orthographical variant detection apparatus which detects orthographical variant candidates with a high precision. The orthographical variant detection apparatus includes a term extraction unit that extracts terms from document data, a similarity computation unit that computes similarity of an arbitrary pair of the extracted terms, an orthographical variant... Agent: Kabushiki Kaisha Toshiba
20130151241 - Method of embedding digital information into audio signal machine-readable storage medium and communication terminal: A method for embedding digital information into an audio signal, is provided. The method includes dividing the digital information into low-priority data and high-priority data; dividing the audio signal into first and second signal parts; embedding at least one echo signal into the first signal part; embedding a communication signal... Agent: Samsung Electronics Co., Ltd.
20130151242 - Method to select active channels in audio mixing for multi-party teleconferencing: An apparatus comprising an ingress port configured to receive a signal comprising a plurality of encoded audio signals corresponding to a plurality of sources; and a processor coupled to the ingress port and configured to calculate a parameter for each of the plurality of encoded audio signals, wherein each parameter... Agent: Futurewei Technologies, Inc.
20130151243 - Voice modulation apparatus and voice modulation method using the same: A voice modulation apparatus is provided. The voice modulation apparatus includes an audio signal input unit which receives an audio signal from an external source; an extraction unit which extracts property information relating to a voice from the audio signal; a storage unit which stores the extracted property information; a... Agent: Samsung Electronics Co., Ltd.
20130151244 - Harmonicity-based single-channel speech quality estimation: Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation... Agent: Microsoft Corporation
20130151246 - Adaptive voice activity detection: Encoding audio signals with selecting an encoding mode for encoding the signal categorizing the signal into active segments having voice activity and non-active segments having substantially no voice activity by using categorization parameters depending on the selected encoding mode and encoding at least the active segments using the selected encoding... Agent: Core Wireless Licensing S.a.r.i.
20130151247 - Method and device for suppressing residual echoes: The present invention discloses a method and a device for suppressing residual echoes. The method comprises: performing adaptive filtering on M transmitter signals respectively to obtain M adaptive filtered signals; performing array-filtering on the M−1 adaptive filtered signals other than the first adaptive filtered signal to obtain M−1 array-filter output... Agent: Goertek Inc.
20130151248 - Apparatus, system, and method for distinguishing voice in a communication stream: An apparatus for distinguishing a voice is described. In one embodiment, the apparatus includes a server with a communication interface, a frame generator, and a sound analyzer. The communication interface processes an incoming communication stream with an echo canceller to cancel echo in the communication stream. The frame generator operates... Agent:
20130151249 - Information presentation device, information presentation method, information presentation program, and information transmission system: An information presentation device includes an audio signal input unit configured to input an audio signal, an image signal input unit configured to input an image signal, an image display unit configured to display an image indicated by the image signal, a sound source localization unit configured to estimate direction... Agent: Honda Motor Co., Ltd.
20130151251 - Automatic dialog replacement by real-time analytic processing: An automated method and apparatus for automatic dialog replacement having an optional I/O interface converts an A/V stream into a format suitable for automated processing. The I/O interface feeds the A/V stream to a dubbing engine for generating new dubbed dialog from said A/V stream. A dubber/slicer replaces the original... Agent: Advanced Micro Devices, Inc.
20130151250 - Hybrid speech recognition: Described is a technology by which speech is locally and remotely recognized in a hybrid way. Speech is input and recognized locally, with remote recognition invoked if locally recognized speech data was not confidently recognized. The part of the speech that was not confidently recognized is sent to the remote... Agent: Lenovo (singapore) Pte. Ltd
20130151252 - System and method for standardized speech recognition: Disclosed herein are systems, methods, and computer-readable storage media for selecting a speech recognition model in a standardized speech recognition infrastructure. The system receives speech from a user, and if a user-specific supervised speech model associated with the user is available, retrieves the supervised speech model. If the user-specific supervised... Agent: At&t Intellectual Property I, L.p.
20130151253 - System and method for targeted tuning of a speech recognition system: A system and method of targeted tuning of a speech recognition system are disclosed. A particular method includes detecting that a frequency of occurrence of a particular type of utterance satisfies a threshold. The method further includes tuning a speech recognition system with respect to the particular type of utterance.... Agent: At&t Intellectual Property I, L.p. (formerly Known As Sbc Knowledge Ventures, L.p.)
20130151254 - Speech recognition using speech characteristic probabilities: A speech recognition module includes an acoustic front-end module, a sound detection module, and a word detection module. The acoustic front-end module generates a plurality of representations of frames from a digital audio signal and generates speech characteristic probabilities for the plurality of frames. The sound detection module determines a... Agent: Broadcom Corporation
20130151255 - Method and device for extending bandwidth of speech signal: A method for extending a bandwidth of a speech signal received, according to an embodiment of the present invention, includes: transforming the received speech signal into a frequency domain by decoding the received speech signal; normalizing the transformed speech signal; differentiating a voiced sound period or unvoiced sound period from... Agent: Gwangju Institute Of Science And Technology
20130151256 - System and method for singing synthesis capable of reflecting timbre changes: Herein provided is a system for singing synthesis capable of reflecting not only pitch and dynamics changes but also timbre changes of a user's singing. A spectral transform surface generating section 119 temporally concatenates all the spectral transform curves estimated by a second spectral transform curve estimating section 117 to... Agent: National Institute Of Advanced Industrial Science And Technology
20130151257 - Apparatus and method for providing emotional context to textual electronic communication: An apparatus and method for including emotional context in textual electronic communication transmissions. The emotional context is conveyed symbolically through standardized alternations in the manner in which the text is displayed without the inclusion of additional graphics, thereby increasing the communicative value of textual electronic communication. An important advantage of... Agent:
20130151258 - Context based online advertising: A software and/or hardware facility for inferring user context and delivering advertisements, such as coupons, using natural language and/or sentiment analysis is disclosed. The facility may infer context information based on a user's emotional state, attitude, needs, or intent from the user's interaction with or through a mobile device. The... Agent: Microsoft Corporation
20130151259 - User interface that reflects social attributes in user notifications: Arrangements described herein relate to providing an audio message. A calendar event can be detected. A voice relating to the detected calendar event can be automatically selected. A background sound relating to the detected calendar event can be automatically selected. The audio message can be generated using the automatically selected... Agent: Motorola Mobility, LLC
20130151261 - Analog signal transfer system, variable compressor, and variable expander: An analog signal transfer system includes a transmission apparatus including a variable compressor that variably compresses input signals exponentially according to the amplitudes of the input signals; and a reception apparatus including a variable expander that variably expands the compressed signals exponentially according to the amplitudes of the compressed signals.... Agent:
20130151260 - Apparatus and method for audio encoding: A method and apparatus provides for encoding an audio signal. A bit rate value is received. A set of energy thresholds based on the bit rate value is selected. The set of energy thresholds is one of a plurality of sets of energy thresholds. The energy thresholds of each set... Agent: Motorola Mobility, Inc.
20130151263 - Method and device for processing audio signals: The present invention provides a method for processing audio signals, and the method comprises the steps of: receiving input audio signals corresponding to a plurality of spectral coefficients; obtaining location information that indicates a location of a particular spectral coefficient among said spectral coefficients, on the basis of energy of... Agent: Lg Electronics Inc.
20130151262 - Resampling output signals of qmf based audio codecs: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio... Agent: Fraunhofer-gesellschaft Zur Foerderung Der Angewandten Forschung E.v.06/06/2013 > 41 patent applications in 21 patent subcategories.
20130144592 - Automatic spelling correction for machine translation: Methods, systems, and apparatus, including computer program products, for correcting spelling in text. A text input is received for translation. One or more suspect words in the text input are identified. For each suspect word, one or more candidate words are identified. A score for the text input and scores... Agent: Google Inc.
20130144595 - Language translation based on speaker-related information: Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to automatically translate utterances from a first to a second language, based on speaker-related information determined from speaker utterances and/or other sources of information. In one embodiment, the AEFS receives data that represents an... Agent:
20130144596 - Localization framework for dynamic text: An apparatus and method providing a localization framework capable of localizing dynamic text is disclosed herein. The localization framework is configured to automatically identify and prioritize certain text contained within an application code base to be translated. Such text is pre-processed prior to translation to facilitate accurate and complete translation... Agent: Zynga Inc.
20130144599 - Message translations: Systems for translating text messages in an instant messaging system comprise a translation engine for translating text messages into a preferred language of a recipient of the text messages. The systems are preferably configured to send and receive the text messages and to determine whether the text messages that are... Agent: At&t Intellectual Property I, L.p.
20130144593 - Minimum error rate training with a large number of features for machine learning: Systems, methods, and apparatuses including computer program products for machine learning. A method is provided that includes determining model parameters for a plurality of feature functions for a linear machine learning model, ranking the plurality of feature functions according to a quality criterion, and selecting, using the ranking, a group... Agent:
20130144597 - Simultaneous translation of open domain lectures and speeches: Speech translation systems and methods for simultaneously translating speech between first and second speakers, wherein the first speaker speaks in a first language and the second speaker speaks in a second language that is different from the first language. The speech translation system may comprise a resegmentation unit that merge... Agent: Mobile Technologies, LLC
20130144594 - System and method for collaborative language translation: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for presenting a machine translation and alternative translations to a user, where a selection of any particular alternative translation results in the re-ranking of the remaining alternatives. The system then presents these re-ranked alternatives to the user, who can continue... Agent: At&t Intellectual Property I, L.p.
20130144598 - Translation device, translation method and recording medium: A translation device includes a text obtaining section for obtaining a text of an original document written in a first language, a translation word obtaining section for obtaining translation words of a second language for each of words or collocations included in the text obtained by the text obtaining section,... Agent: Sharp Kabushiki Kaisha
20130144600 - Adaptive pattern learning for bilingual data mining: Embodiments for the adaptive learning of translation layout patterns to mine bilingual data are disclosed. In accordance with at least one embodiment, the adaptive learning of patterns to mine bilingual data includes processing a bilingual web page into a plurality bilingual snippet pairs. The embodiment also includes determining one or... Agent: Microsoft Corporation
20130144601 - Handheld electronic device having selectable language indicator for language selection and method therefor: A method of enabling the selection of a language to be employed as a method input language by a disambiguation routine of a handheld electronic device having stored therein a plurality of method input languages and disambiguation routine number, includes detecting a selection of a language, detecting as an ambiguous... Agent: Research In Motion Limited
20130144607 - Character-based automated text summarization: Methods, devices, systems and tools are presented that allow the summarization of text, audio, and audiovisual presentations, such as movies, into less lengthy forms. High-content media files are shortened in a manner that preserves important details, by splitting the files into segments, rating the segments, and reassembling preferred segments into... Agent:
20130144603 - Enhanced voice conferencing with history: Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to enhance voice conferencing among multiple speakers. Some embodiments of the AEFS enhance voice conferencing by recording and presenting voice conference history information based on speaker-related information. The AEFS receives data that represents utterances... Agent:
20130144608 - Incorporation of variables into textual content: Embodiments of the invention provide techniques for incorporating variable values into textual content. In one embodiment, an abstract phrase including a text phrase and a variable at a particular position in the text phrase is received. The abstract phrase may include multiple variables. A text value for the variable is... Agent: Facebook, Inc.
20130144602 - Quantitative type data analyzing device and method for quantitatively analyzing data: A method for quantitatively analyzing data is applied to a computer system for determining whether a document under test is sensitive. The method obtains sample message from the computer system, partitions content of the sample message to derive at least one original paragraph. The method then partitions the original paragraph... Agent: Institute For Information Industry
20130144606 - System and method for using data and derived features to automatically generate a narrative story: A system and method for automatically generating a narrative story receives data and information pertaining to a domain event. The received data and information and/or one or more derived features are then used to identify a plurality of angles for the narrative story. The plurality of angles is then filtered,... Agent: Northwestern University
20130144604 - Systems and methods for extracting attributes from text content: Systems and method for extracting attributes from text content are described. Example embodiments may include a computer implemented method for extracting attributes from text data, wherein the text data is obtained from at least one information source. As described, the implementation may include receiving, from a user, an address for... Agent: Infosys Limited
20130144605 - Text mining analysis and output system: A natural language authoring system that organizes technical, financial, legal and market information into Point of View specific analytical, visual and narrative decision-support content. The expert system transforms a user's point of view into a tailored narrative and/or visualization report. Expert rules embed interactive advertising, such as affiliate URL links,... Agent: Mehrman Law Office, PC
20130144610 - Action generation based on voice data: An automated technique is disclosed for processing audio data and generating one or more actions in response thereto. In particular embodiments, the audio data can be obtained during a phone conversation and post-call actions can be provided to the user with contextually relevant entry points for completion by an associated... Agent: Microsoft Corporation
20130144611 - Coding device, decoding device, coding method, and decoding method: A coding device includes: a pitch contour detection unit which detects a pitch contour of an input audio signal; a dynamic time warping unit which determines the number of pitch nodes based on the pitch contour and generates a first time warping parameter including information indicating the determined number of... Agent:
20130144613 - Half-rate vocoder: Encoding a sequence of digital speech samples into a bit stream includes dividing the digital speech samples into one or more frames, computing model parameters for a frame, and quantizing the model parameters to produce pitch bits conveying pitch information, voicing bits conveying voicing information, and gain bits conveying signal... Agent: Digital Voice Systems, Inc.
20130144612 - Pitch period segmentation of speech signals: A method for automatic segmentation of pitch periods of speech waveforms takes a speech waveform, a corresponding fundamental frequency contour of the speech waveform, that can be computed by some standard fundamental frequency detection algorithm, and optionally the voicing information of the speech waveform, that can be computed by some... Agent: Synvo Gmbh
20130144614 - Bandwidth extender: An apparatus for extending the bandwidth of an audio signal, the apparatus being configured to: generate an excitation signal from an audio signal, wherein in the audio signal comprises a plurality of frequency components; extract a feature vector from the audio signal, wherein the feature vector comprises at least one... Agent: Nokia Corporation
20130144615 - Method and apparatus for processing an audio signal based on an estimated loudness: An apparatus comprising at least one processor and at least one memory including computer program code. The at least one memory and the computer program code is configured to, with the at least one processor, cause the apparatus at least to determine a loudness estimate of a first audio signal,... Agent: Nokia Corporation
20130144617 - Background noise cancelling device and method: A background noise cancelling device for removing a background noise from an input signal in which the background noise is mixed in a voice signal to produce an output signal includes: storage a unit for preliminarily storing a predictable background noise, which is the background noise, as a stored background... Agent: Nec Corporation
20130144616 - System and method for machine-mediated human-human conversation: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing speech. A system configured to practice the method monitors user utterances to generate a conversation context. Then the system receives a current user utterance independent of non-natural language input intended to trigger speech processing. The system compares the... Agent: At&t Intellectual Property I, L.p.
20130144618 - Methods and electronic devices for speech recognition: A disclosed embodiment provides a speech recognition method to be performed by an electronic device. The method includes: collecting user-specific information that is specific to a user through the user's usage of the electronic device; recording an utterance made by the user; letting a remote server generate a remote speech... Agent:
20130144619 - Enhanced voice conferencing: Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to enhance voice conferencing among multiple speakers. In one embodiment, the AEFS receives data that represents utterances of multiple speakers who are engaging in a voice conference with one another. The AEFS then determines... Agent:
20130144620 - Method, system and program for verifying the authenticity of a website using a reliable telecommunication channel and pre-login message: Various embodiments of the present invention for validating the authenticity of a website are provided. An example of a method according to the present invention comprises providing a website having an artifact, receiving a communication from a user, at a service provider, for validating the website associated with a service... Agent: Telcordia Technologies, Inc.
20130144621 - Systems and methods for assessment of non-native spontaneous speech: Computer-implemented systems and methods are provided for assessing non-native spontaneous speech pronunciation. Speech recognition on digitized speech is performed using a non-native acoustic model trained with non-native speech to generate word hypotheses for the digitized speech. Time alignment is performed between the digitized speech and the word hypotheses using a... Agent: Educational Testing Service
20130144622 - Speech processing device and speech processing method: A speech processing device which can accurately extract a conversation group from among a plurality of speakers, even when a conversation group formed of three or more people is present. This device (400) comprises: a spontaneous speech detection unit (420) and a direction-specific speech detection unit (430) which separately detect,... Agent:
20130144623 - Visual presentation of speaker-related information: Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to determine and present speaker-related information based on speaker utterances. In one embodiment, the AEFS receives data that represents an utterance of a speaker received by a hearing device of the user, such as... Agent:
20130144624 - System and method for low-latency web-based text-to-speech without plugins: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS... Agent: At&t Intellectual Property I, L.p.
20130144625 - Systems and methods document narration: Disclosed are techniques and systems to provide a narration of a text in multiple different voices. In some aspects, systems and methods described herein can include receiving a user-based selection of a first portion of words in a document where the document has a pre-associated first voice model and overwriting... Agent: K-nfb Reading Technology, Inc.
20130144626 - Rap music generation: The preferred embodiments of this invention convert common human speeches into rap music. Computer programs change the timing intervals, amplitudes, and/or frequencies of the sound signals of a common speech to follow rap music beats. The resulting rap music also can overlap with background music and/or video images to achieve... Agent:
20130144627 - Voice control circuit for starting electronic devices: A control circuit employed in an electronic device includes a microphone, a level conversion circuit, and a voice processing circuit. The voice processing circuit includes a voice operated switch connected between the microphone and the level conversion circuit. The microphone picks up voice commands, the voice operated switch receives the... Agent: Hong Fu Jin Precision Industry (shenzhen) Co., Ltd.
20130144628 - Voice interface to nfc applications: Technologies for transferring Near Field Communications information on a computing device include storing information corresponding to services in a database on the computing device, receiving a voice input corresponding to a name of a requested service, and retrieving the information corresponding to the requested service from the database. Such technologies... Agent:
20130144629 - System and method for continuous multimodal speech and gesture interaction: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with... Agent: At&t Intellectual Property I, L.p.
20130144631 - Audio signal processing apparatus and audio signal processing method: An audio signal processing apparatus that processes a bit stream generated by coding an audio signal on a frame-by-frame basis, the bit stream including, for each frame, coded data representing the audio signal, additional data and attribute information, the audio signal processing apparatus including a decoding unit configured to decode... Agent: Panasonic Corporation
20130144630 - Multi-channel audio encoding and decoding: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying... Agent: Microsoft Corporation
20130144632 - Frame error concealment method and apparatus, and audio decoding method and apparatus: A frame error concealment method is provided that includes predicting a parameter by performing a regression analysis on a group basis for a plurality of groups formed from a first plurality of bands forming an error frame and concealing an error in the error frame by using the parameter predicted... Agent: Samsung Electronics Co., Ltd.05/30/2013 > 26 patent applications in 17 patent subcategories.
20130138421 - Automatic human language translation: A computer device for performing automatic human language translation includes memory storing program code of an application program and program code of a translation utility. A processor executes program code of the application program configured to display localized text in a first language on a display screen of the computer... Agent: Micromass Uk Limited
20130138422 - Multilingual speech recognition and public announcement: Embodiments of the present invention provide a system, method, and program product to deliver an announcement to people, such as a public announcement. A computer receives input representative of audio from one or more people speaking in one or more natural languages. The computer processes the input to identify the... Agent: International Business Machines Corporation
20130138426 - Automated content generation: Described are computer-based methods and apparatuses, including computer program products, for automated content generation. In some examples, the method includes generating content metadata from document content via natural language processing based on one or more context parameters associated with the document content. The method can further include receiving user feedback... Agent: Raytheon Company
20130138424 - Context-aware interaction system using a semantic model: The subject disclosure is directed towards detecting symbolic activity within a given environment using a context-dependent grammar. In response to receiving sets of input data corresponding to one or more input modalities, a context-aware interactive system processes a model associated with interpreting the symbolic activity using context data for the... Agent: Microsoft Corporation
20130138423 - Contextual search for modeling notations: A method, an apparatus, and a computer program product for contextual-based search of modeling notations to be used in a model. The method comprises obtaining a contextual property of a notation to be used in a diagram, wherein the contextual property defines a context of a usage of the notation... Agent: International Business Machines Corporation
20130138427 - Fraud detection using text analysis: In one embodiment, a method executed by at least one processor includes receiving text from submitted by a user. The method also includes determining a text score for the received text by comparing a first set of phrases included in the received text to a second set of phrases. The... Agent: Match.com, Lp
20130138429 - Method and apparatus for information searching: Techniques for performing searches using synonym pairs generated from data mining are described herein. These techniques may include receiving, by a server, a query including a keyword. The server may generate multiple synonym pairs associated with the keyword by mining multiple item descriptions under a certain context, and then calculate... Agent: Alibaba Group Holding Limited
20130138430 - Methods and apparatus to classify text communications: Methods and apparatus to classify text communications are disclosed. An example method includes determining a first score indicating a likelihood that a text belongs to a first classification mode by combining a first sentence score and a second sentence score retrieved from an index, the first sentence score indicating a... Agent:
20130138425 - Multiple rule development support for text analytics: Methods, computer program products and systems are provided for applying text analytics rules to a corpus of documents. The embodiments facilitate selection of a document from the corpus within a graphical user interface (GUI), where the GUI opens the selected document to display text of the selected document and also... Agent: International Business Machines Corporation
20130138428 - Systems and methods for automatically detecting deception in human communications expressed in digital form: An apparatus and method for determining whether text is deceptive has a computer programmed with software that automatically analyzes text in digital form by at least one of statistical analysis of psycho-linguistic cues, IP geo-location, gender analysis, authorship analysis, and analysis to detect coded/camouflaged messages. The computer has truth data... Agent: The Trustees Of The Stevens Institute Of Technology
20130138431 - Speech signal transmission and reception apparatuses and speech signal transmission and reception methods: A speech signal transmission apparatus includes an extractor to extract speech signals from speech source signals collected by a plurality of microphones, a power calculator to calculate powers of speech signals of multiple channels and set any one of the speech signals of the multiple channels as a reference speech... Agent: Samsung Electronics Co., Ltd.
20130138432 - Speech encoding/decoding device: A linear prediction coefficient of a signal represented in a frequency domain is obtained by performing linear prediction analysis in a frequency direction by using a covariance method or an autocorrelation method. After the filter strength of the obtained linear prediction coefficient is adjusted, filtering may be performed in the... Agent: Ntt Docomo, Inc.
20130138433 - Switching off dtx for music: The invention relates to a method for disabling a discontinuous transmission node DTX of a speech encoder if a music signal is detected in a call input signal. The music signal is detected by determining an activity factor corresponding to the relation of sound signal periods relative to scheme signal... Agent: Telefonaktiebolaget L M Ericsson (publ)
20130138434 - Noise suppression device: A noise suppression device includes: a power spectrum calculator converting an input signal of time domain into power spectra of frequency domain; a voice/noise determination unit determining whether the power spectra indicate voice or noise; a noise spectrum estimation unit estimating noise spectra of the power spectra; a period component... Agent: Mitsubishi Electric Corporation
20130138435 - Character-based automated shot summarization: Methods, devices, systems and tools are presented that allow the summarization of text, audio, and audiovisual presentations, such as movies, into less lengthy forms. High-content media files are shortened in a manner that preserves important details, by splitting the files into segments, rating the segments, and reassembling preferred segments into... Agent:
20130138436 - Discriminative pretraining of deep neural networks: Discriminative pretraining technique embodiments are presented that pretrain the hidden layers of a Deep Neural Network (DNN). In general, a one-hidden-layer neural network is trained first using labels discriminatively with error back-propagation (BP). Then, after discarding an output layer in the previous one-hidden-layer neural network, another randomly initialized hidden layer... Agent: Microsoft Corporation
20130138437 - Speech recognition apparatus based on cepstrum feature vector and method thereof: A speech recognition apparatus, includes a reliability estimating unit configured to estimate reliability of a time-frequency segment from an input voice signal; and a reliability reflecting unit configured to reflect the reliability of the time-frequency segment to a normalized cepstrum feature vector extracted from the input speech signal and a... Agent: Electronics And Telecommunications Research Institute
20130138438 - Systems and methods for capturing, publishing, and utilizing metadata that are associated with media files: Systems for recording, searching for, and obtaining metadata that are relevant to a plurality of media files are disclosed. The systems generally include a server that is configured to receive, index, and store a plurality of media files, which are received by the server from a plurality of sources, within... Agent:
20130138439 - Interface for setting confidence thresholds for automatic speech recognition and call steering applications: An interactive user interface is described for setting confidence score thresholds in a language processing system. There is a display of a first system confidence score curve characterizing system recognition performance associated with a high confidence threshold, a first user control for adjusting the high confidence threshold and an associated... Agent: Nuance Communications, Inc.
20130138440 - Speech recognition with parallel recognition tasks: The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio... Agent:
20130138441 - Method and system for generating search network for voice recognition: Disclosed is a method of generating a search network for voice recognition, the method including: generating a pronunciation transduction weighted finite state transducer by implementing a pronunciation transduction rule representing a phenomenon of pronunciation transduction between recognition units as a weighted finite state transducer; and composing the pronunciation transduction weighted... Agent: Electronics And Telecommunications Research Institute
20130138442 - Systems and methods for recognizing sound and music signals in high noise and distortion: A method for recognizing an audio sample locates an audio file that closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible... Agent: Shazam Investments Limited
20130138443 - Voice-screen ars service system, method for providing same, and computer-readable recording medium: A method for providing a voice-screen ARS service on a terminal, according to an embodiment of the present invention, uses an application installed on the terminal to connect to an IVR system of a client company via a voice call and connects a data call to a VARS service server.... Agent: Call Gate Co., Ltd.
20130138444 - Modification of operational data of an interaction and/or instruction determination process: It is inter alia disclosed to perform at least one of operating an interaction process with a user of the medical apparatus and determining, based on a representation of at least one instruction given by the user, at least one instruction operable by the medical apparatus. Therein, the at least... Agent: Sanofi-aventis Deutschland Gmbh
20130138445 - Apparatus and method for determining bit rate for audio content: An apparatus and a method for determining a bit rate of audio content, and more particularly, an audio content bit rate determining apparatus and a method capable of quickly and correctly identifying audio content compressed at a constant bit rate from among audio content compressed at a variable bit rate... Agent: Samsung Electronics Co. Ltd.
20130138446 - Audio decoder, audio object encoder, method for decoding a multi-audio-object signal, multi-audio-object encoding method, and non-transitory computer-readable medium therefor: An audio decoder for decoding a multi-audio-object signal having an audio signal of a first type and an audio signal of a second type encoded therein is described, the multi-audio-object signal having a downmix signal and side information, the side information having level information of the audio signals of the... Agent: Fraunhofer-gesellschaft Zur Foerderung Der Angewandten Forschung E.v.05/23/2013 > 37 patent applications in 22 patent subcategories.
20130132064 - Apparatus and method for decoding using joint tokenization and translation: Disclosed are a joint decoding apparatus and a joint decoding method that joins a tokenization process and a translation process. More particularly, the present disclosure can generate all available candidate tokens, reduce translation errors, and obtain an optimal translation result by jointly conducting a decoding by simultaneously conducting the tokenization... Agent: Sk Planet Co., Ltd.
20130132065 - Acquiring accurate machine translation: A method is disclosed for translating a sentence from a source language or input language into an output language. The method includes analyzing a source sentence using linguistic descriptions of the source language, constructing a language-independent semantic structure to represent the meaning of the source sentence, and generating an output... Agent:
20130132066 - Techniques for performing translation of messages: Techniques for providing a translation environment interface to a user are disclosed herein. The techniques include receiving a message template to be translated, the message template including a text portion and one or more template placeholders, parsing the message template to identify the text portion and the template placeholders, generating... Agent: Google Inc.
20130132068 - Device, method and computer readable storage medium for displaying multiple language characters: A method to display multiple language characters is provided. The method comprises a number of steps. A multiple language character data is stored. The multiple language character data comprises a common character part and a plurality of offset parts. The common character part comprises a plurality of common characters and... Agent: Institute For Information Industry
20130132067 - Multi-lingual output device: This application discloses A multi-lingual output device for output of transactional information for a given customer, the device that includes a data base for determining what transaction information needs to be outputted, the local language in which the information is to be outputted, and the preferred language of the customer... Agent:
20130132069 - Text to speech synthesis for texts with foreign language inclusions: A speech output is generated from a text input written in a first language and containing inclusions in a second language. Words in the native language are pronounced with a native pronunciation and words in the foreign language are pronounced with a proficient foreign pronunciation. Language dependent phoneme symbols generated... Agent: Nuance Communications, Inc.
20130132070 - Computer-based construction of arbitrarily complex formal grammar expressions: A method, system and computer program product for building an expression, including utilizing any formal grammar of a context-free language, displaying an expression on a computer display via a graphical user interface, replacing at least one non-terminal display object within the displayed expression with any of at least one non-terminal... Agent: International Business Machines Corporation
20130132072 - Engine for human language comprehension of intent and command execution: The invention provides a computer system for interacting with a user. A set of concepts initially forms a target set of concepts. An input module receives a language input from the user. An analysis system executes a plurality of narrowing cycles until a concept packet having at least one concept... Agent:
20130132071 - Method and apparatus for automatically analyzing natural language to extract useful information: An automatic language-processing system uses a human-curated lexicon to associate words and word groups with broad sentiments such as fear or anger, and topics such as accounting fraud or earnings projections. Grammar processing further characterizes the sentiments or topics with logical (“is” or “is not”), conditional (probability), temporal (past, present,... Agent:
20130132073 - Systems and methods of building and using custom word lists: Standard word lists that are often used for such operations as predictive text, spell checking, and word completion are based on general linguistic data that might not accurately reflect actual text usage patterns of particular users. Systems and methods of building and using a custom word list for use in... Agent: Research In Motion Limited
20130132074 - Method and system for reproducing and distributing sound source of electronic terminal: There is provided a method of reproducing and distributing a sound source of en electronic terminal. The method includes a step of starting to simultaneously reproduce a stream of an MR (Music Recorded) sound source file and a stream of an AR (All Recorded) sound source file that a voice... Agent:
20130132075 - Methods and arrangements in a telecommunications network: The present invention relates to a postfilter and a postfilter control to be associated with a postfilter for improving perceived quality of speech reconstructed at a speech decoder. The postfilter control comprises means for measuring stationarity of a speech signal reconstructed at a decoder, means for determining a coefficient to... Agent: Telefonaktiebolaget L M Ericsson (publ)
20130132076 - Smart rejecter for keyboard click noise: According to various embodiments of the invention, a new and effective keyboard click noise reduction scheme is presented. The keyboard click noise reduction scheme may have various processing units including: Dynamic Signal Modeler, Smart Model Selector, Adaptive Filtering Module, Keyboard/Impulse Noise and Voice Activity Detectors, and a Post-Processing Unit. By... Agent: Creative Technology Ltd
20130132077 - Semi-supervised source separation using non-negative techniques: Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals present within a mixture, where one or more of the signals may be emitted by one or more different sources. In audio-related applications, for instance,... Agent:
20130132078 - Voice activity segmentation device, voice activity segmentation method, and voice activity segmentation program: The voice activity segmentation device comprises: a first voice activity segmentation means for determining a voice-active segment (first voice-active segment) and a voice-inactive segment (first voice-inactive segment) in a time-series of input sound by comparing a threshold value and a feature value of the time-series of the input sound; a... Agent: Nec Corporation
20130132081 - Contents providing scheme using speech information: An apparatus for providing contents based on speech information is provided. The apparatus includes a speech information reception unit configured to receive speech information from a first device, a device identification unit configured to receive device information of the first device from the first device and identify the first device... Agent: Kt Corporation
20130132079 - Interactive speech recognition: A first plurality of audio features associated with a first utterance may be obtained. A first text result associated with a first speech-to-text translation of the first utterance may be obtained based on an audio signal analysis associated with the audio features, the first text result including at least one... Agent: Microsoft Corporation
20130132080 - System and method for crowd-sourced data labeling: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for crowd-sourced data labeling. The system requests a respective response from each of a set of entities. The set of entities includes crowd workers. Next, the system incrementally receives a number of responses from the set of entities until at... Agent: At&t Intellectual Property I, L.p.
20130132082 - Systems and methods for concurrent signal recognition: Methods and systems for recognition of concurrent, superimposed, or otherwise overlapping signals are described. A Markov Selection Model is introduced that, together with probabilistic decomposition methods, enable recognition of simultaneously emitted signals from various sources. For example, a signal mixture may include overlapping speech from different persons. In some instances,... Agent:
20130132083 - Generic framework for large-margin mce training in speech recognition: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the initial acoustic model. Also, a sample-adaptive window bandwidth is calculated for... Agent: Microsoft Corporation
20130132084 - System and method for performing dual mode speech recognition: A system and method for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech... Agent: Soundhound, Inc.
20130132085 - Systems and methods for non-negative hidden markov modeling of signals: Methods and systems for non-negative hidden Markov modeling of signals are described. For example, techniques disclosed herein may be applied to signals emitted by one or more sources. In some embodiments, methods and systems may enable the separation of a signal's various components. As such, the systems and methods disclosed... Agent:
20130132086 - Methods and systems for adapting grammars in hybrid speech recognition engines for enhancing local sr performance: A speech recognition method includes providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine. A first speech input is inputted into the server-based speech recognition engine. A first recognition result from the server-based speech recognition engine is received at the... Agent:
20130132087 - Audio interface: Methods, systems, and apparatus are generally described for providing an audio interface.... Agent: Empire Technology Development LLC
20130132088 - Apparatus and method for recognizing emotion based on emotional segments: An apparatus and method to recognize a user's emotion based on emotional segments are provided. An emotion recognition apparatus includes a sampling unit configured to extract sampling data from input data for emotion recognition. The emotion recognition apparatus further includes a data segment creator configured to segment the sampling data... Agent:
20130132089 - Configurable speech recognition system using multiple recognizers: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers... Agent: Nuance Communications, Inc.
20130132090 - Voice data retrieval system and program product therefor: A voice data retrieval system including an inputting device of inputting a keyword, a phoneme converting unit of converting the inputted keyword in a phoneme expression, a voice data retrieving unit of retrieving a portion of a voice data at which the keyword is spoken based on the keyword in... Agent: Hitachi, Ltd.
20130132091 - Dynamic pass phrase security system (dpss): There is disclosed an n-dimensional biometric security system as well as a method of identifying and validating a user through the use of a automated random one-time passphrase generation. The use of tailored templates to generate one-time phase phrase text as well as the use of update subscriptions of templates... Agent: Ibiometrics, Inc.
20130132092 - Method, apparatus, and program for certifying a voice profile when transmitting text messages for synthesized speech: A mechanism is provided for authenticating and using a personal voice profile. The voice profile may be issued by a trusted third party, such as a certification authority. The personal voice profile may include information for generating a digest or digital signature for text messages. A speech synthesis system may... Agent: Nuance Communications, Inc.
20130132093 - System and method for generating challenge items for captchas: Challenge items for an audible based electronic challenge system are generated using a variety of techniques to identify optimal candidates. The challenge items are intended for use in a computing system that discriminates between humans and text to speech (TTS) system.... Agent: John Nicholas And Kristin Gross Trust U/a/d April 13, 2010
20130132095 - Audio pattern matching for device activation: A system and method are disclosed for activating an electric device from a standby power mode to a full power mode. The system may include one or more microphones for monitoring audio signals in the vicinity of the electric device, and a standby power activation unit including a low-power microprocessor... Agent: Microsoft Corporation
20130132094 - System and method for voice actuated configuration of a controlling device: A speech recognition engine is provided voice data indicative of at least a brand of a target appliance. The speech recognition engine uses the voice data indicative of at least a brand of the target appliance to identify within a library of codesets at least one codeset that is cross-referenced... Agent: Universal Electronics Inc.
20130132096 - Systems and techniques for producing spoken voice prompts: Methods and systems are described in which spoken voice prompts can be produced in a manner such that they will most likely have the desired effect, for example to indicate empathy, or produce a desired follow-up action from a call recipient. The prompts can be produced with specific optimized speech... Agent: Eliza Corporation
20130132098 - Apparatus and method for coding and decoding multi-object audio signal with various channel including information bitstream conversion: Provided is an apparatus and method for coding and decoding multi-object audio signals with various channels and providing backward compatibility with a conventional spatial audio coding (SAC) bitstream. The apparatus includes: an audio object coding unit for coding audio-object signals inputted to the coding apparatus based on a spatial cue... Agent: Electronics And Telecommunications Research Institute
20130132097 - Apparatus for processing an audio signal and method thereof: An apparatus for processing an audio signal and method thereof are disclosed. The present invention includes receiving a downmix signal and side information; extracting extension type identifier indicating whether extension area includes a residual signal from the side information; when the extension type identifier indicates that the extension area includes... Agent: Lg Electronics Inc.
20130132099 - Coding device, decoding device, and methods thereof: Provided are a coding device, a decoding device, and methods thereof, with which it is possible to implement high sound quality coding and decoding in layered coding (scalable coding or embedded coding) wherein each layer comprises a plurality of bit rates (multi-rate). In the coding device (100), a feature analysis... Agent: Panasonic Corporation
20130132100 - Apparatus and method for codec signal in a communication system: The present invention relates to a codec apparatus and method for coding/decoding speech and audio signals in a communication system. In accordance with the present invention, a speech and audio signal in a time domain is transformed into a speech and audio signal in a frequency domain and calculating frequency... Agent: Electronics And Telecommunications Research InstitutePrevious industry: Data processing: structural design, modeling, simulation, and emulation
Next industry: Data processing: financial, business practice, management, or cost/price determination
RSS FEED for 20130613:
Integrate FreshPatents.com into your RSS reader/aggregator or website to track weekly updates.
For more info, read this article.
Thank you for viewing Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents on the FreshPatents.com website. These are patent applications which have been filed in the United States. There are a variety ways to browse Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patent applications on our website including browsing by date, agent, inventor, and industry. If you are interested in receiving occasional emails regarding Data processing: speech signal processing, linguistics, language translation, and audio compression/decompression patents we recommend signing up for free keyword monitoring by email.
FreshPatents.com Support - Terms & Conditions
Results in 1.19735 seconds