Image To Speech patent applications listed are from June 2005 to current and include Date, Patent Application Number, Patent Title, and Patent Abstract summary.

11/09/06 - 20060253286 - Text-to-speech synthesis system
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the plurality of speakers in accordance with an operation performed by a user; a searcher for searching the storage for phoneme ...
10/26/06 - 20060241945 - Control of settings using a command rotor
Systems and methods are provided for adjusting parameter levels. A method can include providing a plurality of parameters, each parameter being adjustable over a dimension; enabling a set of keys to adjust the plurality of parameters; selecting from amongst the plurality of parameters a parameter for adjustment using a first ...
10/19/06 - 20060235692 - Bandwidth efficient digital voice communication system and method
A bandwidth efficient digital voice communication system (10) can include a speech-to-text converter (22) for converting a voice signal to a text representation, a speech parameter extractor (28) for extracting user identifiable parameters from a voice signal, and a text-to-speech converter (44) for converting the text representation and the user ...
10/12/06 - 20060229874 - Speech synthesizer, speech synthesizing method, and computer program
A speech synthesizer includes a speech storage section for storing the speech of each of a plurality of speakers, a feature information storage section for storing speaker feature information which shows a feature as to the utterance of each of the speakers specified from speech, a reading feature designation section ...
10/12/06 - 20060229873 - Methods and apparatus for adapting output speech in accordance with context of communication
A technique for producing speech output in an automatic dialog system is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is provided to the user ...
10/12/06 - 20060229872 - Methods and apparatus for conveying synthetic speech style from a text-to-speech system
A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to ...
10/05/06 - 20060224386 - Text information display apparatus equipped with speech synthesis function, speech synthesis method of same, and speech synthesis program
A text information display apparatus equipped with a speech synthesis function able to clearly display a linked portion by speech and enabling easy recognition of a change from a link, provided with a controller for referring to the display rules of text to be converted to speech when converting text ...
10/05/06 - 20060224385 - Text-to-speech conversion in electronic device field
A solution for text-to-speech conversion is provided. According to the solution, it is checked whether or not a character string comprises a character combination which does not represent a word. If the character string comprises a character combination which does not represent a word, the function of the character combination ...
09/28/06 - 20060217982 - Semiconductor chip having a text-to-speech system and a communication enabled device
The present invention relates to a semiconductor chip having a Text-To-Speech (TTS) system and an applying means for use in a communication enabled device. The TTS system converts input text messages into output audio messages, the output audio message having characteristics set by a voice parameter set. The semiconductor chip ...
09/28/06 - 20060217981 - Device for generating speech, apparatus connectable to or incorporating such a device, and computer program product therefor
A control unit extracts at least a part of data that is displayed on a display and sends the extracted part of the displayed data to a speech generating device. The speech generating device includes a conversion circuit that converts the received data to a speech signal. The conversion circuit ...
09/14/06 - 20060206333 - Speaker-dependent dialog adaptation
A simulation environment for adapting a speech model (e.g., baseline model) to a user is provided. The user can interact with a base parametric speech model (e.g., statistical model with learnable parameters such as a Bayesian network) and give positive and/or negative feedback when the dialog system has performed what ...
09/07/06 - 20060200352 - Speech synthesis method
In a phoneme-selection-type speech synthesis apparatus, sound quality when a suitable phoneme is not found is prevented from being deteriorated without changing an input sentence. A plurality of pieces of reading prosody information are obtained. The cost when an optimum phoneme sequence is selected with respect to each of the ...
08/24/06 - 20060190261 - Method and device of speech recognition and language-understanding analyis and nature-language dialogue system using the same
A method of speech recognition and language-understanding analysis is provided. According to a segmental word-concept-tag compound N-gram model, an input speech is divided into a plurality of segmental phrases. Each segmental phrase is attached a tag to indicate whether said segmental phrase is a meaningful segmental phrase or a meaningless ...
07/20/06 - 20060161437 - Text-to-speech synthesis system
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the plurality of speakers in accordance with an operation performed by a user; a searcher for searching the storage for phoneme ...
07/13/06 - 20060155542 - Show & tell tech
A system and method for delivering instructions in an easy-to-understand format designated as “Show and Tell”—Tech (ST-Tech) Format. For people who cannot read and understand medications or health instructions, ST-Tech offers useful, effective and safety features based on color code, pictures, voice and icons to help ensure accuracy in the ...
06/29/06 - 20060143012 - Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium
There are provided a voice outputting apparatus, a voice outputting system, a voice outputting method and a storage medium which, when the synthetic voices of a plurality of text data are to be uttered in overlapping relationship with each other, voice-synthesize the plurality of text data with different kinds of ...
06/29/06 - 20060143011 - Information processing apparatus and information processing system
An information processing apparatus comprises a first timer that times a first time; a second timer that times a second time which is different from the first time; and a switching unit that switches, in accordance with a user, between a first power saving control mode in which shift is ...
06/22/06 - 20060136213 - Speech synthesis apparatus and speech synthesis method
A speech synthesis apparatus which can appropriately transform a voice characteristic of a speech is provided. The speech synthesis apparatus includes an element storing unit in which speech elements are stored, a function storing unit in which transformation functions are stored, an adaptability judging unit which derives a degree of ...
06/22/06 - 20060136212 - Method and apparatus for improving text-to-speech performance
In a device (100), a method (200) is provided for improving text-to-speech performance. The method includes the steps of determining (202) if a text expression from an application operating in the device is in a vocabulary, selecting (204) a corresponding speech expression from the vocabulary if the text expression is ...
06/15/06 - 20060129403 - Method and device for speech synthesizing and dialogue system thereof
A method and a device for speech synthesizing are provided. The method is used for generating a speech answer in a speech dialogue system, in which the speech dialogue system includes a speech recognizing process for recognizing a speech input inputted from a user to generate a textual answer. The ...
06/15/06 - 20060129402 - Method for reading input character data to output a voice sound in real time in a portable terminal
A method for reading input character data to output a voice sound in real time in a portable terminal. A character input is monitored, and a voice sound corresponding to input character data is output whenever the character input is performed in a preset minimum reading unit. A user can ...
06/15/06 - 20060129401 - Speech segment clustering and ranking
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech segments if the consecutive speech segments satisfy a predetermined filtering test, and for generating a second cluster comprising at least one ...
06/15/06 - 20060129400 - Method and system for converting text to lip-synchronized speech in real time
A method and system for presenting lip-synchronized speech corresponding to the text received in real time is provided. A lip synchronization system provides an image of a character that is to be portrayed as speaking text received in real time. The lip synchronization system receives a sequence of text corresponding ...
06/08/06 - 20060122836 - Dynamic switching between local and remote speech rendering
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component ...
06/01/06 - 20060116879 - Context enhancement for text readers
A method, system and apparatus for enhancing the audible presentation of addressing information disposed in content processed in a text reader. In an aspect of the present invention, a method for enhancing the audible presentation of addressing information disposed in content processed in a text reader can include translating the ...
05/18/06 - 20060106609 - Speech synthesis system
To provide a speech synthesis apparatus which can prevent confusing its users and deteriorating the quality of synthesized speech resulting from incompleteness of the sentences to be read out, and thus can read out speech which is easily understandable to the user. The speech synthesis apparatus includes: an incomplete part-of-sentence ...
05/11/06 - 20060100878 - Synthesis-based pre-selection of suitable units for concatenative speech
A system and computer-readable medium are disclosed that synthesize speech from text using a triphone unit selection database. The instructions on the computer-readable medium control a computing device to perform the steps: receiving input text, selecting a plurality of N phoneme units from the triphone unit selection database as candidate ...
05/11/06 - 20060100877 - Generating and relating text to audio segments
A method, apparatus and system for generating speech minutes. The method comprises the steps of displaying status indicators of respective audio (speech) stream chunks received and text information thereof on a GUI display and establishing the tagging between each audio stream chunk and the corresponding text information by dragging and ...
05/04/06 - 20060095264 - Unit selection module and method for chinese text-to-speech synthesis
This invention relates to a unit selection module for Chinese Text-to-Speech (TTS) synthesis, mainly comprising a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified variable-length unit selection scheme; any Chinese sentence is firstly input and then parsed into a context-free grammar (CFG) by ...
04/20/06 - 20060085195 - Voice output device and voice output method
The voice output apparatus, which enhances a robustness of an interface between a user and the apparatus by transmitting, information to the user via text message and voice message, is comprised of: a display unit (107) for displaying a text message that is apparatus-transmitting information to be transmitted to the ...
04/13/06 - 20060080102 - Method and system for improving the fidelity of a dialog system
Embodiments of the present invention recite a method and system for improving the fidelity of a dialog system. In one embodiment, a first input generated by a user of a first system operating in a first modality is accessed. In embodiments of the present invention, the first system also generates ...
04/06/06 - 20060074674 - Method and system for statistic-based distance definition in text-to-speech conversion
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be subjected to text-to-speech conversion is analyzed to obtain a text with descriptive prosody annotation; clustering is performed for samples in ...
04/06/06 - 20060074673 - Pronunciation synthesis system and method of the same
A pronunciation synthesis system and method. The pronunciation synthesis system may pre-analyze a word to decompose the word into word root(s) and/or affix(es). The pronunciation synthesis system may include at least an analyzing module, a searching module, a pronunciation module, and a synthesizing module. The pronunciation synthesis system may be ...
03/30/06 - 20060069567 - Methods, systems, and products for translating text to speech
Methods, systems, and products are disclosed for translating text to speech. One such method receives content for translation to speech, identifies a textual sequence in the content, and correlates the textual sequence to a phrase. A voice file storing multiple phrases is accessed, with the voice file mapping each phrase ...
03/30/06 - 20060069566 - Segment set creating method and apparatus
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set belonging to the cluster is generated. For each cluster, a segment belonging to the cluster is replaced with the ...
03/02/06 - 20060047514 - Method and apparatus for synthesizing speech
A method for synthesizing speech includes an obtaining step of obtaining a speech message, and a resuming step of resuming speech output of the speech message according to resumption data representing a resumption mode of the speech message when the speech output of the speech message is suspended in the ...
02/23/06 - 20060041430 - Text-to-speech and image generation of multimedia attachments to e-mail
A multi-mail system and method is disclosed in which a sender may convey and a recipient can realize emotional aspects associated with substantive content of a multi-mail message by receiving a message that is more than textual in nature. Voice recognition technology and programmatic relation of sound and graphics may ...
02/23/06 - 20060041429 - Text-to-speech system and method
A system and method for generating synthetic speech, which operates in a computer implemented Text-To-Speech system. The system comprises at least a speaker database that has been previously created from user recordings, a Front-End system to receive an input text and a Text-To-Speech engine. The Front-End system generates multiple phonetic ...
02/09/06 - 20060031072 - Electronic dictionary apparatus and its control method
An electronic dictionary apparatus and its control method are provided. A database contains entry words and advanced phonetic information corresponding to each entry word. A dictionary search section searches the database using an entry word specified by a user as a search key and acquires the advanced phonetic information corresponding ...
02/02/06 - 20060025998 - Information-processing apparatus, information-processing methods, recording mediums, and programs
The present invention provides an information-processing apparatus for communicating with an other information-processing apparatus, which is connected to the information-processing apparatus through a network. The apparatus includes reproduction means for synchronously reproducing content data common to the other apparatus, user-information receiver means for receiving a voice and image of an ...
01/19/06 - 20060015343 - Identifying phonetically irregular words
A technique is described that identifies at least one phonetically irregular word in a text passage and displays the identified at least one phonetically irregular word in a readable format different from other portions of the text passage. Related apparatuses, techniques, systems, computer program products are also described. ...
01/19/06 - 20060015342 - Document mode processing for portable reading machine enabling document navigation
Controlling a reading machine while reading a document to a user by receiving an image of a document, accessing a knowledge base that provides data that identifies sections in the document and processing user commands to select a section of the document. The reading machine applies text-to-speech to a text ...
01/12/06 - 20060009977 - Speech synthesis apparatus
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a language processing unit which generates synthesized speech generation information necessary for generating synthesized speech in accordance with a language string, a prosody generating unit ...
01/12/06 - 20060009976 - Method for transforming image imto music
The invention is a method for transforming image into music, more particularly, is a method for transforming image into sound at first and then editing the sound into music. In the method, the dynamic image or static image is captured by using the image capture apparatus. An image datum is ...
12/29/05 - 20050288932 - Reducing processing latency in optical character recognition for portable reading machine
A portable reading device includes a computing device and a computer readable medium storing a computer program product to receive an image and select a section of the image to process. The product processes the section of the image with a first process and when the first process is finished ...
12/08/05 - 20050273337 - Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
When a speaker-independent voice-recognition (SIVR) system recognizes a spoken utterance that matches a phonetic representation of a speech element belonging to a predefined vocabulary, it may play a synthesized speech fragment as a means for the user to verify that the utterance was correctly recognized. When a speech element in ...
12/01/05 - 20050267758 - Converting text-to-speech and adjusting corpus
The present invention provides a method and apparatus for text to speech conversion, and a method and apparatus for adjusting a corpus. The method for text to speech comprises: text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS model generated ...
12/01/05 - 20050267757 - Handling of acronyms and digits in a speech recognition and text-to-speech engine
A method is disclosed for the detection of acronyms and digits and for finding the pronunciations for them. The method can be incorporated as part of an Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) system. Moreover, the method can be part of Multi-Lingual Automatic Speech Recognition (ML-ASR) and TTS systems. ...
11/17/05 - 20050256716 - System and method for generating customized text-to-speech voices
A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source ...
10/20/05 - 20050234725 - Method and system for flexible usage of a graphical call flow builder
A method (10) of developing call flows can simply include a determination (12) whether an alternative speech field is filled. If the alternative speech field is not filled, then the description text is used (16) in a description field as a default for text for speech output. The description field ...
10/20/05 - 20050234724 - System and method for improving text-to-speech software intelligibility through the detection of uncommon words and phrases
Disclosed is a system and method for improving the intelligibility of speech output by a speech synthesizer by determining if uncommon words exist in the text, and if it is determined that an uncommon word exists in the text, pausing the output of the synthesized speech of the uncommon word ...
10/13/05 - 20050228672 - Method and system of dynamically adjusting a speech output rate to match a speech input rate
A method (10) and system of adjusting a speech output rate to match a speech input rate can include the steps of receiving (12) speech input, computing (14) a speech input rate, and dynamically adjusting (18 or 26) a speech output rate to match the speech input rate. If the ...
10/13/05 - 20050228671 - System and method for utilizing speech recognition to efficiently perform data indexing procedures
A system and method for utilizing speech recognition to efficiently perform data indexing procedures includes an authoring module that coordinates an authoring procedure for creating an index file that has pattern word sets corresponding to data objects stored in a memory of a host electronic device. The pattern word sets ...
10/06/05 - 20050222844 - Method and apparatus for generating spatialized audio from non-three-dimensionally aware applications
One embodiment of the present invention provides a system that facilitates generating spatialized audio from non-three-dimensional aware applications. The system operates by intercepting parameters associated with audio use from an application. The system then obtains location information of a display window associated with the application within a three-dimensional display. Next, ...
09/29/05 - 20050216267 - Method and system for computer-aided speech synthesis
Method and system for computer-aided speed synthesis for synthesizing electronic text by performing a predefined series of rules-based analyses in a predefined order, each of the analyses operating in a graduated manner to convert respective electronic text into electronic lexicons, and announcing analog speech based on the results of the ...
09/08/05 - 20050197840 - Device for event prediction on booting a motherboard
An event predictor on a motherboard has a storage device, a date filter, a text analyzer, and a speech database. The storage device stores an event table containing a plurality of events. The date filter is connected to the storage device for receiving date information provided by the motherboard such ...
09/08/05 - 20050197839 - Apparatus, medium, and method for generating record sentence for corpus and apparatus, medium, and method for building corpus using the same
A method, medium, and apparatus for generating a record sentence to establish a speech corpus, including generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text, selecting an unseen sentence including an unseen unit according to the ...
09/08/05 - 20050197838 - Method for text-to-pronunciation conversion capable of increasing the accuracy by re-scoring graphemes likely to be tagged erroneously
The present invention provides a method for text-to-pronunciation conversion capable of increasing the accuracy by re-scoring graphemes likely to be tagged erroneously. Grapheme segmentation and phoneme tagging are first applied to an input word to generate at least one grapheme-phoneme pair sequence, and the score of each grapheme-phoneme pair sequence ...
09/08/05 - 20050197837 - Enhanced multilingual speech recognition system
A speech recognition system comprising: a language identification unit for identifying the language of a text item entry; at least one separate pronunciation modelling unit including a phoneme set and pronunciation model for at least one language; means for activating the pronunciation modelling unit including the phoneme set and pronunciation ...
09/01/05 - 20050192807 - Hierarchical approach for the statistical vowelization of arabic text
Advantageously, the text is completed according to a model hierarchy giving higher priority to longer chunks of text, ie sentences (310, 315, 320) then multiword phrases (330, 335, 340), then words (350, 355, 360) and finally character groups (370, 375, 380, 390). ...
08/25/05 - 20050187773 - Voice synthesis system
A voice synthesis system for interactive voice services comprises a voice server connected to a packet network dispensing a voice service to a user terminal by executing a service file associated with the voice service. An HTTP client in the voice server transmits a request containing a text to be ...
08/25/05 - 20050187772 - Systems and methods for synthesizing speech using discourse function level prosodic features
Techniques are provided for synthesizing speech using discourse function level prosodic features. An output text is determined. The discourse functions within the text are determined based on a theory of discourse analysis such as the Unified Linguistic Discourse Model. The salient prosodic features associated with the discourse functions are identified ...
08/11/05 - 20050177369 - Method and system for intuitive text-to-speech synthesis customization
A system for tuning the text-to-speech conversion process having a text-to-speech engine that converts the input text into a processed text form which includes speech features. A visual editing interface displaying the processed text form using graphical indicators on an output device to allow a user to edit the text ...
06/09/05 - 20050125228 - Digital electronic correction pen with audio pronunciation and spell check capabilities, with built-in memory. also calculation processing... thereof...
The digital smart pen has the capabilities to help user pronounce unknown or unfamiliar text, by verbal means. Also, this invention has the operations to do a spell check to text, to assist user in correct spelling of the word or words. Plus, the smart pen has the ability to ...
06/02/05 - 20050119891 - Method and apparatus for speech synthesis without prosody modification
A speech synthesizer is provided that concatenates stored samples of speech units without modifying the prosody of the samples. The present invention is able to achieve a high level of naturalness in synthesized speech with a carefully designed training speech corpus by storing samples based on the prosodic and phonetic ...
06/02/05 - 20050119890 - Speech synthesis apparatus and speech synthesis method
The present invention includes: a characteristic parameter DB 106 that holds, with respect to each speech-unit, speech-unit data indicating a loan word attribute and acoustic characteristics; a language analysis unit 104 and a prosody prediction unit 109 that obtain text data and respectively predict a loan word attribute and acoustic ...