or
Bookmark and Share
Speech synthesis apparatus with output editing
 
   
Document Number
US Patent 5842167
Issued Date
November 24, 1998
Link
Map
Abstract
A speech synthesis apparatus for synthesizing speech from text data, having a voice characteristic, a tone, a rhythm, etc. which corresponds to the contents of edition on the text data displayed on a screen, by converting a volume, a speed, a pitch, a voice characteristic, etc. of a voice on judging the contents of the edition, such as an edition of a size, a spacing, a font and so on of a character, on the text data displayed on a screen.
Tags:
Description:
Amusing 0%
Clever 0%
Complex 0%
Efficient 0%
Historic 0%
Important 0%
Innovative 0%
Interesting 0%
Practical 0%
Simple 0%
Number of Claims:
39
Comments:
no comments yet
Owner
Published
November 24, 1998
Application Number
08/653,075
Filed
May 21, 1996
US Classification
704/260   704/276
Int'l Classification
G10L   13/08   (20060101)   G10L   13/00   (20060101)  
Assistant Examiner
Attorney/Law Firm
Priority Data
May 29, 1995 [JP] 7-130773
USPTO Field of Search
395/2.7   395/2.77   395/2.87   395/2.69   395/2.44   704/260   704/261   704/268   704/278   704/235   704/276  
Related Patents
6175820 - Capture and application of sender voice dynamics to enhance communication in a speech-to-text environment - Owned by International Business Machines Corporation (Armonk, NY)

A method for providing voice dynamics of human utterances converted to and represented by text within a data processing system. A plurality of predetermined parameters for recognition and representation of dynamics in human utterances are selected. An enhanced human speech recognition software program is created implementing the predetermined parameters on a data processing system. The enhanced software program includes an ability to monitor and record human voice dynamics and provide speech-to-text recognition. The dynamics in a human utterance is captured utilizing the enhanced human speech recognition software. The human utterance is converted into a textual representation utilizing the speech-to-text ability of the software. Finally, the dynamics are merged along with the textual representation of the human utterance to produce a marked-up text document on the data processing system.

6477495 - Speech synthesis system and prosodic control method in the speech synthesis system - Owned by Hitachi, Ltd. (Tokyo,JP)

A prosodic parameter for an input text is computed by storing a sentence of vocalized speech in a speech corpus memory, searching for a stored text having a similar prosody to an input text as a key to the speech corpus and modifying the prosodic parameter based upon the search results. Because a plurality of prosodic parameters are handled as a linking data, a synthesized sound similar to natural speech having a natural intonation and prosody is produced.

6785649 - Text formatting from speech - Owned by International Business Machines Corporation (Armonk, NY)

A method for converting speech to text and vice versa. The method for converting speech to text includes receiving a spoken input having a non-verbal characteristic, and automatically generating a text output, responsive to the spoken input, having a variable format characteristic corresponding to the non-verbal characteristic of the spoken input. The method for converting text to speech includes receiving a text input having a given variable format characteristic and synthesizing speech corresponding to the text input and having a non-verbal characteristic corresponding to the variable format characteristic of the text input.

7487092 - Interactive debugging and tuning method for CTTS voice building - Owned by International Business Machines Corporation (Armonk, NY)

A method, a system, and an apparatus for identifying and correcting sources of problems in synthesized speech which is generated using a concatenative text-to-speech (CTTS) technique. The method can include the step of displaying a waveform corresponding to synthesized speech generated from concatenated phonetic units. The synthesized speech can be generated from text input received from a user. The method further can include the step of displaying parameters corresponding to at least one of the phonetic units. The method can include the step of displaying the original recordings containing selected phonetic units. An editing input can be received from the user and the parameters can be adjusted in accordance with the editing input.

7433822 - Method and apparatus for encoding and decoding pause information - Owned by Research In Motion Limited (Ontario,CA)

At an audio source, pause information is added to audio data, the combination of which is subsequently packetized. The resulting packets are transmitted to an audio destination via a network in which different packets may be subjected to varying levels of delay. At the audio destination, the pause information may be used to insert pauses at appropriate times to accommodate the occurrence of delays in packet delivery. In one embodiment, pauses are inserted based on a hierarchy of pause types. During pauses, audio filler information may be injected. In this manner, the effects of variable network delays upon reconstructed audio may be mitigated.

Claims
Description
About| FAQs| Terms & Disclaimer| Link to Us| Contact Us