CN101523483B - Method for the rendition of text information by speech in a vehicle - Google Patents

Method for the rendition of text information by speech in a vehicle

Info

Publication number
CN101523483B
CN101523483B CN2007800382076A CN200780038207A CN101523483B CN 101523483 B CN101523483 B CN 101523483B CN 2007800382076 A CN2007800382076 A CN 2007800382076A CN 200780038207 A CN200780038207 A CN 200780038207A CN 101523483 B CN101523483 B CN 101523483B
Authority
CN
China
Prior art keywords
text element
pronunciation information
text
specific pronunciation
automobile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2007800382076A
Other languages
Chinese (zh)
Other versions
CN101523483A (en)
Inventor
S·泽尔朔普
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Audi AG
Original Assignee
Audi AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Audi AG
Publication of CN101523483A
Application granted
Publication of CN101523483B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01C MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/26 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
    • G01C21/34 Route searching; Route guidance
    • G01C21/36 Input/output arrangements for on-board computers
    • G01C21/3605 Destination input or retrieval
    • G01C21/3608 Destination input or retrieval using speech input, e.g. using speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The invention relates to a method for the rendition of text information by speech in a vehicle, in which the following steps are carried out: a) preparation of text elements in a unit external to the vehicle; b) production and preparation of specific pronunciation information for the respective text elements; c) transfer of the text elements and the specific pronunciation information to a processing unit in the vehicle; d) assignment of the specific pronunciation information to the respective text elements; e) rendition of the text elements, taking into consideration the specific pronunciation information, by an electronic speech device in the vehicle.

Description

Method for reproducing text information by speech in a vehicle
Technical field
The present invention relates to a method for reproducing text information by speech in a vehicle.
Background art
Systems in vehicles, for example navigation systems, are known that can reproduce information stored as text modules (Textbausteine) as acoustic speech signals. These systems are limited to the stored basic text elements (Basis-Textelemente), and only these basic text elements can be reproduced by speech. Such a system cannot be extended.
Systems are also known in which text information received in the vehicle from outside can be reproduced by speech. A significant problem here is that this text information cannot be reproduced by speech unambiguously and intelligibly.
Summary of the invention
The technical problem to be solved by the present invention is therefore to provide a method that improves the reproduction of text information by speech in a vehicle.
This technical problem is solved by a method for reproducing text information by speech in a vehicle, in which the following steps are carried out:
a) text elements are provided in a unit outside the vehicle;
b) specific pronunciation information is generated and provided for each text element;
c) the text elements and the specific pronunciation information are transmitted to a processing unit inside the vehicle;
d) the specific pronunciation information is assigned to the corresponding text elements;
e) the text elements are reproduced by an electronic speech device in the vehicle, taking the specific pronunciation information into account,
wherein basic text elements and corresponding basic pronunciation information are stored in a unit inside the vehicle before the voice output system is put into operation; and
the text elements transmitted to the vehicle are compared with the basic text elements and, where they differ, the specific pronunciation information of the text elements is taken into account for the speech output of the text.
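The vehicle-side comparison can be illustrated with a short sketch. It assumes that the specific pronunciation information is used whenever a transmitted text element does not match a stored basic text element. The names `BASE_ELEMENTS`, `TextElement` and `select_pronunciation`, as well as the phonetic strings, are hypothetical and serve only as illustration; they are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical basic text elements stored in the vehicle before first use,
# mapped to their basic pronunciation information (illustrative strings).
BASE_ELEMENTS = {
    "turn left": "t3:n lEft",
    "turn right": "t3:n raIt",
}


@dataclass
class TextElement:
    """A transmitted text element with optional specific pronunciation info."""
    text: str
    specific_pronunciation: Optional[str] = None


def select_pronunciation(element: TextElement) -> Optional[str]:
    """Compare a transmitted element with the stored basic elements.

    On a match the stored basic pronunciation is used; on a mismatch the
    specific pronunciation sent along with the element is used instead.
    """
    key = element.text.lower()
    if key in BASE_ELEMENTS:
        return BASE_ELEMENTS[key]
    return element.specific_pronunciation
```

A speech device built along these lines would fall back to its default pronunciation rules when the function returns `None`, that is, when an element is neither a basic text element nor accompanied by specific pronunciation information.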
Detailed description
In the method according to the invention, text information is reproduced in a vehicle by a speech signal. The text information to be reproduced is provided as text elements in a unit outside the vehicle. In principle, the text elements can also be generated in this unit outside the vehicle.
In addition, specific pronunciation information is generated and provided for each text element. The text elements and the specific pronunciation information are transmitted to a processing unit inside the vehicle. The specific pronunciation information is assigned to the corresponding text elements. The text elements are reproduced by an electronic speech device in the vehicle, taking the specific pronunciation information into account. In this way, a wide variety of individual text information can be conveyed in the vehicle with significantly improved speech reproduction. In particular, optimizing the text information with additional information supplied from outside as specific pronunciation information significantly improves the unambiguity and intelligibility of the speech signal. Even very complex texts can thus be reproduced unambiguously and intelligibly.
Preferably, the specific pronunciation information is assigned to the corresponding text elements outside the vehicle. This increases flexibility with regard to the texts to be reproduced. It also significantly reduces the electronic storage space required in the vehicle.
However, the specific pronunciation information can also be assigned to the corresponding text elements inside the vehicle.
Preferably, the pronunciation information is stored in a database, which is searched as needed for the required information.
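A database lookup of this kind might look as follows. This is only a sketch using an in-memory SQLite database; the table name `pronunciation`, its columns and the helper functions are assumptions made for illustration, since the patent does not specify a schema.

```python
import sqlite3


def build_pronunciation_db(entries):
    """Create an in-memory database from (text, phonemes) pairs."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE pronunciation (text TEXT PRIMARY KEY, phonemes TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO pronunciation (text, phonemes) VALUES (?, ?)", entries
    )
    conn.commit()
    return conn


def lookup_pronunciation(conn, text):
    """Return the stored pronunciation for a text element, or None if absent."""
    row = conn.execute(
        "SELECT phonemes FROM pronunciation WHERE text = ?", (text,)
    ).fetchone()
    return row[0] if row else None
```

Keying the table on the text element keeps each lookup to a single indexed query, which matches the idea of searching the database only when a particular piece of information is needed.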
Preferably, the text elements and the specific pronunciation information are transmitted to the vehicle while the vehicle is in operation, in particular wirelessly.
Preferably, the specific pronunciation information and/or its assignment to the text information is generated in a standardized format. In particular, the specific pronunciation information and/or its assignment to the text information can be generated in the Speech Synthesis Markup Language (SSML).
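As an illustration of a standardized format, the following sketch wraps text elements in SSML and attaches pronunciation information through the SSML `phoneme` element. The function name `to_ssml`, the input layout and the IPA transcription in the usage note are assumptions for this example; the patent does not prescribe which SSML elements are used.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"


def to_ssml(elements, lang="en-US"):
    """Build an SSML string from (text, ipa_or_None) pairs.

    Elements with an IPA transcription are wrapped in <phoneme>;
    elements without one are emitted as plain text.
    """
    speak = ET.Element(
        "speak", {"version": "1.0", "xmlns": SSML_NS, "xml:lang": lang}
    )
    sentence = ET.SubElement(speak, "s")
    last = None  # most recently added <phoneme> element, if any
    for text, ipa in elements:
        if ipa is None:
            if last is None:
                sentence.text = (sentence.text or "") + text
            else:
                last.tail = (last.tail or "") + text
        else:
            last = ET.SubElement(sentence, "phoneme", {"alphabet": "ipa", "ph": ipa})
            last.text = text
    return ET.tostring(speak, encoding="unicode")
```

Calling `to_ssml([("Welcome to ", None), ("Audi", "ˈaʊdi"), (".", None)])` yields a single sentence in which only the word "Audi" carries explicit pronunciation information, so the text and its assigned pronunciation travel together in one document.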
Preferably, basic text elements are stored together with the corresponding basic pronunciation information in a unit or processing unit inside the vehicle before the voice output system is first put into operation, and thus before it is delivered to the end user.
Preferably, the text elements transmitted to the vehicle are compared with the basic text elements and, where they differ, the specific pronunciation information of the text elements is taken into account for correct speech output of the transmitted text.
Preferably, the text elements and the specific pronunciation information are transmitted via a digital broadcast medium, in particular a digital broadcast network.
The term text element covers individual words, sentence parts, or whole sentences. A text element may also comprise several sentences.
Speech synthesis generates a speech signal from text information by reading the text aloud according to stored templates and pronunciation schemes (Ausspracheschemata). The underlying software for speech output is called a speech synthesis or text-to-speech (TTS) engine. A TTS engine can be supported by annotating the text with pronunciation information for individual words or with information on sentence structure, such as grammar. This can be used, for example, in navigation systems. A TTS engine has the advantage that no human speaker is needed and that new prompts, i.e. text outputs, can be produced afterwards. Optimized audio files produced by the TTS engine are stored in the vehicle and called up by events, as in current navigation output, where for example, once a certain distance to the next destination is reached, a speech signal announces that the driver should turn left after 200 m. Sentence parts are assembled dynamically from building blocks (Bausteinen) stored in the vehicle. These basic text elements are stored in the system as basic information so that basic functionality for the speech output of text information is generally guaranteed. However, this is a fixed, predefined finite set of text elements, which is not sufficient for widely varying text information and expressions.
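The dynamic assembly of prompts from stored building blocks can be sketched as follows. The block names and wording in `PROMPT_BLOCKS` and the function `build_turn_prompt` are hypothetical; they only illustrate why a fixed set of blocks covers navigation prompts but not arbitrary text.

```python
# Hypothetical building blocks stored in the vehicle.
PROMPT_BLOCKS = {
    "after": "After",
    "turn_left": "turn left",
    "turn_right": "turn right",
}


def build_turn_prompt(distance_m: int, direction: str) -> str:
    """Assemble a navigation prompt from stored blocks and a distance."""
    key = f"turn_{direction}"
    if key not in PROMPT_BLOCKS:
        # Anything outside the predefined set cannot be expressed.
        raise ValueError(f"no stored building block for direction {direction!r}")
    return f"{PROMPT_BLOCKS['after']} {distance_m} metres, {PROMPT_BLOCKS[key]}."
```

For example, `build_turn_prompt(200, "left")` produces "After 200 metres, turn left.", while a request for a direction that has no stored block fails, which mirrors the limitation of a predefined finite set of text elements.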
With the method according to the invention or its preferred embodiments, the speech output of widely varying text information can now be optimized, with this optimization essentially being carried out outside the vehicle, in a unit external to the vehicle. The optimization produces a transcript (Transskript), i.e. a phonetic representation (Lautsprache) specific to the TTS engine. This transcript can be transmitted dynamically to the vehicle or stored in the vehicle after transmission. The audio output then takes place in the vehicle. The text to be read, together with the additional content or specific pronunciation information, can be converted into audio output by the TTS engine in the vehicle, or likewise by a so-called off-board conversion (Offboardumsetzung). A significant advantage is that a wide variety of new text content can be supplied to the vehicle afterwards and reproduced by the system with improved speech output. Text content can thus be transmitted via a broadcast medium, in particular wirelessly, and output unambiguously in the vehicle as a speech signal. The additional content generated externally as specific pronunciation information can then be used for unambiguous pronunciation in the vehicle and ensures a significant improvement in intelligibility. Content optimized for pronunciation can also be sent to the vehicle by a communication service.
The TTS engine can interpret the optimization and produce a satisfactory output. The method also significantly reduces the required storage space, because storing a large number of basic text elements with corresponding basic pronunciation information as a word base (Wortbasis) in the system requires 10 to 100 times the storage space needed to store the text, including the optimization, in text format. It may also be provided that the text information is optimized for speech, an audio file is generated off-board, i.e. outside the vehicle, and only the audio file is output in the vehicle.
Preferably, the speech optimization (Sprachoptimierung) is described in a standardized form, so that different TTS engines interpret the content in the same way. This is especially advantageous when messages are introduced dynamically, because these messages must be processed by all receivers. One possible standard for speech optimization is SSML, of which, for example, a subset can be defined that the receiver systems support and the transmitting units supply.
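A receiver that supports only a defined SSML subset could check incoming content before handing it to the TTS engine. The following sketch assumes a hypothetical subset `SUPPORTED_TAGS`; which elements a real receiver would support is not stated in the patent.

```python
import xml.etree.ElementTree as ET

# Hypothetical subset of SSML elements agreed between sender and receivers.
SUPPORTED_TAGS = {"speak", "s", "p", "phoneme", "break", "say-as"}


def local_name(tag: str) -> str:
    """Strip an XML namespace prefix of the form '{uri}name'."""
    return tag.rsplit("}", 1)[-1]


def unsupported_tags(ssml: str) -> list:
    """Return the sorted element names in the document outside the subset."""
    root = ET.fromstring(ssml)
    used = {local_name(el.tag) for el in root.iter()}
    return sorted(used - SUPPORTED_TAGS)
```

An empty result means every receiver implementing the subset can interpret the message in the same way; a non-empty result identifies markup that the sender would have to avoid or the receiver would have to ignore.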
It is particularly advantageous to base the speech output of widely varying text information on automatic optimization. For example, the text information sent for a communication service may be updated continuously, so that manually checking the content for pronunciation peculiarities would be very laborious. Automatic optimization improves on this.
In one exemplary approach to automatic optimization, the text is first read in and a pronunciation database containing specific pronunciation information is loaded. The text elements of the transmitted text are then compared with the basic text elements, and the corresponding pronunciation rules are added to the text. Since pronunciation information stored and assigned in advance exists for the basic text elements, and pronunciation information specific to the text elements is transmitted together with the text, the whole text can draw on the respective pronunciation information and be spoken with the best possible pronunciation. Even if text portions are transmitted that the basic text elements cannot interpret or do not cover, these largely unknown text elements are still rendered unambiguously and clearly by the speech signal, because specific pronunciation information has been assigned to them as well. This specific pronunciation information is generated individually off-board and transmitted along with the text as additional information.
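The off-board annotation step of such an automatic optimization might be sketched as below. The tokenization by word, the source labels "base", "specific" and "none", and the dictionary-based lookups are simplifying assumptions for illustration, not details given in the patent.

```python
import re


def annotate_text(text, base_elements, pronunciation_db):
    """Attach pronunciation information to each word of a text.

    base_elements:    words already covered by stored basic pronunciation.
    pronunciation_db: specific pronunciation for words not covered.
    Returns a list of (word, source, pronunciation) tuples.
    """
    annotated = []
    for word in re.findall(r"\w+", text):
        key = word.lower()
        if key in base_elements:
            annotated.append((word, "base", base_elements[key]))
        elif key in pronunciation_db:
            annotated.append((word, "specific", pronunciation_db[key]))
        else:
            # No rule available; left for manual post-processing.
            annotated.append((word, "none", None))
    return annotated
```

Words labelled "none" are natural candidates for the manual post-processing by an editor that the description mentions, after which the database can be extended.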
The whole text can then be output automatically or at a time determined by the driver. The driver can thus determine the time and duration of the reproduction.
In addition, post-processing by an editor, in particular manual post-processing, may be provided. This allows a further improvement and, as it were, initiates a learning mode.

Claims (10)

1. A method for reproducing text information by speech in a vehicle, in which the following steps are carried out:
a) text elements are provided in a unit outside the vehicle;
b) specific pronunciation information is generated and provided for each text element;
c) the text elements and the specific pronunciation information are transmitted to a processing unit inside the vehicle;
d) the specific pronunciation information is assigned to the corresponding text elements;
e) the text elements are reproduced by an electronic speech device in the vehicle, taking the specific pronunciation information into account,
characterized in that basic text elements and corresponding basic pronunciation information are stored in a unit inside the vehicle before the voice output system is put into operation; and
the text elements transmitted to the vehicle are compared with the basic text elements and, where they differ, the specific pronunciation information of the text elements is taken into account for the speech output of the text.
2. The method according to claim 1, characterized in that the specific pronunciation information is assigned to the corresponding text elements outside the vehicle.
3. The method according to claim 1, characterized in that the specific pronunciation information is assigned to the corresponding text elements inside the vehicle.
4. The method according to one of claims 1 to 3, characterized in that the specific pronunciation information is stored in a database, the database being searched as needed.
5. The method according to one of claims 1 to 3, characterized in that the text elements and the specific pronunciation information are transmitted to the vehicle while the vehicle is in operation.
6. The method according to claim 5, characterized in that the text elements and the specific pronunciation information are transmitted wirelessly to the vehicle while the vehicle is in operation.
7. The method according to one of claims 1 to 3, characterized in that the specific pronunciation information and/or its assignment to the text elements is generated in a standardized format.
8. The method according to claim 7, characterized in that the specific pronunciation information and/or its assignment to the text elements is generated in SSML.
9. The method according to one of claims 1 to 3, characterized in that the text elements and the specific pronunciation information are transmitted via a broadcast medium.
10. The method according to claim 9, characterized in that the text elements and the specific pronunciation information are transmitted via a digital broadcast network.
CN2007800382076A 2006-11-29 2007-10-19 Method for the rendition of text information by speech in a vehicle Expired - Fee Related CN101523483B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DE102006056286.0A DE102006056286B4 (en) 2006-11-29 2006-11-29 A method of reproducing text information by voice in a vehicle
DE102006056286.0 2006-11-29
PCT/EP2007/009073 WO2008064742A1 (en) 2006-11-29 2007-10-19 Method for the rendition of text information by speech in a vehicle

Publications (2)

Publication Number Publication Date
CN101523483A CN101523483A (en) 2009-09-02
CN101523483B CN101523483B (en) 2013-07-24

Family

ID=38988102

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2007800382076A Expired - Fee Related CN101523483B (en) 2006-11-29 2007-10-19 Method for the rendition of text information by speech in a vehicle

Country Status (3)

Country Link
CN (1) CN101523483B (en)
DE (1) DE102006056286B4 (en)
WO (1) WO2008064742A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102014209358A1 (en) 2014-05-16 2015-11-19 Ford Global Technologies, Llc Device and method for speech recognition, in particular in a vehicle
DE102015107601A1 (en) 2014-05-16 2015-11-19 Ford Global Technologies, Llc Device and method for speech recognition, in particular in a vehicle
CN105606117A (en) * 2014-11-18 2016-05-25 深圳市腾讯计算机系统有限公司 Navigation prompting method and navigation prompting apparatus
DE102015211101A1 (en) 2015-06-17 2016-12-22 Volkswagen Aktiengesellschaft Speech recognition system and method for operating a speech recognition system with a mobile unit and an external server

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0725382A2 (en) * 1995-02-03 1996-08-07 Robert Bosch Gmbh Method and device providing digitally coded traffic information by synthetically generated speech
US5899975A (en) * 1997-04-03 1999-05-04 Sun Microsystems, Inc. Style sheets for speech-based presentation of web pages

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6446040B1 (en) * 1998-06-17 2002-09-03 Yahoo! Inc. Intelligent text-to-speech synthesis
US6463413B1 (en) * 1999-04-20 2002-10-08 Matsushita Electrical Industrial Co., Ltd. Speech recognition training for small hardware devices
DE19942869A1 (en) * 1999-09-08 2001-03-15 Volkswagen Ag Operating method for speech-controlled device for motor vehicle involves ad hoc generation and allocation of new speech patterns using adaptive transcription
GB0029576D0 (en) * 2000-12-02 2001-01-17 Hewlett Packard Co Voice site personality setting
ES2208212T3 (en) * 2000-12-18 2004-06-16 Siemens Aktiengesellschaft PROCEDURE AND PROVISION FOR THE RECOGNITION OF AN INDEPENDENT VOICE OF THE ANNOUNCER FOR A TELECOMMUNICATIONS TERMINAL OR DATA TERMINALS.
DE10324198A1 (en) * 2003-05-28 2004-12-23 Traveltainer Beteiligungs-Gmbh Information provision method for a vehicle, e.g. traffic reports, in which information is comprised of an information and marking part with the latter part being used to evaluate whether the information part is relevant or not
US20050043067A1 (en) * 2003-08-21 2005-02-24 Odell Thomas W. Voice recognition in a vehicle radio system
DE102005061505B4 (en) * 2005-12-22 2018-04-12 Audi Ag Method for providing information in a vehicle

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Paul Taylor and Amy Isard. SSML: A speech synthesis markup language. Speech Communication, 1997, vol. 21, no. 1. *

Also Published As

Publication number Publication date
DE102006056286A1 (en) 2008-06-12
CN101523483A (en) 2009-09-02
WO2008064742A1 (en) 2008-06-05
DE102006056286B4 (en) 2014-09-11

Similar Documents

Publication Publication Date Title
US10621997B2 (en) Information providing system, information providing method, and computer-readable recording medium
EP3176782B1 (en) Apparatus and method for outputting obtained pieces of related information
US7873517B2 (en) Motor vehicle with a speech interface
EP2053595B1 (en) Text pre-processing for text-to-speech generation
US10542360B2 (en) Reproduction system, terminal device, method thereof, and non-transitory storage medium, for providing information
US8583441B2 (en) Method and system for providing speech dialogue applications
AU2015297647B2 (en) Information management system and information management method
JP2009300537A (en) Speech actuation system, speech actuation method and in-vehicle device
CN103151037A (en) Correcting unintelligible synthesized speech
CN103124318A (en) Method of initiating a hands-free conference call
CN101523483B (en) Method for the rendition of text information by speech in a vehicle
KR100578547B1 (en) Information distributing apparatus, information transmitting apparatus, information receiving apparatus, and information distributing method
JP6729494B2 (en) Information management system and information management method
JP2012168349A (en) Speech recognition system and retrieval system using the same
JP6596903B2 (en) Information providing system and information providing method
JP2020190756A (en) Management device and program
US11250704B2 (en) Information provision device, terminal device, information provision system, and information provision method
JP6971557B2 (en) Management equipment and programs
JP4845249B2 (en) Method for wirelessly transmitting information between a communication system in a vehicle and a central computer outside the vehicle
JP3805065B2 (en) In-car speech synthesizer
JPH10228294A (en) Voice synthesizer
JP2010079190A (en) Method of updating dictionary for speech synthesis, terminal device, and speech synthesis system
JP2974813B2 (en) Traffic information broadcasting system
JPH10200468A (en) Synthesized voice data communication method, transmitter and receiver
WO2010109575A1 (en) Voice information outputting apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130724

Termination date: 20191019
