00項(xiàng)目預(yù)研語音_第1頁
00項(xiàng)目預(yù)研語音_第2頁
00項(xiàng)目預(yù)研語音_第3頁
00項(xiàng)目預(yù)研語音_第4頁
00項(xiàng)目預(yù)研語音_第5頁
已閱讀5頁,還剩4頁未讀 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡介

1、項(xiàng)目預(yù)研Engine 項(xiàng)目組華技大學(xué)軟件學(xué)院2005信 息父項(xiàng)名稱父項(xiàng)標(biāo)識(shí)版本子項(xiàng)文檔名稱子項(xiàng)文檔標(biāo)識(shí)版本修 改 信 息修 改 者日 期舊版本修改標(biāo)識(shí)原 因?qū)?核日 期新版本批 準(zhǔn)日 期配 置 信 息項(xiàng)目名稱移動(dòng)項(xiàng)目標(biāo)識(shí)Engine-BM-2005-01版 本1.0文檔名稱項(xiàng)目預(yù)研文檔標(biāo)識(shí)PDS-2005-01版 本1.0編 輯撰寫人時(shí) 間2005-5-15版 本1.0審 核 批 準(zhǔn)審 核日 期2005-5-17版 本1.0批 準(zhǔn)日 期2005-5-17項(xiàng)目預(yù)研1 引言1.1 編寫目的項(xiàng)目預(yù)研目的旨在梳理語音開發(fā)流程,確定應(yīng)用開發(fā)框架。1.2 背景項(xiàng)目名稱:移動(dòng)(b-mobile 讓隨身移動(dòng))

2、項(xiàng)目委托項(xiàng)目開發(fā)系統(tǒng)開發(fā):華:華技大學(xué)軟件學(xué)院技大學(xué)軟件學(xué)院Engine 項(xiàng)目組:VS.NET、Speech Application SDK VerBeta 1.11.3 定義Speech Application SDK: 微軟語音應(yīng)用開發(fā)包1.4 參考資料Speech Application SDK 開系統(tǒng)需求說明檔2 應(yīng)用程序組織模式Speech Application SDK 程序模式之2.1基礎(chǔ)篇(SALT)The SAPI API provides a high-levelerface betspeech engines. SAPI implements all the low-le

3、veln an application ands needed tocontrol and manage the real-time operations of various speech engines.The two basic types of SAPI engines are text-to-speech (TTS) systems andspeechspoken spokenrecognizers. TTS systems synthesize text strings and filesoaudio using synthetic voi. Speech recognizers

4、convert humanaudioo readable text strings and files.Speech Application SDK 程序模式之2.2componentsoftheSpeechApplication PlatformRequired ComponentsDeploying a speech-enabled Web application using SALT markuprequires three components.1.An ASP.NET serverThe Web server generates Wges containing HTML, SALT,

5、and embedded script. The script controls the dialogue flow forvoice-onlyeractions. For example, if there are severalprompts on a page, the script defines the order in which the audio prompts play.2.A Speech ServerSpeech Server recognizes speech, and plays audio prompts and responses.3.A cntThe Speec

6、h Platform supports two types of cnts:ephonyApplication Servicnts, and multimodal cnts with averofernet ernetExplorer running either Speech Add-in for Explorer or Speech Add-in forPocketernet Explorer.The following diagram illustrates these elements and the types ofinformation they pros. It also ill

7、ustrates the relationship of theseelements to the Visual Studio .NET 2003 Speech Development Tools.3Common Usage ScenariosThis section illustrates three deployment configurations for commondeployment scenariost the Speech Platform supports.3.1ephony Scenariohis scenario,ephony Application Servi(TAS)

8、 is the cnt. Aephone acts as the terminal device, and connects to TAS through astandardephony board. Theephony board provides theerfacebetn theephone and TAS. At run time, TAS res on the Webserver for application logic, and on Speech Server for audio signalprosing.When the user dials a phone number

9、for aephony service, the callconnects to TAS. TAS assotes theephone call wivoice-onlySALTreter. Then TAS connects to the Web server and loads thedefault page for the applicationt provides the service for which thecaller is dialing. As the callereracts with the application, TASpasses audio and dual t

10、one multi-frequency (DTMF) input from thecaller to Speech Server, which performs speech recognition (SR),text-to-speech (TTS), and DTMF prosing.The SASDK includes a number of Dialog Speech Controlst supportComputer-Supportedmunications Applications (CSTA)servi. These include the AnswerCall, Transfer

11、Call, MakeCall,and DisconnectCall controls. Developers can use these controls toanswer, transfer, initiate, and disconnectephone calls, as well asgather call information, and send and receive CSTA events. TheSASDK also includes a SmexMessage (Simple Messaging Exten) controlt developers can use to se

12、nd and receive raw CSTA messages.3.2Desktop Multimodal Scenariohis scenario, the cnt isernet Explorer with SpeechAdd-in forernet Explorer installed. ASP.NETspeech-enabled Web application pages reside on the Web server.When the user enters a URL inernet Explorer, the Web serveropens the applications

13、default page. The Web server sends HTML,SALT, and JScript to the Speech Add-in on the desktop. SALT markuphe pagest the Web server sends to the cnt trigger speechrecognition and text-to-speech synthesis. In order to implement SALTfunctionality, at run time the Speech Add-in instantiates a sharedSAPI

14、 SR engine. If nesary, the Speech Add-in also instantiates aTTS and a prompt engine on the cnt. These engines on the desktopcnt perform all prompting, speech recognition, and text-to-speechsynthesis.Note Multimodal applications using a desktop c nt can be deployed using only the SASDK.3.3Windows Mob

15、iMultimodal Scenarioased Pocket PC 2003 (Pocket PC)his scenario, the cnt is Pocketernet Explorer with the SpeechAdd-in forPocketernet Explorer installed. ASP.NETspeech-enabled Web application pages reside on the Web server,along with the application grammars, and a configuration filecontaining the U

16、RL to the Speech Servert performs speechprosing.When the user enters a URL on Pocket PC, the Web server opens theapplications default .aspx page. The Web server also sends the URLpoing to Speech Server. The paget the Web server sendscontains HTML, SALT, and JScript. When the user taps aspeech-enable

17、d HTML element and talks, Pocket PC sends the audio to Speech Server. Along with the compressed audio, Pocket PC sendseither an inline recognition grammar, or a poer to the location of anexternally-d recognition grammart is bound totspeech-enabled element. If the recognition grammar is an inlinegram

18、mar, Speech Server loads the grammar and performs speechrecognition. If the grammar is an externally-d grammar, SpeechServerdownloads a copy of the grammar, loads the grammar,and then performs speech recognition.After the recognizer finishes, Speech Server sends SemMarkupLanguage (SML) output to the Pocket PC along wiudio for promptsif the application dialogue flow requires the appli

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

評(píng)論

0/150

提交評(píng)論