HTML5 Speech API: Difference between revisions

 
(7 intermediate revisions by 2 users not shown)
Line 5: Line 5:
*Speech Input API  
*Speech Input API  
*Text to Speech API
*Text to Speech API
The incubator group is still discussing the APIs and their features. The latest draft can be found here : [http://www.w3.org/2005/Incubator/htmlspeech/live/NOTE-htmlspeech.html]
The group's mailing lists can be found here: [http://lists.w3.org/Archives/Public/public-xg-htmlspeech/]
<br>


== 1.Speech Input API  ==
== 1.Speech Input API  ==
Line 28: Line 33:
*The developer should be able to choose what speech engine to use.  
*The developer should be able to choose what speech engine to use.  
*The developer should be able to start, stop, handle errors and multiple requests as required.
*The developer should be able to start, stop, handle errors and multiple requests as required.
 
<br>
== 2.Text To Speech API  ==
== 2.Text To Speech API  ==


The text to speech API will be based on google's proposal(http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0022/htmltts-draft.html).This API can be used for speech translation, turn by turn navigation, dialog systems etc.  
The text to speech API will be based on google's proposal([http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0022/htmltts-draft.html]).This API can be used for speech translation, turn by turn navigation, dialog systems etc.  


==== API Design -  ====
==== API Design -  ====
Line 40: Line 45:
*The API should itself be independent of the underlying speech synthesizer. If speech synthesis is not supported, appropriate text should be displayed. <br>  
*The API should itself be independent of the underlying speech synthesizer. If speech synthesis is not supported, appropriate text should be displayed. <br>  
*What speech engines is yet to be decided.
*What speech engines is yet to be decided.
<br>
== Tentative Schedule ==
=== First Half ===
(May 24th - June 7th) - Implementing the media capture API <br>
(June 8th - June 14th) -  Implementing the SpeechRequest API using google's speech reco server along with unit tests for the same. <br>
(June 15th - June 30th) - Low activity due to exams. <br>
(July 1st - July 8th) - Finish whatever coding is left of Speech Request <br>
(July 8th - July 13th) - Tying up loose ends, documentation, code review. By the end of this period, i would like to have the Speech Input API working perfectly. <br>
(July 13th - July 16th) - Mid-term evaluations <br>
 If time permits, I'll look at native speech engines and how they can be implemented.
=== Second Half ===
(July 17th - July 24th) - Research and decide on possible speech synthesis engines<br>
(July 25th - Aug 8th) - Work on the API implementation and unit tests. <br>
(Aug 8th - Aug 15th) - Tying up loose ends, documentation, code review. <br>
(Aug 16th - Aug 22nd) - Bug fixing and miscellaneous tasks. Committing code to mozilla repos and google code.<br>
(Aug 23rd) - Firm pencil down date.<br>


== Additional Resources ==
== Additional Resources ==
Line 47: Line 70:
== Updates ==
== Updates ==
*[[GSoC Update 1 - HTML5 Speech API|Update 1]]
*[[GSoC Update 1 - HTML5 Speech API|Update 1]]
*[[GSoC Update 2 - HTML5 Speech API|Update 2]]
*[[GSoC Update 3 - HTML5 Speech API|Update 3]]
*[[GSoC Update 4 - HTML5 Speech API|Update 4]]
*[[GSoC Update 5 - HTML5 Speech API|Update 5]]
18

edits