18
edits
Rohandalvi (talk | contribs) |
|||
| (7 intermediate revisions by 2 users not shown) | |||
| Line 5: | Line 5: | ||
*Speech Input API | *Speech Input API | ||
*Text to Speech API | *Text to Speech API | ||
The incubator group is still discussing the APIs and their features. The latest draft can be found here : [http://www.w3.org/2005/Incubator/htmlspeech/live/NOTE-htmlspeech.html] | |||
The group's mailing lists can be found here: [http://lists.w3.org/Archives/Public/public-xg-htmlspeech/] | |||
<br> | |||
== 1.Speech Input API == | == 1.Speech Input API == | ||
| Line 28: | Line 33: | ||
*The developer should be able to choose what speech engine to use. | *The developer should be able to choose what speech engine to use. | ||
*The developer should be able to start, stop, handle errors and multiple requests as required. | *The developer should be able to start, stop, handle errors and multiple requests as required. | ||
<br> | |||
== 2.Text To Speech API == | == 2.Text To Speech API == | ||
The text to speech API will be based on google's proposal(http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0022/htmltts-draft.html).This API can be used for speech translation, turn by turn navigation, dialog systems etc. | The text to speech API will be based on google's proposal([http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0022/htmltts-draft.html]).This API can be used for speech translation, turn by turn navigation, dialog systems etc. | ||
==== API Design - ==== | ==== API Design - ==== | ||
| Line 40: | Line 45: | ||
*The API should itself be independent of the underlying speech synthesizer. If speech synthesis is not supported, appropriate text should be displayed. <br> | *The API should itself be independent of the underlying speech synthesizer. If speech synthesis is not supported, appropriate text should be displayed. <br> | ||
*What speech engines is yet to be decided. | *What speech engines is yet to be decided. | ||
<br> | |||
== Tentative Schedule == | |||
=== First Half === | |||
(May 24th - June 7th) - Implementing the media capture API <br> | |||
(June 8th - June 14th) - Implementing the SpeechRequest API using google's speech reco server along with unit tests for the same. <br> | |||
(June 15th - June 30th) - Low activity due to exams. <br> | |||
(July 1st - July 8th) - Finish whatever coding is left of Speech Request <br> | |||
(July 8th - July 13th) - Tying up loose ends, documentation, code review. By the end of this period, i would like to have the Speech Input API working perfectly. <br> | |||
(July 13th - July 16th) - Mid-term evaluations <br> | |||
If time permits, I'll look at native speech engines and how they can be implemented. | |||
=== Second Half === | |||
(July 17th - July 24th) - Research and decide on possible speech synthesis engines<br> | |||
(July 25th - Aug 8th) - Work on the API implementation and unit tests. <br> | |||
(Aug 8th - Aug 15th) - Tying up loose ends, documentation, code review. <br> | |||
(Aug 16th - Aug 22nd) - Bug fixing and miscellaneous tasks. Committing code to mozilla repos and google code.<br> | |||
(Aug 23rd) - Firm pencil down date.<br> | |||
== Additional Resources == | == Additional Resources == | ||
| Line 47: | Line 70: | ||
== Updates == | == Updates == | ||
*[[GSoC Update 1 - HTML5 Speech API|Update 1]] | *[[GSoC Update 1 - HTML5 Speech API|Update 1]] | ||
*[[GSoC Update 2 - HTML5 Speech API|Update 2]] | |||
*[[GSoC Update 3 - HTML5 Speech API|Update 3]] | |||
*[[GSoC Update 4 - HTML5 Speech API|Update 4]] | |||
*[[GSoC Update 5 - HTML5 Speech API|Update 5]] | |||
edits