Latest revision as of 12:15, 19 June 2009
Speech to Text
This page collects resources, plans, and progress notes for taking audio and outputting text. Even a rough version of this technique has multiple applications, such as extracting keywords from conversations.
Software Frameworks
- CMU Sphinx http://cmusphinx.sourceforge.net/html/cmusphinx.php
- Vista Speech API
- Julius http://julius.sourceforge.jp/en_index.php
- WaveToText (Windows shareware, uses MS Speech API) http://www.research-lab.com/vexp007read.htm
Training Corpora, Resources
- http://www.voxforge.org/
- grey's SRT idea for movies (basically: use SRT subtitle files or closed captioning from films as "automated" transcripts, paired by timestamp with the film audio, to build training data)
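A first step for the SRT idea could be sketched as follows: read a subtitle file and turn each cue into a (start, end, text) record that can later be matched to a slice of the film's audio. This is only a minimal sketch that assumes the common SubRip layout (an index line, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then one or more text lines). The names `Cue` and `parse_srt` are made up for illustration and do not come from any of the frameworks listed above.

```python
import re
from dataclasses import dataclass

# SRT timing lines look like: 00:01:02,500 --> 00:01:04,000
_TIME = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)
_TAG = re.compile(r"<[^>]+>")  # simple markup such as <i>...</i>


@dataclass
class Cue:
    """One subtitle cue (hypothetical helper type for this sketch)."""
    start: float  # seconds from the start of the film
    end: float    # seconds from the start of the film
    text: str     # subtitle text with line breaks and tags removed


def _seconds(h, m, s, ms):
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000.0


def parse_srt(content):
    """Return a list of Cue objects parsed from the text of an SRT file."""
    cues = []
    normalized = content.replace("\r\n", "\n").strip()
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        for i, line in enumerate(lines):
            match = _TIME.search(line)
            if match:
                break
        else:
            continue  # no timing line in this block, skip it
        parts = match.groups()
        # Everything after the timing line is the spoken text.
        text = " ".join(l.strip() for l in lines[i + 1:] if l.strip())
        text = _TAG.sub("", text).strip()
        if text:
            cues.append(Cue(_seconds(*parts[:4]), _seconds(*parts[4:]), text))
    return cues
```

Each resulting cue gives a time window and a transcript, so the matching audio segment could be cut out with whatever audio tool is at hand (not shown here). Subtitles are often paraphrased and their timing is only approximate, so the pairs would probably need filtering or re-alignment before being used for training.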