FREE Registration is required
Overview:
This paper presents an audio-visual speech recognition framework based on articulatory features, which tries to combine the advantages of both areas, and shows a better recognition accuracy compared to a phone-based recognizer. In one's approach, the paper uses HMMs to model abstract articulatory classes, which are extracted in parallel from both the speech signal and the video frames. The N-best outputs of these independent classifiers are combined to decide on the best articulatory feature tuples. By mapping these tuples to phones, a phone stream can be generated. A lexical search finally maps this phone stream to meaningful word transcriptions. The paper demonstrates the potential of the approach by a preliminary experiment on the GRID database, which contains continuous English voice commands for a small vocabulary task.
(Is this item miscategorized? Does it need more tags? Let us know.)
| Format: | Size: | 182 KB | |
| Date: | Jun 2007 | ||
| Pages: | 5 |
People who downloaded this item also downloaded
![]() |
Speech Recognition: Accelerating the Adoption of Electronic Medical Records |
![]() |
Use Speech Recognition |
![]() |
UML Basics: The Class Diagram |
Top results from Voice Recognition
White Papers, Webcasts, and Resources
- Create new value from System z assets, reduce costs with Web technology IBMFind out how you can integrate and enhance your System z assets faster when you use the version 6.1 update to IBM WebSphere Portal on...
- Live Webcast: Is Your Enterprise Architected for 2010? SybaseEnterprise Architecture (EA) initiatives plagued by 'analysis paralysis'? Keep things moving forward into 2010 with this iterative approach to EA.
- Whitepaper:Intelligent Data Management with Dell Product Group Dell EqualLogicRead about IDM, a new concept that makes it easier and more affordable to manage and leverage your company's data throughout its lifecycle.
Premier Vendor Content Whitepapers, webcasts & resources from our Power Center Sponsors
Featured Training Courses
- Implementing and Administering Windows 7 in the Enterprise
- CCNA Boot Camp v2.0
- VMware vSphere: Install, Configure, Manage [V4]
- Certified Ethical Hacker
- Management and Leadership Skills
- Browse all Training Courses
SmartPlanet
- Thought-provoking progressive ideas on diverse topics that intersect with technology, business, and life, and matter to the world at large. Visit SmartPlanet
- More from IBM
- Can your business work smarter? Learn more about Lotus Symphony
- Learn how to work smarter and optimize cost using the IBM Smart SOA approach Download the eBook
- Smarter ways to make smarter products Read the brief from IBM


