A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA

Please use this identifier to cite or link to this item: https://doi.org/10.1145/1216919.1216928

DC Field	Value
dc.title	A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA
dc.contributor.author	Lin E.C.
dc.contributor.author	Yu K.
dc.contributor.author	Rutenbar R.A.
dc.contributor.author	Chen T.
dc.date.accessioned	2018-08-21T05:07:13Z
dc.date.available	2018-08-21T05:07:13Z
dc.date.issued	2007
dc.identifier.citation	Lin E.C., Yu K., Rutenbar R.A., Chen T. (2007). A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA. ACM/SIGDA International Symposium on Field Programmable Gate Arrays - FPGA : 60-68. ScholarBank@NUS Repository. https://doi.org/10.1145/1216919.1216928
dc.identifier.isbn	1595936009
dc.identifier.isbn	9781595936004
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/146269
dc.description.abstract	The Carnegie Mellon In Silico Vox project seeks to move best-quality speech recognition technology from its current software-only form into a range of efficient all-hardware implementations. The central thesis is that, like graphics chips, the application is simply too performance hungry, and too power sensitive, to stay as a large software application. As a first step in this direction, we describe the design and implementation of a fully functional speech-to-text recognizer on a single Xilinx XUP platform. The design recognizes a 1000 word vocabulary, is speaker-independent, recognizes continuous (connected) speech, and is a "live mode" engine, wherein recognition can start as soon as speech input appears. To the best of our knowledge, this is the most complex recognizer architecture ever fully committed to a hardware-only form. The implementation is extraordinarily small, and achieves the same accuracy as state-of-the-art software recognizers, while running at a fraction of the clock speed. Copyright 2007 ACM.
dc.source	Scopus
dc.subject	DSP
dc.subject	FPGA
dc.subject	In silico vox
dc.subject	Speech recognition
dc.type	Conference Paper
dc.contributor.department	OFFICE OF THE PROVOST
dc.contributor.department	DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi	10.1145/1216919.1216928
dc.description.sourcetitle	ACM/SIGDA International Symposium on Field Programmable Gate Arrays - FPGA
dc.description.page	60-68
dc.published.state	published
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM