Please use this identifier to cite or link to this item:
https://doi.org/10.1145/1216919.1216928
DC Field | Value | |
---|---|---|
dc.title | A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA | |
dc.contributor.author | Lin E.C. | |
dc.contributor.author | Yu K. | |
dc.contributor.author | Rutenbar R.A. | |
dc.contributor.author | Chen T. | |
dc.date.accessioned | 2018-08-21T05:07:13Z | |
dc.date.available | 2018-08-21T05:07:13Z | |
dc.date.issued | 2007 | |
dc.identifier.citation | Lin E.C., Yu K., Rutenbar R.A., Chen T. (2007). A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA. ACM/SIGDA International Symposium on Field Programmable Gate Arrays - FPGA : 60-68. ScholarBank@NUS Repository. https://doi.org/10.1145/1216919.1216928 | |
dc.identifier.isbn | 1595936009 | |
dc.identifier.isbn | 9781595936004 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/146269 | |
dc.description.abstract | The Carnegie Mellon In Silico Vox project seeks to move best-quality speech recognition technology from its current software-only form into a range of efficient all-hardware implementations. The central thesis is that, like graphics chips, the application is simply too performance hungry, and too power sensitive, to stay as a large software application. As a first step in this direction, we describe the design and implementation of a fully functional speech-to-text recognizer on a single Xilinx XUP platform. The design recognizes a 1000 word vocabulary, is speaker-independent, recognizes continuous (connected) speech, and is a "live mode" engine, wherein recognition can start as soon as speech input appears. To the best of our knowledge, this is the most complex recognizer architecture ever fully committed to a hardware-only form. The implementation is extraordinarily small, and achieves the same accuracy as state-of-the-art software recognizers, while running at a fraction of the clock speed. Copyright 2007 ACM. | |
dc.source | Scopus | |
dc.subject | DSP | |
dc.subject | FPGA | |
dc.subject | In silico vox | |
dc.subject | Speech recognition | |
dc.type | Conference Paper | |
dc.contributor.department | OFFICE OF THE PROVOST | |
dc.contributor.department | DEPARTMENT OF COMPUTER SCIENCE | |
dc.description.doi | 10.1145/1216919.1216928 | |
dc.description.sourcetitle | ACM/SIGDA International Symposium on Field Programmable Gate Arrays - FPGA | |
dc.description.page | 60-68 | |
dc.published.state | published | |
Appears in Collections: | Staff Publications |
Show simple item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.