Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/99324
Title: | Large vocabulary Mandarin Final recognition based on Two-Level Time-Delay Neural Networks (TLTDNN) | Authors: | Poo, G.-S. | Keywords: | Mandarin finals Speech recognition Time-Delay Neural Network (TDNN) |
Issue Date: | Jul-1997 | Citation: | Poo, G.-S. (1997-07). Large vocabulary Mandarin Final recognition based on Two-Level Time-Delay Neural Networks (TLTDNN). Speech Communication 22 (1) : 17-24. ScholarBank@NUS Repository. | Abstract: | A Two-Level Time-Delay Neural Network (TLTDNN) technique has been developed to recognize all Mandarin Finals of the entire Chinese syllables. The first level discriminates the vowel-group based on (a, e, i, o, u, v) and the nasal-group based on nasal ending (-n, -ng, -others). The nasal-group discriminator is used to further split the large /a/ subgroup produced by the vowel-group discriminator. The two groupings in the first level produce 8 subgroups in the second level. Further discrimination in the second level enables the identification of all 35 Mandarin Finals. The technique was thoroughly tested using 8 sets of 1265 isolated Hanyu Pinyin syllables, with 6 sets used for training and 2 sets used for testing. The overall result shows that a high recognition rate of 99.4% on the training datasets and 95.6% on the test datasets, is achievable. The top 4 recognition rate attained on the test datasets is as high as 99.1%. © 1997 Elsevier Science B.V. | Source Title: | Speech Communication | URI: | http://scholarbank.nus.edu.sg/handle/10635/99324 | ISSN: | 01676393 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.