Tài liệu tham khảo |
Loại |
Chi tiết |
[1] D. O‘Shaughnessy, ―Invited paper: Automatic speech recognition: History, methods and challenges,‖ Pattern Recognit., vol. 41, no. 10, pp. 2965–2979, Oct. 2008 |
Sách, tạp chí |
Tiêu đề: |
Pattern Recognit |
|
[2] D. Yu and L. Deng, Automatic Speech Recognition. London: Springer London, 2015 |
Sách, tạp chí |
Tiêu đề: |
Automatic Speech Recognition |
|
[3] R. E. Gruhn, W. Minker, and S. Nakamura, Statistical Pronunciation Modeling for Non- Native Speech Processing. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011 |
Sách, tạp chí |
Tiêu đề: |
Statistical Pronunciation Modeling for Non-Native Speech Processing |
|
[4] H. Bourlard, H. Hermansky, and N. Morgan, ―Towards increasing speech recognition error rates,‖ Speech Commun., vol. 18, no. 3, pp. 205–231, 1996 |
Sách, tạp chí |
|
[5] S. Narang and M. D. Gupta, ―Speech Feature Extraction Techniques: A Review,‖ Int. J. Comput. Sci. Mob. Comput., vol. 4, no. 3, pp. 107–114, 2015 |
Sách, tạp chí |
Tiêu đề: |
Int. J. "Comput. Sci. Mob. Comput |
|
[6] N. Desai, K. Dhameliya, and V. Desai, ―Feature extraction and classification techniques for speech recognition: A review,‖ Int. J. Emerg. Technol. Adv. Eng., vol. 3, no. 12, pp.367–371, 2013 |
Sách, tạp chí |
Tiêu đề: |
Int. J. Emerg. Technol. Adv. Eng |
|
[7] U. Shrawankar and V. M. Thakare, ―Techniques for feature extraction in speech recognition system: A comparative study,‖ ArXiv Prepr. ArXiv13051145, 2013 |
Sách, tạp chí |
Tiêu đề: |
ArXiv Prepr. ArXiv13051145 |
|
[8] R. Kaur and V. Singh, ―Time-frequency domain characterization of stationary and non stationary signals,‖ Int. J. Res. Appl. Sci. Eng. Technol., vol. 2, no. 5, pp. 438–447, 2014 |
Sách, tạp chí |
Tiêu đề: |
Int. J. Res. Appl. Sci. Eng. Technol |
|
[10] V. G. Skuk and S. R. Schweinberger, ―Influences of Fundamental Frequency, Formant Frequencies, Aperiodicity, and Spectrum Level on the Perception of Voice Gender,‖ J.Speech Lang. Hear. Res., vol. 57, no. 1, p. 285, Feb. 2014 |
Sách, tạp chí |
Tiêu đề: |
J. "Speech Lang. Hear. Res |
|
[11] H. Traunmüller and A. Eriksson, ―The frequency range of the voice fundamental in the speech of male and female adults,‖ Unpubl. Manuscr., 1995 |
Sách, tạp chí |
|
[12] P. Divenyi, S. Greenberg, and G. Meyer, Eds., Dynamics of speech production and perception. Amsterdam ; Washington, DC: Ios Press, 2006 |
Sách, tạp chí |
Tiêu đề: |
Dynamics of speech production and perception |
|
[13] Thi-Anh-Xuan TRAN, ―ACOUSTIC GESTURE MODELING. APPLICATION TO A VIETNAMESE SPEECH RECOGNITION SYSTEM,‖ Doctoral thesis, COMMUNITY UNIVERSITY GRENOBLE ALPES, 2016 |
Sách, tạp chí |
|
[14] B. Schuller, ―Voice and Speech Analysis in Search of States and Traits,‖ in Computer Analysis of Human Behavior, Springer, London, 2011, pp. 227–253 |
Sách, tạp chí |
Tiêu đề: |
Computer Analysis of Human Behavior |
|
[15] ―[Xuedong_Huang,_Alex_Acero,_Hsiao-Wuen_Hon]_Spoken(BookZZ.org).pdf.‖ . [16] H. Hermansky, B. Hanson, and H. Wakita, ―Perceptually based linear predictiveanalysis of speech,‖ in Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP’85., 1985, vol. 10, pp. 509–512 |
Sách, tạp chí |
Tiêu đề: |
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP’85 |
|
[17] C. K. On, P. M. Pandiyan, S. Yaacob, and A. Saudi, ―Mel-frequency cepstral coefficient analysis in speech recognition,‖ in Computing & Informatics, 2006. ICOCI’06.International Conference on, 2006, pp. 1–5 |
Sách, tạp chí |
Tiêu đề: |
Computing & Informatics, 2006. ICOCI’06. "International Conference on |
|
[18] C. J. Long and S. Datta, ―Wavelet based feature extraction for phoneme recognition,‖ in Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on, 1996, vol. 1, pp. 264–267 |
Sách, tạp chí |
Tiêu đề: |
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on |
|
[19] D. Seung and L. Lee, ―Algorithms for non-negative matrix factorization,‖ Adv. Neural Inf. Process. Syst., vol. 13, pp. 556–562, 2001 |
Sách, tạp chí |
Tiêu đề: |
Adv. Neural Inf. Process. Syst |
|
[20] F. Zheng, G. Zhang, and Z. Song, ―Comparison of different implementations of MFCC,‖ J. Comput. Sci. Technol., vol. 16, no. 6, pp. 582–589, 2001 |
Sách, tạp chí |
Tiêu đề: |
J. Comput. Sci. Technol |
|
[22] R. Vergin, D. O‘shaughnessy, and A. Farhat, ―Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition,‖IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 525–532, 1999 |
Sách, tạp chí |
Tiêu đề: |
IEEE Trans. Speech Audio Process |
|
[9] ―Non-Stationary Nature of Speech Signal (Theory) : Speech Signal Processing Laboratory : Electronics & Communications : IIT GUWAHATI Virtual Lab.‖ [Online].Available: http://iitg.vlab.co.in/?sub=59&brch=164&sim=371&cnt=1104. [Accessed: 13- Mar-2018] |
Link |
|