Mô hình hóa đặc tính âm học động cho hệ thống nhận dạng tiếng nói việt bằng phần mềm kaldi và ứng dụng cho việc phân tích sự chuyển tiếp nguyên âm phụ âm =

Tài liệu tham khảo

Loại

Chi tiết

[1] D. O‘Shaughnessy, ―Invited paper: Automatic speech recognition: History, methods and challenges,‖ Pattern Recognit., vol. 41, no. 10, pp. 2965–2979, Oct. 2008

Sách, tạp chí

Tiêu đề:	Pattern Recognit

[2] D. Yu and L. Deng, Automatic Speech Recognition. London: Springer London, 2015

Sách, tạp chí

Tiêu đề:	Automatic Speech Recognition

[3] R. E. Gruhn, W. Minker, and S. Nakamura, Statistical Pronunciation Modeling for Non- Native Speech Processing. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011

Sách, tạp chí

Tiêu đề:	Statistical Pronunciation Modeling for Non-Native Speech Processing

[4] H. Bourlard, H. Hermansky, and N. Morgan, ―Towards increasing speech recognition error rates,‖ Speech Commun., vol. 18, no. 3, pp. 205–231, 1996

Sách, tạp chí

Tiêu đề:	Speech Commun

[5] S. Narang and M. D. Gupta, ―Speech Feature Extraction Techniques: A Review,‖ Int. J. Comput. Sci. Mob. Comput., vol. 4, no. 3, pp. 107–114, 2015

Sách, tạp chí

Tiêu đề:	Int. J. "Comput. Sci. Mob. Comput

[6] N. Desai, K. Dhameliya, and V. Desai, ―Feature extraction and classification techniques for speech recognition: A review,‖ Int. J. Emerg. Technol. Adv. Eng., vol. 3, no. 12, pp.367–371, 2013

Sách, tạp chí

Tiêu đề:	Int. J. Emerg. Technol. Adv. Eng

[7] U. Shrawankar and V. M. Thakare, ―Techniques for feature extraction in speech recognition system: A comparative study,‖ ArXiv Prepr. ArXiv13051145, 2013

Sách, tạp chí

Tiêu đề:	ArXiv Prepr. ArXiv13051145

[8] R. Kaur and V. Singh, ―Time-frequency domain characterization of stationary and non stationary signals,‖ Int. J. Res. Appl. Sci. Eng. Technol., vol. 2, no. 5, pp. 438–447, 2014

Sách, tạp chí

Tiêu đề:	Int. J. Res. Appl. Sci. Eng. Technol

[10] V. G. Skuk and S. R. Schweinberger, ―Influences of Fundamental Frequency, Formant Frequencies, Aperiodicity, and Spectrum Level on the Perception of Voice Gender,‖ J.Speech Lang. Hear. Res., vol. 57, no. 1, p. 285, Feb. 2014

Sách, tạp chí

Tiêu đề:	J. "Speech Lang. Hear. Res

[11] H. Traunmüller and A. Eriksson, ―The frequency range of the voice fundamental in the speech of male and female adults,‖ Unpubl. Manuscr., 1995

Sách, tạp chí

Tiêu đề:	Unpubl. Manuscr

[12] P. Divenyi, S. Greenberg, and G. Meyer, Eds., Dynamics of speech production and perception. Amsterdam ; Washington, DC: Ios Press, 2006

Sách, tạp chí

Tiêu đề:	Dynamics of speech production and perception

[13] Thi-Anh-Xuan TRAN, ―ACOUSTIC GESTURE MODELING. APPLICATION TO A VIETNAMESE SPEECH RECOGNITION SYSTEM,‖ Doctoral thesis, COMMUNITY UNIVERSITY GRENOBLE ALPES, 2016

Sách, tạp chí

Tiêu đề:	Doctoral thesis

[14] B. Schuller, ―Voice and Speech Analysis in Search of States and Traits,‖ in Computer Analysis of Human Behavior, Springer, London, 2011, pp. 227–253

Sách, tạp chí

Tiêu đề:	Computer Analysis of Human Behavior

[15] ―[Xuedong_Huang,_Alex_Acero,_Hsiao-Wuen_Hon]_Spoken(BookZZ.org).pdf.‖ . [16] H. Hermansky, B. Hanson, and H. Wakita, ―Perceptually based linear predictiveanalysis of speech,‖ in Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP’85., 1985, vol. 10, pp. 509–512

Sách, tạp chí

Tiêu đề:	Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP’85

[17] C. K. On, P. M. Pandiyan, S. Yaacob, and A. Saudi, ―Mel-frequency cepstral coefficient analysis in speech recognition,‖ in Computing & Informatics, 2006. ICOCI’06.International Conference on, 2006, pp. 1–5

Sách, tạp chí

Tiêu đề:	Computing & Informatics, 2006. ICOCI’06. "International Conference on

[18] C. J. Long and S. Datta, ―Wavelet based feature extraction for phoneme recognition,‖ in Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on, 1996, vol. 1, pp. 264–267

Sách, tạp chí

Tiêu đề:	Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on

[19] D. Seung and L. Lee, ―Algorithms for non-negative matrix factorization,‖ Adv. Neural Inf. Process. Syst., vol. 13, pp. 556–562, 2001

Sách, tạp chí

Tiêu đề:	Adv. Neural Inf. Process. Syst

[20] F. Zheng, G. Zhang, and Z. Song, ―Comparison of different implementations of MFCC,‖ J. Comput. Sci. Technol., vol. 16, no. 6, pp. 582–589, 2001

Sách, tạp chí

Tiêu đề:	J. Comput. Sci. Technol

[22] R. Vergin, D. O‘shaughnessy, and A. Farhat, ―Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition,‖IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 525–532, 1999

Sách, tạp chí

Tiêu đề:	IEEE Trans. Speech Audio Process

[9] ―Non-Stationary Nature of Speech Signal (Theory) : Speech Signal Processing Laboratory : Electronics & Communications : IIT GUWAHATI Virtual Lab.‖ [Online].Available: http://iitg.vlab.co.in/?sub=59&brch=164&sim=371&cnt=1104. [Accessed: 13- Mar-2018]

Link

Định dạng
Số trang	99
Dung lượng	5,77 MB