Tài liệu tham khảo |
Loại |
Chi tiết |
[1] Jurafsky, Daniel and Martin, James H. Speech and Language Processing - 2nd Edition. Prentice Hall, ISBN-13: 978-0131873216, ISBN-10: 0131873210, 2008 |
Sách, tạp chí |
Tiêu đề: |
Speech and Language Processing - 2nd Edition |
|
[2] Ambra, N. and Catia, C. and Wilhelmus, S. "Automatic Speech Recognition for second language learning: How and why it actually works." International Congress of Phonetic Sciences (ICPhS).Barcelona, 2003 |
Sách, tạp chí |
Tiêu đề: |
Automatic Speech Recognition for second language learning: How and why it actually works |
|
[4] Đức, Đặng Ngọc. Mạng nơron và mô hình Markov ẩn trong nhận dạng tiếng Việt. Hà Nội: Luấn án tiến sỹ, Trường ĐH Khoa học tự nhiên – ĐH Quốc gia hà Nội, 2003 |
Sách, tạp chí |
Tiêu đề: |
Mạng nơron và mô hình Markov ẩn trong nhận dạng tiếng Việt |
|
[5] Lei, Xin. Modeling Lexical Tones for Mandarin Large Vocabulary Continuous Speech Recognition. USA: University of Washington, 2006 |
Sách, tạp chí |
Tiêu đề: |
Modeling Lexical Tones for Mandarin Large Vocabulary Continuous Speech Recognition |
|
[6] Muda, Lindasalwa and Begam, Mumtaj and Elamvazuthi, I. "Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques."journal of computing, V.2, No.2, ISSN 2151-9617, 2010 |
Sách, tạp chí |
Tiêu đề: |
Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques |
|
[7] Florian, Honig and Georg, Stemmer and Christian, Hacker and Fabio, Brugnara. "Revising Perceptual Linear Prediction (PLP)." INTERSPEECH. Lisbon, Portugal, 2005 |
Sách, tạp chí |
Tiêu đề: |
Revising Perceptual Linear Prediction (PLP) |
|
[8] Haeb-Umbach, R. and Ney, H. "Linear discriminant analysis for improved large vocabulary continuous speech recognition." Acoustics, Speech, and Signal Processing (ICASSP). California, USA, 1992. 13-16 |
Sách, tạp chí |
Tiêu đề: |
Linear discriminant analysis for improved large vocabulary continuous speech recognition |
|
[9] Sakai, M.,Denso Corp. "Generalization of Linear Discriminant Analysis used in Segmental Unit Input HMM for Speech Recognition." Acoustics, Speech and Signal Processing (ICASSP). Honolulu, 2007. IV-333 - IV-336 |
Sách, tạp chí |
Tiêu đề: |
Generalization of Linear Discriminant Analysis used in Segmental Unit Input HMM for Speech Recognition |
|
[10] Psutka, Josef V. "Benefit of Maximum Likelihood Linear Transform (MLLT) Used at Different Levels of Covariance Matrices Clustering in ASR Systems." Text, Speech and Dialogue, 10th International Conference (TSD). Czech Republic, 2007 |
Sách, tạp chí |
Tiêu đề: |
Benefit of Maximum Likelihood Linear Transform (MLLT) Used at Different Levels of Covariance Matrices Clustering in ASR Systems |
|
[11] Anastasakos, T. and McDonough, J. and Makhoul, J. "Speaker adaptive training: a maximum likelihood approach to speaker normalization." Acoustics, Speech and Signal Processing (ICASSP).Munich, 1997. 1043 – 1046 |
Sách, tạp chí |
Tiêu đề: |
Speaker adaptive training: a maximum likelihood approach to speaker normalization |
|
[12] Martin, Karafiat and Lukas, Burget and Pavel, Matejka and Ondrej, Glembek. "iVector-Based Discriminative Adaptation for Automatic Speech Recognition." Automatic Speech Recognition and Understanding (ASRU). Waikoloa: IEEE, 2011. 152-157 |
Sách, tạp chí |
Tiêu đề: |
iVector-Based Discriminative Adaptation for Automatic Speech Recognition |
|
[13] F. Metze, Z. A. W. Sheikh, A. Waibel, J. Gehring, K. Kilgour, Q. B. Nguyen, and V. H. Nguyen, “Models of tone for tonal and non-tonal languages,” in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, Dec 2013, pp. 261–266 |
Sách, tạp chí |
Tiêu đề: |
Models of tone for tonal and non-tonal languages,” in "2013 IEEE Workshop on Automatic Speech "Recognition and Understanding |
|
[14] Tuerxun, M. and Zhang, Shiliang and Bao, Yebo and Dai, Lirong. "Improvements on bottleneck feature for large vocabulary continuous speech recognition." Signal Processing (ICSP). Hangzhou, 2014. 516 – 520 |
Sách, tạp chí |
Tiêu đề: |
Improvements on bottleneck feature for large vocabulary continuous speech recognition |
|
[15] Ravanelli, M. and Do, Van Hai and Janin, A. "TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks." Chinese Spoken Language Processing (ISCSLP). Singapore, 2014. 113 – 117 |
Sách, tạp chí |
Tiêu đề: |
TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks |
|
[16] Kevin, K. and Heck, M. and Muller, Markus and Sperber, Matthias and Stuker, Sebastian and Waibe, Alex. "The 2014 KIT IWSLT Speech-to-Text Systems for English, German and Italian." The International Workshop on Spoken Language Translation (IWSLT). Lake Tahoe, USA, 2014 |
Sách, tạp chí |
Tiêu đề: |
The 2014 KIT IWSLT Speech-to-Text Systems for English, German and Italian |
|
[17] Shen, Peng and Lu, Xugang and Hu, Xinhui and Kanda, Naoyuki and Saiko, Masahiro and Hori, Chiori. "The NICT ASR System for IWSLT 2014." The International Workshop on Spoken Language Translation (IWSLT). Lake Tahoe, USA, 2014 |
Sách, tạp chí |
Tiêu đề: |
The NICT ASR System for IWSLT 2014 |
|
[18] Ochiai, T. and Matsuda, S. and Lu, Xugang and Hori, C. and Katagiri, S. "Speaker Adaptive Training using Deep Neural Networks." Acoustics, Speech and Signal Processing (ICASSP).Florence, 2014. 6349 – 6353 |
Sách, tạp chí |
Tiêu đề: |
Speaker Adaptive Training using Deep Neural Networks |
|
[19] Daniel, Povey and Arnab, Ghoshal and Gilles, Boulianne and Lukas, Burget and Ondrej, Glembek and Nagendra, Goel and Mirko, Hannemann and Petr, Motlicek and Yanmin, Qian and Petr, Schwarz and Jan, Silovsky and Georg, Stemmer and Karel, Vesely. "The Kaldi Speech Recognition Toolkit."Automatic Speech Recognition and Understanding. Hawaii, US, 2011 |
Sách, tạp chí |
Tiêu đề: |
The Kaldi Speech Recognition Toolkit |
|
[20] Tokuda, K. and Masuko, Takashi and Miyazaki, Noboru and Kobayashi, Takao. "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling." Acoustics, Speech, and Signal Processing (ICASSP). Phoenix, USA, 1999. 229-232 |
Sách, tạp chí |
Tiêu đề: |
Hidden Markov models based on multi-space probability distribution for pitch pattern modeling |
|
[21] Yu, Kai and Young, S. "Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis." Audio, Speech, and Language Processing, IEEE, V. 19, Issue 5, ISSN:1558-7916 [IEEE], 2010: 1071 – 1079 |
Sách, tạp chí |
Tiêu đề: |
Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis |
|