Tài liệu tham khảo |
Loại |
Chi tiết |
[2] L. R. Rabiner and R. W. Schafer, “Introduction to digital speech processing,” Found. Trends Signal Process., vol. 1, no. 1, pp. 1–194, Jan. 2007, doi:10.1561/2000000001 |
Sách, tạp chí |
Tiêu đề: |
Introduction to digital speech processing,” "Found. Trends Signal Process |
|
[4] M. S. Hawley et al., “A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment,” IEEE Trans. Neural Syst. Rehabil.Eng., vol. 21, no. 1, pp. 23–31, Jan. 2013, doi: 10.1109/TNSRE.2012.2209678 |
Sách, tạp chí |
Tiêu đề: |
et al.", “A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment,” "IEEE Trans. Neural Syst. Rehabil. "Eng |
|
[5] M. Hoy, “Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants,” Med. Ref. Serv. Q., vol. 37, pp. 81–88, Jan. 2018, doi |
Sách, tạp chí |
Tiêu đề: |
Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants,” "Med. Ref. Serv. Q |
|
[6] M. Schrửder, M. Charfuelan, S. Pammi, and I. Steiner, “Open source voice creation toolkit for the MARY TTS Platform,” p. 5 |
Sách, tạp chí |
Tiêu đề: |
Open source voice creation toolkit for the MARY TTS Platform |
|
[7] T. T. T. Nguyen, “HMM-based Vietnamese Text-To-Speech : Prosodic Phrasing Modeling, Corpus Design System Design, and Evaluation,”phdthesis, Université Paris Sud - Paris XI, 2015. Accessed: Apr. 18, 2021.[Online]. Available: https://tel.archives-ouvertes.fr/tel-01260884 |
Sách, tạp chí |
Tiêu đề: |
HMM-based Vietnamese Text-To-Speech : Prosodic Phrasing Modeling, Corpus Design System Design, and Evaluation |
|
[8] H. Zen, A. Senior, and M. Schuster, “STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS,” p. 5 |
Sách, tạp chí |
Tiêu đề: |
STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS |
|
[9] Z. Wu, O. Watts, and S. King, “Merlin: An Open Source Neural Network Speech Synthesis System,” Sep. 2016, pp. 202–207. doi: 10.21437/SSW.2016- 33 |
Sách, tạp chí |
Tiêu đề: |
Merlin: An Open Source Neural Network Speech Synthesis System |
|
[10] Y. Wang et al., “Tacotron: Towards End-to-End Speech Synthesis,” ArXiv170310135 Cs, Apr. 2017, Accessed: Apr. 24, 2021. [Online]. Available:http://arxiv.org/abs/1703.10135 |
Sách, tạp chí |
Tiêu đề: |
et al.", “Tacotron: Towards End-to-End Speech Synthesis,” "ArXiv170310135 Cs |
|
[11] R. Skerry-Ryan et al., “Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron,” p. 10 |
Sách, tạp chí |
Tiêu đề: |
et al.", “Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron |
|
[12] J. Shen et al., “Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions,” ArXiv171205884 Cs, Feb. 2018, Accessed: Apr. 22, 2021. [Online]. Available: http://arxiv.org/abs/1712.05884 |
Sách, tạp chí |
Tiêu đề: |
et al.", “Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions,” "ArXiv171205884 Cs |
|
[13] A. van den Oord et al., “WaveNet: A Generative Model for Raw Audio,” ArXiv160903499 Cs, Sep. 2016, Accessed: Apr. 24, 2021. [Online]. Available:http://arxiv.org/abs/1609.03499 |
Sách, tạp chí |
Tiêu đề: |
et al.", “WaveNet: A Generative Model for Raw Audio,” "ArXiv160903499 Cs |
|
[14] Fu-Chiang Chou, Chiu-Yu Tseng, and Lin-Shan Lee, “Automatic generation of prosodic structure for high quality Mandarin speech synthesis,”in Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ’96, Philadelphia, PA, USA, 1996, vol. 3, pp. 1624–1627.doi: 10.1109/ICSLP.1996.607935 |
Sách, tạp chí |
Tiêu đề: |
Automatic generation of prosodic structure for high quality Mandarin speech synthesis,” in "Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ’96 |
|
[15] J. Tao, H. Dong, and S. Zhao, “Rule learning based Chinese prosodic phrase prediction,” in International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003, Oct. 2003, pp. 425–432.doi: 10.1109/NLPKE.2003.1275944 |
Sách, tạp chí |
Tiêu đề: |
Rule learning based Chinese prosodic phrase prediction,” in "International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 |
|
[17] J. Apel, F. Neubarth, H. Pirker, and H. Trost, “Have a break! Modelling pauses in German Speech.,” p. 8 |
Sách, tạp chí |
Tiêu đề: |
Have a break! Modelling pauses in German Speech |
|
[18] P. Chistikov and O. Khomitsevich, “Improving Prosodic Break Detection in a Russian TTS System,” in Speech and Computer, Cham, 2013, pp. 181– |
Sách, tạp chí |
Tiêu đề: |
Improving Prosodic Break Detection in a Russian TTS System,” in "Speech and Computer |
|
[19] P. Sarkar and K. Rao, Data-Driven Pause Prediction for Speech Synthesis in Storytelling Style Speech. 2015. doi: 10.13140/RG.2.1.2079.3042 |
Sách, tạp chí |
Tiêu đề: |
Data-Driven Pause Prediction for Speech Synthesis in Storytelling Style Speech |
|
[20] T. T. T. Nguyen, A. Rilliard, and D. D. Tran, “Prosodic Phrasing Modeling for Vietnamese TTS Using Syntactic Information,” p. 5 |
Sách, tạp chí |
Tiêu đề: |
Prosodic Phrasing Modeling for Vietnamese TTS Using Syntactic Information |
|
[22] M. Nespor and I. Vogel, “Prosodic Structure Above the Word,” in Prosody: Models and Measurements, A. Cutler and D. R. Ladd, Eds. Berlin, Heidelberg:Springer, 1983, pp. 123–140. doi: 10.1007/978-3-642-69103-4_10 |
Sách, tạp chí |
Tiêu đề: |
Prosodic Structure Above the Word,” in "Prosody: "Models and Measurements |
|
[23] H. Zen, T. Toda, M. Nakamura, and K. Tokuda, “Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005,”IEICE Trans., vol. 90-D, pp. 325–333, Jan. 2007, doi: 10.1093/ietisy/e90- 1.1.325 |
Sách, tạp chí |
Tiêu đề: |
Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005,” "IEICE Trans |
|
35–39. Accessed: Apr. 24, 2021. [Online]. Available: https://www.aclweb.org/anthology/2020.vlsp-1.7 |
Link |
|