Tài liệu tham khảo |
Loại |
Chi tiết |
[3] William Chan, Navdeep Jaitly, Quoc Le, and Oriol Vinyals.Listen, attend and spell: A neural network for large vocabu- lary conversational speech recognition. In Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pages 4960–4964. IEEE, 2016 |
Sách, tạp chí |
Tiêu đề: |
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition |
Tác giả: |
William Chan, Navdeep Jaitly, Quoc Le, Oriol Vinyals |
Nhà XB: |
IEEE |
Năm: |
2016 |
|
[4] Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, and Navdeep Jaitly. A Comparison of sequence-to-sequence models for speech recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, vol- ume 2017-Augus, pages 939–943, 2017 |
Sách, tạp chí |
Tiêu đề: |
A Comparison of sequence-to-sequence models for speech recognition |
Tác giả: |
Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, Navdeep Jaitly |
Nhà XB: |
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Năm: |
2017 |
|
[7] Alex Graves, Santiago Fernández, Faustino Gomez, and J¨ urgen Schmidhuber. Connectionist temporal classifica- tion. Proceedings of the 23rd international conference on Machine learning - ICML ’06, pages 369–376, 2006 |
Sách, tạp chí |
Tiêu đề: |
Connectionist temporal classification |
Tác giả: |
Alex Graves, Santiago Fernández, Faustino Gomez, Jürgen Schmidhuber |
Nhà XB: |
Proceedings of the 23rd international conference on Machine learning - ICML '06 |
Năm: |
2006 |
|
[8] Theodore Bluche, Hermann Ney, Jerome Louradour, and Christopher Kermorvant. Framewise and ctc training of neural networks for handwriting recognition. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), ICDAR ’15, pages 81– |
Sách, tạp chí |
Tiêu đề: |
Framewise and ctc training of neural networks for handwriting recognition |
Tác giả: |
Theodore Bluche, Hermann Ney, Jerome Louradour, Christopher Kermorvant |
Nhà XB: |
Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR) |
Năm: |
2015 |
|
[9] Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catan- zaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, and Andrew Y.Ng. Deep speech: Scaling up end-to-end speech recognition.CoRR, abs/1412.5567, 2014 |
Sách, tạp chí |
Tiêu đề: |
Deep speech: Scaling up end-to-end speech recognition |
Tác giả: |
Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catan-zaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y.Ng |
Nhà XB: |
CoRR |
Năm: |
2014 |
|
[10] Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, and Yoshua Bengio. Attention-based models for speech recognition. CoRR, abs/1506.07503, 2015 |
Sách, tạp chí |
Tiêu đề: |
Attention-based models for speech recognition |
Tác giả: |
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, Yoshua Bengio |
Nhà XB: |
CoRR |
Năm: |
2015 |
|
[13] Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, and Zhenyao Zhu. Explor- ing neural transducers for end-to-end speech recognition.CoRR, abs/1707.07413, 2017 |
Sách, tạp chí |
Tiêu đề: |
Exploring neural transducers for end-to-end speech recognition |
Tác giả: |
Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu |
Nhà XB: |
CoRR |
Năm: |
2017 |
|
[14] Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, and Sanjeev Khudanpur. Purely sequence-trained neural networks for asr based on lattice-free mmi. In Interspeech, pages 2751–2755, 2016 |
Sách, tạp chí |
Tiêu đề: |
Purely sequence-trained neural networks for asr based on lattice-free mmi |
Tác giả: |
Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, Sanjeev Khudanpur |
Nhà XB: |
Interspeech |
Năm: |
2016 |
|
[15] Abdel-rahman Mohamed, George Dahl, and Geoffrey Hin- ton. Deep belief networks for phone recognition. In Nips workshop on deep learning for speech recognition and related applications, volume 1, page 39. Vancouver, Canada, 2009 |
Sách, tạp chí |
Tiêu đề: |
Deep belief networks for phone recognition |
Tác giả: |
Abdel-rahman Mohamed, George Dahl, Geoffrey Hinton |
Nhà XB: |
Nips workshop on deep learning for speech recognition and related applications |
Năm: |
2009 |
|
[16] Yann LeCun, Léon Bottou, Genevieve B. Orr, and Klaus- Robert M¨ uller. Efficient backprop. In Neural Networks:Tricks of the Trade - Second Edition, pages 9–48. 2012 |
Sách, tạp chí |
Tiêu đề: |
Neural Networks: Tricks of the Trade - Second Edition |
Tác giả: |
Yann LeCun, Léon Bottou, Genevieve B. Orr, Klaus-Robert Müller |
Năm: |
2012 |
|
[20] Hugo Van Hamme and Filip Van Aelten. An adaptive-beam pruning technique for continuous speech recognition. In Fourth International Conference on Spoken Language Pro- cessing, 1996 |
Sách, tạp chí |
Tiêu đề: |
An adaptive-beam pruning technique for continuous speech recognition |
Tác giả: |
Hugo Van Hamme, Filip Van Aelten |
Nhà XB: |
Fourth International Conference on Spoken Language Processing |
Năm: |
1996 |
|
[22] B H Tran y H Ney V. Steinbiss. Improvements in Beam Search. Proc. of the International Conference on Spo- ken Language Processing (ICSLP), (July 2014):2140–2143, 1994 |
Sách, tạp chí |
Tiêu đề: |
Improvements in Beam Search |
Tác giả: |
B H Tran, H Ney V. Steinbiss |
Nhà XB: |
Proc. of the International Conference on Spoken Language Processing (ICSLP) |
Năm: |
1994 |
|
[23] Stefan Ortmanns, Hermann Ney, and Andreas Eiden.Language-model look-ahead for large vocabulary speech |
Sách, tạp chí |
Tiêu đề: |
Language-model look-ahead for large vocabulary speech |
Tác giả: |
Stefan Ortmanns, Hermann Ney, Andreas Eiden |
|
[24] Vassil Panayotov, Guoguo Chen, Daniel Povey, and Sanjeev Khudanpur. Librispeech: An ASR corpus based on public domain audio books, 2015 |
Sách, tạp chí |
Tiêu đề: |
Librispeech: An ASR corpus based on public domain audio books |
Tác giả: |
Vassil Panayotov, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur |
Năm: |
2015 |
|
[25] Kenneth Heafield. Kenlm: Faster and smaller language model queries. In Proceedings of the Sixth Workshop on Statistical Machine Translation, WMT ’11, pages 187–197, Stroudsburg, PA, USA, 2011. Association for Computa- tional Linguistics |
Sách, tạp chí |
Tiêu đề: |
Kenlm: Faster and smaller language model queries |
Tác giả: |
Kenneth Heafield |
Nhà XB: |
Association for Computational Linguistics |
Năm: |
2011 |
|
[27] W Nelson Francis and Henry Kucera. The Brown Corpus: A Standard Corpus of Present-Day Edited American English, 1979 |
Sách, tạp chí |
Tiêu đề: |
The Brown Corpus: A Standard Corpus of Present-Day Edited American English |
Tác giả: |
W Nelson Francis, Henry Kucera |
Năm: |
1979 |
|
[5] Hiroaki Sakoe and Seibi Chiba. Readings in speech recogni- tion. chapter Dynamic Programming Algorithm Optimiza- tion for Spoken Word Recognition, pages 159–165. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1990 |
Khác |
|
[6] Xuedong Huang, James Baker, and Raj Reddy. A Histor- ical Perspective of Speech Recognition. Commun. ACM, 57(1):94–103, 2014 |
Khác |
|
[11] Albert Zeyer, Kazuki Irie, Ralf Schl¨ uter, and Hermann Ney.Improved training of end-to-end attention models for speech recognition. CoRR, abs/1805.03294, 2018 |
Khác |
|
[12] Alex Graves, Abdel-rahman Mohamed, and Geoffrey E.Hinton. Speech recognition with deep recurrent neural net- works. CoRR, abs/1303.5778, 2013 |
Khác |
|