References |
Type |
Details |
[1] K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio, “Show, attend and tell: Neural image caption generation with visual attention,” in International Conference on Machine Learning, Lille, 2015 |
Book, journal
Title:
Show, attend and tell: Neural image caption generation with visual attention
[3] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, pp. 2278-2324, 1998 |
Book, journal
Title:
Gradient-based learning applied to document recognition
[5] M. Z. Hossain, F. Sohel, M. F. Shiratuddin, and H. Laga, “A Comprehensive Survey of Deep Learning for Image Captioning,” arXiv.org, pp. 2-3, 13 May 2018 |
Book, journal
Title:
A Comprehensive Survey of Deep Learning for Image Captioning
[6] K. Cho, B. van Merriënboer, C. Gulcehre, F. Bougares, H. Schwenk, and Y. Bengio, “Learning phrase representations using RNN encoder-decoder for statistical machine translation,” in EMNLP, Doha, 2014 |
Book, journal
Title:
Learning phrase representations using RNN encoder-decoder for statistical machine translation
[7] R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng, “Grounded compositional semantics for finding and describing images with sentences,” Transactions of the Association for Computational Linguistics, pp. 207-218, 2014 |
Book, journal
Title:
Grounded compositional semantics for finding and describing images with sentences
[8] A. Karpathy, A. Joulin, and L. Fei-Fei, “Deep fragment embeddings for bidirectional image sentence mapping,” in Advances in Neural Information Processing Systems, Montreal, 2014 |
Book, journal
Title:
Deep fragment embeddings for bidirectional image sentence mapping
[9] F. Rosenblatt, “The Perceptron: A Probabilistic Model For Information Storage And Organization In The Brain,” Psychological Review, pp. 386–408, 1958 |
Book, journal
Title:
The Perceptron: A Probabilistic Model For Information Storage And Organization In The Brain
[10] T. Zhang, “Solving large scale linear prediction problems using stochastic gradient descent algorithms,” in Proceedings of the 21st International Conference on Machine Learning (ICML'04), 2004 |
Book, journal
Title:
Solving large scale linear prediction problems using stochastic gradient descent algorithms
Author:
Zhang, Tong
Year:
2004
[12] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. doi: 10.1162/neco.1997.9.8.1735. PMID 9377276 |
Book, journal
Title:
Long short-term memory
Author:
Sepp Hochreiter; Jürgen Schmidhuber
Year:
1997
[14] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, “BLEU: a Method for Automatic Evaluation of Machine Translation,” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, 2002 |
Book, journal
Title:
BLEU: a Method for Automatic Evaluation of Machine Translation
[15] C.-Y. Lin, “ROUGE: A Package for Automatic Evaluation of Summaries,” in Proceedings of the ACL Workshop: Text Summarization Branches Out 2004, 2004 |
Book, journal
Title:
ROUGE: A Package for Automatic Evaluation of Summaries
[16] R. Vedantam, C. L. Zitnick, and D. Parikh, “CIDEr: Consensus-based Image Description Evaluation,” 2014 |
Book, journal
Title:
CIDEr: Consensus-based Image Description Evaluation
[17] N. Rostamzadeh, S. Hosseini, T. Boquet, W. Stokowiec, Y. Zhang, C. Jauvin, and C. Pal, “Fashion-Gen: The Generative Fashion Dataset and Challenge,” ArXiv e-prints, 2018 |
Book, journal
Title:
Fashion-Gen: The Generative Fashion Dataset and Challenge
[4] A. Sherstinsky, “Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network” |
Other
[11] “CS231n Convolutional Neural Networks for Visual Recognition,” cs231n.github.io. Retrieved 2018-12-13 |
Other
[13] L. Chen, H. Zhang, J. Xiao, L. Nie, J. Shao, W. Liu, and T.-S. Chua, “SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning” |
Other