A Study of Caption Generation for Fashion Images Using Deep Learning

References

Type

Details

[1] K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio, “Show, attend and tell: Neural image caption generation with visual attention,” in International Conference on Machine Learning, Lille, 2015

Book, journal

Title:	Show, attend and tell: Neural image caption generation with visual attention

[3] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, pp. 2278-2324, 1998

Book, journal

Title:	Gradient-based learning applied to document recognition

[5] M. Z. Hossain, F. Sohel, M. F. Shiratuddin, and H. Laga, “A Comprehensive Survey of Deep Learning for Image Captioning,” arXiv.org, pp. 2-3, 13 May 2018

Book, journal

Title:	A Comprehensive Survey of Deep Learning for Image Captioning

[6] K. Cho, B. van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, “Learning phrase representations using RNN encoder-decoder for statistical machine translation,” in EMNLP, Doha, 2014

Book, journal

Title:	Learning phrase representations using RNN encoder-decoder for statistical machine translation

[7] R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng, “Grounded compositional semantics for finding and describing images with sentences,” Transactions of the Association for Computational Linguistics, pp. 207-218, 2014

Book, journal

Title:	Grounded compositional semantics for finding and describing images with sentences

[8] A. Karpathy, A. Joulin, and L. Fei-Fei, “Deep fragment embeddings for bidirectional image sentence mapping,” in Advances in Neural Information Processing Systems, Montreal, 2014

Book, journal

Title:	Deep fragment embeddings for bidirectional image sentence mapping

[9] F. Rosenblatt, “The Perceptron: A Probabilistic Model For Information Storage And Organization In The Brain,” Psychological Review, pp. 386–408, 1958

Book, journal

Title:	The Perceptron: A Probabilistic Model For Information Storage And Organization In The Brain

[10] Zhang, Tong (2004). "Solving large scale linear prediction problems using stochastic gradient descent algorithms". Proceedings of the 21st International Conference on Machine Learning (ICML'04)

Book, journal

Title:	Solving large scale linear prediction problems using stochastic gradient descent algorithms
Author:	Zhang, Tong
Year:	2004

[12] Sepp Hochreiter; Jürgen Schmidhuber (1997). "Long short-term memory". Neural Computation. 9 (8): 1735–1780. doi: 10.1162/neco.1997.9.8.1735. PMID 9377276

Book, journal

Title:	Long short-term memory
Author:	Sepp Hochreiter; Jürgen Schmidhuber
Year:	1997

[14] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, “BLEU: a Method for Automatic Evaluation of Machine Translation,” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, 2002

Book, journal

Title:	BLEU: a Method for Automatic Evaluation of Machine Translation

[15] C.-Y. Lin, “ROUGE: A Package for Automatic Evaluation of Summaries,” in Proceedings of the ACL Workshop: Text Summarization Branches Out, 2004

Book, journal

Title:	ROUGE: A Package for Automatic Evaluation of Summaries

[16] R. Vedantam, C. L. Zitnick, and D. Parikh, “CIDEr: Consensus-based Image Description Evaluation,” 2014

Book, journal

Title:	CIDEr: Consensus-based Image Description Evaluation

[17] Negar Rostamzadeh, Seyedarian Hosseini, Thomas Boquet, Wojciech Stokowiec, Ying Zhang, Christian Jauvin, Chris Pal, "Fashion-Gen: The Generative Fashion Dataset and Challenge," ArXiv e-prints, 2018

Book, journal

Title:	Fashion-Gen: The Generative Fashion Dataset and Challenge

[4] Alex Sherstinsky, "Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network"

Other

[11] "CS231n Convolutional Neural Networks for Visual Recognition". cs231n.github.io. Retrieved 2018-12-13

Other

[13] Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua, "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

Other

Title	A Study of Caption Generation for Fashion Images Using Deep Learning
Author	Vũ Nguyên Hưng
Supervisor	Dr. Nguyễn Thiên Bảo
University	Ho Chi Minh City University of Technology and Education
Major	Information Technology
Document type	Graduation thesis
Year of publication	2020
City	Ho Chi Minh City

Format
Pages	58
File size	4.42 MB


Convolutional neural network (CNN)

Recurrent neural network (RNN)