Applying the SlowFast Network to Action Recognition in Video

References

[1] David H Hubel and Torsten N Wiesel (1965). “Receptive fields and functional architecture in two nonstriate visual areas (18 and 19) of the cat”. In: Journal of neurophysiology 28.2, pp. 229–289

[2] AM Derrington and P Lennie (1984). “Spatial and temporal contrast sensitivities of neurones in lateral geniculate nucleus of macaque.” In: The Journal of physiology 357.1, pp. 219–240

[3] Margaret Livingstone and David Hubel (1988). “Segregation of form, color, movement, and depth: anatomy, physiology, and perception”. In: Science 240.4853, pp. 740–749

[4] Yair Weiss, Eero P Simoncelli, and Edward H Adelson (2002). “Motion illusions as optimal percepts”. In: Nature neuroscience 5.6, pp. 598–604

[5] Piotr Dollár et al. (2005). “Behavior recognition via sparse spatio-temporal features”. In: 2005 IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance. IEEE, pp. 65–72

[6] Alexander Klaser, Marcin Marszałek, and Cordelia Schmid (2008). “A spatio-temporal descriptor based on 3d-gradients”. In: BMVC 2008-19th British Machine Vision Conference. British Machine Vision Association, pp. 275–1

[7] Ivan Laptev et al. (2008). “Learning realistic human actions from movies”. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp. 1–8

[8] Jia Deng et al. (2009). “Imagenet: A large-scale hierarchical image database”. In: 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp. 248–255

[9] Graham W Taylor et al. (2010). “Convolutional learning of spatio-temporal features”. In: European conference on computer vision. Springer, pp. 140–153

[10] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton (2012). “Imagenet classification with deep convolutional neural networks”. In: Advances in neural information processing systems 25

[11] Karen Simonyan and Andrew Zisserman (2014). “Two-stream convolutional networks for action recognition in videos”. In: Advances in neural information processing systems 27

[12] Du Tran et al. (2015). “Learning spatiotemporal features with 3d convolutional networks”. In: Proceedings of the IEEE international conference on computer vision, pp. 4489–4497

[13] Christoph Feichtenhofer, Axel Pinz, and Andrew Zisserman (2016). “Convolutional two-stream network fusion for video action recognition”. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1933–1941

[14] Kaiming He et al. (2016). “Deep residual learning for image recognition”. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778

[15] Gunnar A Sigurdsson et al. (2016). “Hollywood in homes: Crowdsourcing data collection for activity understanding”. In: European Conference on Computer Vision. Springer, pp. 510–526

[16] Joao Carreira and Andrew Zisserman (2017). “Quo vadis, action recognition? a new model and the kinetics dataset”. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6299–6308

[17] Christoph Feichtenhofer, Axel Pinz, and Richard P Wildes (2017). “Spatiotemporal multiplier networks for video action recognition”. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4768–4777

[18] Will Kay et al. (2017). “The kinetics human action video dataset”. In: arXiv preprint arXiv:1705.06950

[19] Tsung-Yi Lin et al. (2017). “Feature pyramid networks for object detection”. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2117–2125

[20] Allah Bux Sargano, Plamen Angelov, and Zulfiqar Habib (2017). “A comprehensive review on handcrafted and learning-based action representation approaches for human activity recognition”. In: Applied Sciences 7.1, p. 110

Title	Applying the SlowFast Network to Action Recognition in Video
Author	Phùng Thế Ngọc
Institution	Vietnam National University, Hanoi
Major	Computer Science
Type	Course Report
Year of publication	2022
City	Hanoi
