Không thể phát hiện do quá ít thông tin

Phát hiện chưa chính xác do đối tượng không thể hiện rõ đặc trưng (mô hình chỉ học các đối tượng khi có đủ hai phần thông tin gồm màu đen tóc và màu da người). Hình 4.4 sẽ cho ta thấy đối tượng hình chữ nhật màu trắng chỉ có phần tóc màu đen, còn bị chìm vào màu áo của học sinh đằng trước. Với trường hợp đối tượng bị vật thể che khuất cũng không thể phát hiện được, hình 4.5.

- Phát hiện chưa chính xác do mô hình phát hiện sai đối tượng, hình 4.6.

Hình 4.6. Các bao biên đối tượng bị sai

- Một trường hợp khác, video không thỏa điều kiện đầu vào thuật toán 1 sẽ chọn sai thời điểm bắt đầu, dẫn đến toàn bộ quá trình đều thực hiện sai.

 Hình 4.7, frame đầu tiên của video là khi học sinh đã đứng dậy.

 Hình 4.8, thuật toán 1 sẽ trả về kết quả khi học sinh bắt đầu ngồi xuống

Hình 4.8. Frame bắt đầu phát hiện

4.4.2.2 Giai đoạn theo vết đối tượng đầu học sinh

- Sai khi theo vết chủ yếu do bao biên đối tượng thay đối cấu trúc khá nhanh so với trạng thái trước.

- Hình 4.9 trên với bao biên đối tượng được đánh dấu, ta thấy đặc trưng rõ là một chai nước với nền đen, nên khi đối tượng di chuyển thì bao biên đối tượng vẫn không thay đổi.

Hình 4.9. Theo vết chưa chính xác

- Mô hình Yolo v2 chỉ giải quyết các ảnh có chiều dài và chiều rộng phải bằng nhau và là bội của 32, vì thế khi ảnh đầu vào là 1920 ×1080 nên phải thay đổi kích thước thành 1088 × 1088 dẫn đến việc mất một phần thông tin của đối tượng.

- Độ chính xác ở bước phát hiện chưa hoàn toàn 100% nên chưa thể áp dụng ngay vào trong thực tế mà cần cải tiến thêm.

- Chưa đề xuất được phương pháp giải quyết hai bài toán phát hiện và theo vết khi đối tượng hoàn toàn là màu đen hoặc bị chìm vào nền đen.

4.5. Hướng phát triển

- Xây dựng thuật toán bắt đầu phát hiện khi cả giáo viên và học sinh đứng yên.

- Thay đổi góc độ camera để lấy được thông tin nhiều nhất có thể.

- Xây dựng mô hình dựa trên ý tưởng Yolo v2 để có thể xử lý khi ảnh có chiều dài và chiều rộng có kích thước khác nhau và là bội của một số bất kỳ.

TÀI LIỆU KHAM KHẢO

[1] Internet, "hệ thống camera giám sát giao thông" https://www.baomoi.com/trien-khai-du-an-he-thong-camera-giam-sat- an-ninh-giao-thong-trat-tu-uu-tien-tien-do-lap-dat-he-thong-camera- phuc-vu-tuan-le-cap-cao-apec-2017/c/23264994.epi, ngày truy cập 20/10/2017.

[2] Internet, "Sử dụng camera giám sát phát hiện hành vi đeo bám khách du lịch" http://www.catp.danang.gov.vn/-/se-trich-xuat-hinh-anh-tu- camera-giam-sat-e-xu-phat-hanh-vi-eo-bam-cheo-keo-khach-du-lich, ngày truy cập 20/10/2017.

[3] Internet, "Camera giám sát phát hiện vi phạm an toàn giao thông" http://voh.com.vn/chinh-tri-xa-hoi/tphcm-lap-2000-camera-tu-trich- xuat-hinh-anh-toi-pham-nguoi-vi-pham-giao-thong-232595.html, ngày truy cập 20/10/2017.

[4] Internet, "Hệ thống điểm danh sinh viên bằng thẻ từ hoặc vân tay" http://vken.vn/he-thong-diem-danh-sinh-vien-bang-the-tu-hoac-van- tay.html, ngày truy cập 04/07/2018.

[5] Phạm Nguyên Khang, Trần Nguyễn Minh Thư, Đỗ Thanh Nghị, "Điểm danh bằng mặt người với đặc trưng GIST và máy học véc-tơ hỗ trợ," Khoa CNTT-TT, Trường Đại học Cần Thơ, Tháng 8/2017.

[6] Bahlmann, C., Zhu, Y., Ramesh, V., Pellkofer, M., & Koehler, "A system for traffic sign detection, tracking, and recognition using color, shape, and motion information," Proceedings. IEEE, In Intelligent Vehicles Symposium, 2005.

[7] Sobottka, K., & Pitas, "Face localization and facial feature extraction based on shape and color information," In ICIP (3) (pp. 483-486), (1996, September).

[8] Goshtasby, Ardeshir, "Template matching in rotated images," IEEE Transactions on Pattern Analysis and Machine Intelligence, (1985): 338- 344.

[9] Comaniciu, D., & Meer, "Mean shift analysis and applications. In Computer Vision," The Proceedings of the Seventh IEEE International Conference on (Vol. 2, pp. 1197-1203), 1999.

[10] C.K.Chui, G. Chen, "Kalman filtering with real-time applications," pp.20 – 48, 1991.

[11] Gao, Long, Yunsong Li, and Jifeng Ning, "Improved kernelized correlation filter tracking by using spatial regularization," Journal of Visual Communication and Image Representation 50: 74-82, 2018. [12] Hepner, Georgef, et al, "Artificial neural network classification using a

minimal training set- Comparison to conventional supervised classification," Photogrammetric Engineering and Remote Sensing 56.4: 469-473, 1990.

[13] S. I. Amari & A. Cichocki, "Adaptive blind signal processing-neural network approaches," Proceedings of the IEEE, 86(10): 2026-2048, 1998.

[14] A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar & L. Fei- Fei, "Large-scale video classification with convolutional neural networks," In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (pp. 1725-1732), 2014.

[15] K. H. R. B. G. X. Z. a. J. S. S. Ren, "Object detection networks on convolutional feature maps," CoRR, abs/1504.06066, 2015.

[16] S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," arXiv preprint arXiv:1506.01497, 2015.

[17] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, "Going deeper with convolutions," CoRR, abs/1409.4842,, 2014.

[18] M. Lin, Q. Chen, and S. Yan, "Network in network," CoRR, abs/1312.4400, 2013.

[19] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, "ImageNet Large Scale Visual Recognition Challenge," International Journal of Computer Vision (IJCV), 2015.

[20] S. Ren, K. He, R. B. Girshick, X. Zhang, and J. Sun, "Object detection networks on convolutional feature maps," CoRR, abs/1504.06066, 2015. [21] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," In Advances in neural information processing systems, pages 1097–1105, 2012.

[22] K. H. R. G. a. J. S. S. Ren, "Faster r-cnn: Towards real-time object detection with region proposal networks," arXiv preprint arXiv:1506.01497, 2015.

[23] D. C. R. C. S. C. A. D. M. S. A.W. Smeulders, "Visual tracking: an experimental survey," IEEE Trans. Pattern Anal. Mach. Intell, 36 (2013) 1441–1468.

[24] A. Yilmaz, O. Javed, M. Shah, "Object tracking: a survey, ACM Comput," Surv, 38 (2006) 81–93.

[25] D.S. Bolme, J.R. Beveridge, B.A. Draper, Y.M. Lui, "Visual object tracking using adaptive correlation filters," IEEE Conference on Computer Vision and Pattern Recognition, 1113–1120 (2010).

[26] M. Danelljan, G. Hager, F.S. Khan, M. Felsberg, "Learning spatially regularized correlation filters for visual tracking," IEEE International Conference on Computer Vision, 4310–4318 (2015).

[27] J.F. Henriques, R. Caseiro, P. Martins, J. Batista, "High-speed tracking with kernelized correlation filters," IEEE Trans. Pattern Anal. Mach. Intell, 37 (2015) 583–596.

[28] Z. Hong, Z. Chen, C. Wang, X. Mei, D. Prokhorov, D. Tao, "Multi-Store Tracker (MUSTer): a cognitive psychology inspired approach to object tracking," IEEE Conference on Computer Vision and Pattern Recognition, 749–758 (2015).

[29] H. Grabner, C. Leistner, H. Bischof, "Semi-supervised on-line boosting for robust tracking," European Conference on Computer Vision, 234– 247 (2008).

[30] S. Hare, A. Saffari, P.H.S. Torr, "Struck: structured output tracking with kernels," IEEE International Conference on Computer Vision, 263–270 (2011).

[31] Z. Kalal, K. Mikolajczyk, J. Matas, "Tracking-learning-detection," IEEE Trans. Pattern Anal. Mach. Intell., 34 (2012) 1409–1422.

[32] J. Ning, J. Yang, S. Jiang, L. Zhang, M.H. Yang, "Object tracking via dual linear structured SVM and explicit feature map," IEEE Conference on Computer Vision and Pattern Recognition, 4266–4274 (2016).

[33] J.M. Zhang, S.G. Ma, S. Sclaroff, "MEEM: Robust tracking via multiple experts using entropy minimization," European Conference on Computer Vision, 188–203 (2014).

[34] S. Avidan, "Support vector tracking," IEEE Trans. Pattern Anal. Mach. Intell, 26 (2004) 1064–1072.

[35] C.Q. Hong, J. Yu, J. You, X.H. Chen, D.P. Tao, "Multi-view ensemble manifold regularization for 3D object recognition," Inf. Sci, 320 (2015) 395–405.

[36] H. Grabner, C. Leistner, H. Bischof, "Semi-supervised on-line boosting for robust tracking," European Conference on Computer Vision, 34–247 (2008).

[37] B. Babenko, M.H. Yang, S. Belongie, "Robust object tracking with online multiple instance learning," IEEE Trans. Pattern Anal. Mach. Intell, 33 (2011) 1619–1632.

[38] H. Song, Y. Zheng, K. Zhang, "Robust visual tracking via self-similarity learning," Electron. Lett, 53 (2017) 20–22..

[39] C. Ma, J.B. Huang, X.K. Yang, M.H. Yang, "Hierarchical convolutional features for visual tracking," IEEE International Conference on Computer Vision, 3074–3082 (2015).

[40] K. Zhang, Q. Liu, Y. Wu, M.H. Yang, "Robust visual tracking via convolutional networks without training," IEEE Trans. Image Process, 25 (2016) 1779–1792.

[41] H. Galoogahi, T. Sim, S. Lucey, "Muti-channel correlation filters," IEEE International Conference on Computer Vision, 3072–3079 (2013)..

[42] J.F. Henriques, R. Caseiro, P. Martins, J. Batista, "Exploiting the circulant structure of tracking-by-detection with kernels," European Conference on Computer Vision, 702–715 (2013).

[43] Y. Li, J. Zhu, "A scale adaptive kernel correlation filter tracker with feature integration," European Conference on Computer Vision, 254– 265 (2014)..

[44] M. Danelljan, G. Hager, F.S. Khan, M. Felsberg, J. Weijer, "Accurate scale estimation for robust visual tracking," British Machine Vision Conference, 2014.

[45] C.Q. Hong, J.K. Zhu, J. Yu, J. Chen, X.H. Chen, "Realtime and robust object matching with a large number of templates," Multimedia Tools Appl, 75 (2016) 1459–1480.

[46] W. Chen, K. Zhang, Q. Liu, "Robust visual tracking via patch based kernel correlation filters with adaptive multiple feature ensemble," Neurocomputing, 214 (2016) 607–617..

[47] B. Zhang, Z. Li, X. Cao, Q. Ye, C. Chen, L. Shen, A. Perina, R. Ji, "Output constraint transfer for kernelized correlation filter in tracking," IEEE Trans. Syst. Man Cybernet.: Syst, 47 (2017) 693–703..

[48] Y. Li, J. Zhu, S.C.H. Hoi, "Reliable Patch Trackers: Robust visual tracking by exploiting reliable patches," IEEE Conference on Computer Vision and Pattern Recognition, 353–361 (2015)..

[49] P.F. Felzenszwalb, R.B. Girshick, D. McAllester, D. Ramanan, "Object detection with discriminatively trained part-based models," IEEE Trans. Pattern Anal. Mach. Intell, 32 (2010) 1627–1645..

[50] J.V.D. Weijer, C. Schmid, J. Verbeek, D. Larlus, "Learning color names for real-world applications," IEEE Trans. Image Process, 18 (2009) 1512–1523..

Không thể phát hiện do quá ít thông tin

Tính toán với phương pháp MaxPooling

Nguyên tắc hoạt động của YOLO