Nghiên cứu giải pháp nâng cao tốc độ xử lý tín hiệu trong hệ thống thị giác nổi

i BỘ GIÁO DỤC VÀ ĐÀO TẠO BỘ QUỐC PHÒNG VIỆN KHOA HỌC VÀ CƠNG NGHỆ QN SỰ ĐỒN VĂN TUẤN NGHIÊN CỨU GIẢI PHÁP NÂNG CAO TỐC ĐỘ XỬ LÝ TÍN HIỆU TRONG HỆ THỐNG THỊ GIÁC NỔI LUẬN ÁN TIẾN SĨ KỸ THUẬT HÀ NỘI – 2019 ii BỘ GIÁO DỤC VÀ ĐÀO TẠO BỘ QUỐC PHÒNG VIỆN KHOA HỌC VÀ CƠNG NGHỆ QN SỰ ĐỒN VĂN TUẤN NGHIÊN CỨU GIẢI PHÁP NÂNG CAO TỐC ĐỘ XỬ LÝ TÍN HIỆU TRONG HỆ THỐNG THỊ GIÁC NỔI Chuyên ngành: Kỹ thuật điện tử Mã số: 52 02 03 LUẬN ÁN TIẾN SĨ KỸ THUẬT NGƢỜI HƢỚNG DẪN KHOA HỌC: TS Hà Hữu Huy PGS TS Bùi Trung Thành HÀ NỘI – 2019 iii LỜI CAM ĐOAN Tơi xin cam đoan cơng trình nghiên cứu tơi Các số liệu, kết trình bày luận án trung thực chƣa đƣợc cơng bố cơng trình khác Các liệu tham khảo đƣợc trích dẫn đầy đủ NGƢỜI CAM ĐOAN Đoàn Văn Tuấn iv LỜI CẢM ƠN Đầu tiên xin gửi lời cảm ơn chân thành tới thầy: TS Hà Hữu Huy, Viện Điện tử - Viện Khoa học Công nghệ quân thầy PGS TS Bùi Trung Thành – Hiệu trƣởng, Trƣờng Đại học Sƣ phạm Kỹ thuật Hƣng Yên trực tiếp hƣớng dẫn, tận tình bảo, tạo điều kiện tốt để tơi hồn thành luận án Tơi xin chân thành cảm ơn Ban Giám Đốc Viện Khoa học Cơng Nghệ Qn Sự Bộ Quốc phòng, Phòng Đào Tạo Viện Điện Tử tạo điều kiện thuận lợi giúp tơi hồn thành nhiệm vụ đạt kết mong muốn Xin chân thành cảm ơn Đảng Ủy, Ban Giám Hiệu Ban Chủ Nhiệm Khoa Điện – Điện Tử trƣờng Đại Học Sƣ Phạm Kỹ Thuật Hƣng Yên, nơi công tác, tạo điều kiện để tơi hồn thành nhiệm vụ Xin cảm ơn chun gia, nhà khoa học, bạn đồng nghiệp Ban Giám Đốc trung tâm Quang Điện Tử thuộc Viện Công Nghệ Ứng dụng C6 Thanh Xuân Bắc- Hà Nội trực tiếp đóng góp nhiều ý kiến quý báu, giúp tơi vƣợt qua khó khăn để hồn thành luận án Cuối xin gửi lời cảm ơn tới thành viên gia đình, đặc biệt vợ hai con, ngƣời tạo điều kiện thời gian, vật chất hết lòng chăm sóc, động viên tinh thần để tập trung nghiên cứu hoàn thành luận án v MỤC LỤC Trang DANH MỤC CÁC KÝ HIỆU, CHỮ VIẾT TẮT viii DANH MỤC CÁC BẢNG xi DANH MỤC CÁC HÌNH VẼ xii MỞ ĐẦU CHƢƠNG 1: TỔNG QUAN VỀ THỊ GIÁC NỔI VÀ XỬ LÝ TÍN HIỆU TRONG HỆ THỐNG THỊ GIÁC NỔI 1.1 Tổng quan thị giác 1.1.1 Một số khái niệm 11 1.1.2 Hệ thống thị giác 12 1.2 Mơ hình camera 18 1.3 Phƣơng pháp hiệu chuẩn camera 19 1.3.1 Phƣơng pháp hiệu chuẩn Hall 19 1.3.2 Phƣơng pháp hiệu chuẩn Salvi 19 1.3.3 Phƣơng pháp hiệu chuẩn Tsai 19 1.3.4 Phƣơng pháp hiệu chuẩn Weng 20 1.4 Phƣơng pháp hiệu chỉnh ảnh 20 1.5 Các thuật toán so khớp 22 1.5.1 Các thuật toán so khớp cục 23 1.5.2 Các thuật tốn so khớp tồn cục 24 1.6 Nền tảng xử lý hệ thống thị giác 27 1.6.1 Nền tảng xử lý dùng CPU 28 1.6.2 Nền tảng xử lý dùng DSP 29 1.6.3 Nền tảng xử lý dùng GPU 29 1.6.4 Nền tảng xử lý dùng FPGA/ASIC 30 1.7 Đánh giá hệ thống thị giác 31 vi 1.8 Các hƣớng nghiên cứu nhằm nâng cao hiệu hệ thống thị giác 33 1.8.1 Phƣơng pháp phân đoạn ảnh tối ƣu hóa phân cấp 34 1.8.2 Điều chỉnh vùng đồng khơng nhìn thấy 35 1.8.3 Phƣơng pháp cải thiện tối thiểu hàm chi phí cho so khớp điểm ảnh 35 1.8.4 Phƣơng pháp tối ƣu hóa liên kết 35 1.8.5 Phƣơng pháp xếp nhớ hiệu 36 1.8.6 Phƣơng pháp cải tiến thiết kế VLSI 36 1.9 Kết luận chƣơng 37 CHƢƠNG 2: NGHIÊN CỨU CÁC THUẬT TOÁN LAN TRUYỀN TIN CẬY BP VÀ XÂY DỰNG CÁC GIẢI PHÁP NÂNG CAO TỐC ĐỘ XỬ LÝ TRONG HỆ THỐNG THỊ GIÁC NỔI 39 2.1 Trƣờng ngẫu nhiên Markov 39 2.1.1 Định nghĩa trƣờng ngẫu nhiên Markov 42 2.1.2 Mơ hình trƣờng ngẫu nhiên Markov dạng lƣới 43 2.1.3.Mơ hình trƣờng ngẫu nhiên Markov dạng lƣới ẩn 45 2.1.4 Mơ hình trƣờng ngẫu nhiên Markov dạng 46 2.1.5 Ứng dụng trƣờng ngẫu nhiên Markov cho thị giác 47 2.2 Thuật toán lan truyền tin cậy BP ứng dụng cho thị giác 47 2.2.1 Thuật toán lan truyền tin cậy BP dạng 47 2.2.2 Thuật toán lan truyền tin cậy BP dạng lƣới 48 2.3 Thuật toán cục CT 59 2.4 Các giải pháp cải tiến tốc độ xử lý thuật toán BP 60 2.5 Đề xuất giải pháp tối thiểu hàm chi phí 61 2.5.1 Thuật toán đề xuất 61 2.5.2 Thuật toán đề xuất 68 vii 2.6 Đề xuất giải pháp kết hợp 73 2.6.1 Thuật toán đề xuất 74 2.6.2 Thuật toán đề xuất 79 2.7 Phƣơng pháp đánh giá thuật toán 84 2.8 Kết luận chƣơng 84 CHƢƠNG THỰC NGHIỆM CÁC THUẬT TOÁN ĐỀ XUẤT VÀ ĐÁNH GIÁ KẾT QUẢ 87 3.1 Công cụ liệu thực nghiệm 87 3.2 Chỉ số đánh giá độ tin cậy RMSE 88 3.3 Kết thực nghiệm 89 3.3.1 Thuật toán BP tiêu chuẩn 90 3.3.2 Thuật toán đề xuất 92 3.3.3 Thuật toán đề xuất 94 3.3.4 Thuật toán đề xuất 96 3.3.5 Thuật toán đề xuất 99 3.4 Đánh giá thuật toán đề xuất 101 3.4.1 Đánh giá thuật toán đề xuất thuật toán BP 101 3.4.2 Đánh giá thuật toán đề xuất thuật toán BP 106 3.4.3 Đánh giá thuật toán đề xuất thuật toán BP 110 3.4.4 Đánh giá thuật toán đề xuất thuật toán BP 114 3.4.5 So sánh thuật toán đề xuất 118 3.5 Kết luận chƣơng 121 KẾT LUẬN 122 DANH MỤC CÁC CƠNG TRÌNH KHOA HỌC ĐÃ CÔNG BỐ 124 TÀI LIỆU THAM KHẢO 125 viii DANH MỤC CÁC KÝ HIỆU, CHỮ VIẾT TẮT Ký hiệu Ý nghĩa b Khoảng cách hai camera stereo camera b j ( x j ), b'j ( x j ), b''j ( x j ) Độ tin cậy nút j thuật toán BP thuật toán b'''j ( x j ), b''''j ( x j ) đề xuất tƣơng ứng d, dCT Độ chênh lệch vị trí điểm tƣơng ứng ảnh stereo camera E Tập cạnh mơ hình đồ thị dạng lƣới E ( x), E ' ( x), E '' ( x) Năng lƣợng chi phí thực đồ chênh lệch thuật toán BP thuật toán đề xuất tƣơng ứng E ''' ( x), E '''' ( x) G Mơ hình đồ thị biểu diễn đồ chênh lệch i,j Biểu diễn nút thứ i nút lân cận j I L ( x, y ) , I R ( x, y ) Độ xám tọa độ (x,y) ảnh trái ảnh phải ảnh stereo camera k , k ' , k '' , k ''' , k '''' Độ chênh lệch điểm tƣơng ứng ảnh stereo camera thuật toán BP thuật toán đề xuất tƣơng ứng k2' , k2'' Số lần thực chia thơ đến mịn CTF mức thuật tốn đề xuất thuật toán đề xuất k1' , k1'' Số vòng lặp thực CTF thuật toán đề xuất thuật toán đề xuất m Độ phân giải hàng ảnh stereo camera M , M ' , M '' , M ''' , M '''' Dung lƣợng nhớ yêu cầu thuật toán BP thuật toán đề xuất tƣơng ứng n Độ phân giải cột ảnh stereo camera N Tổng số nút ảnh stereo camera  ( xi , x j ), ' ( xi , x j ), '' ( xi , x j )  ( xi , x j ), ( xi , x j ) ''' '''' Hàm chi phí cho nút i nút lân cận j thuật toán BP thuật toán đề xuất tƣơng ứng ix  ,  ' ,  '' ,  ''' ,  '''' Hàm chi phí nhẵn cho hai nút cạnh thuật toán BP thuật toán đề xuất tƣơng ứng p, p' , p'' , p''' , p'''' Hệ số tỉ lệ tăng hàm chi phí nhẵn thuật toán BP thuật toán đề xuất tƣơng ứng q, q' , q'' , q''' , q'''' Ngƣỡng dừng tăng hàm chi phí nhẵn thuật toán BP thuật toán đề xuất tƣơng ứng V Tập cạnh mơ hình dạng lƣới xi Giá trị nhãn đƣợc gán cho nút i xCT Dải quét thuật toán CT xj , x 'j , x ''j , x '''j  , x ''''j  Giá trị nhãn đƣợc chọn gán cho nút j thuật toán BP thuật toán đề xuất tƣơng ứng Chữ viết tắt Ý nghĩa ASIC Các mạch tích hợp ứng dụng riêng (Application Specific Integrated circuits) BP Lan truyền tin cậy (Belief propagation) CFBP Lan truyền tin cậy dùng chia thô đến mịn (Coarse to fine belief propagation) CFCSBP Lan truyền tin cậy có độ sâu thay đổi dùng chia thô đến mịn (Coarse to fine change space belief propagation) CPU Khối xử lý trung tâm (Central processing unit) CT Biến đổi thống kê (Census tranform) CTBP Lan truyền tin cậy kết hợp biến đổi thống kê (Census transform belief propagation) CTCSBP Lan truyền tin cậy có độ sâu thay đổi kết hợp biến đổi thống kê (Census transform change space belief propagation) CTF Chia thô đến mịn ( Coarse to fine) CUDA Kiến trúc thiết bị tính tốn hợp (Compute unified device architecture) x DP Quy hoạch động (Dynamic programming) DSP Vi xử lý tín hiệu số (Digital signal processing) EPT Biến đổi điểm đƣờng bao ( Envelope point transform) FPGA Mảng cổng lập trình dạng trƣờng (Field programmable gate array) GC Lát cắt đồ thị (Graph cut) GPU Khối xử lý đồ họa (Graphic processing unit) HBP Lan truyền tin cậy phân cấp (Hierarchical belief propagation) ICM Các chế độ lặp ( Iterated conditional modes) MAP Xác suất hậu nghiệm (Maximum a posteriori) MRF Trƣờng ngẫu nhiên Markov (Markov random field) NCC Chuẩn hóa tƣơng quan chéo ( Normalized cross correlation) SAD Tổng chênh lệch tuyệt đối ( Sum of absolute different) SGM Bán toàn cục (Semi - Global matching) SIFT Biến đổi đặc trƣng bất biến tỉ lệ (Scale Invariant feature transform) SLAM Đồng thời xây dựng đồ định vị (Simultaneous localization and mapping) SSD Tổng chênh lệch bình phƣơng (Sum of squared difference) SURF Đặc trƣng tăng tốc mạnh (Speed up robust feature) Z '' , Z '''' Hệ số độ sâu thay đổi độ sâu thuật toán đề xuất thuật toán đề xuất 124 DANH MỤC CÁC CƠNG TRÌNH KHOA HỌC ĐÃ CƠNG BỐ Đồn Văn Tuấn, Bùi Trung Thành, Hà Hữu Huy (2015), Hệ thống thị giác đánh giá thuật tốn thị giác nổi, Tạp chí Khoa học Công nghệ Trƣờng Đại học Thái nguyên, số 137 tháng /2015, tr 155 – 160 Đoàn Văn Tuấn, Bùi Trung Thành, Hà Hữu Huy (2017), Giải pháp nâng cao tốc độ thuật toán Belief Propagation cho hệ thống thị giác nổi, Tạp chí Khoa học Công nghệ Trƣờng Đại học Thái nguyên, số 162 tháng / 2017, tr 65 – 71 Đoàn Văn Tuấn, Bùi Trung Thành (2017), Một giải pháp thực đồ chênh lệch ảnh stereo camera có mật độ điểm tương ứng dày ứng dụng cho ảnh 3D đồ độ sâu, Tạp chí Nghiên cứu Khoa học Công nghệ Quân số 51 tháng 10 /2017, tr 100 – 109 Đoàn Văn Tuấn, Bùi Trung Thành (2017), Nâng cao chất lượng thuật toán lan truyền tin cậy xác định đồ chênh lệch ứng dụng cho thị giác Robot, Tạp chí Nghiên cứu Khoa học Công nghệ Quân số 52 tháng 12/ 2017, tr 111 – 120 Đoàn Văn Tuấn, Bùi Trung Thành, Hà Hữu Huy (2018), Giải pháp nâng cao hiệu xử lý tín hiệu thị giác Robot, Tạp chí Nghiên cứu Khoa học Cơng nghệ Quân số 53 tháng 2/ 2018, tr 19 – 27 125 TÀI LIỆU THAM KHẢO Tiếng Việt: [1] Trƣơng Quốc Bảo, Quách Tuấn Văn, Nguyễn Minh Luân (2015), “Phát triển thuật toán xử lý ảnh để phát ƣớc lƣợng khoảng cách từ hệ camera đến tâm cà chua chín cây,” Tạp chí Khoa học trường Đại học Cần Thơ, tập 36, pp 112 - 120 [2] Trần Thuận Hoàng (2015), “Nghiên cứu phương pháp tổng hợp cảm biến dùng cho kỹ thuật dẫn đường robot di động,” Trƣờng đại học công nghệ, trƣờng đại học Quốc gia Hà Nội, Luận án tiến sĩ kỹ thuật điện tử [3] Nguyễn Quang Hùng (2016), "Nghiên cứu phát triển số kĩ thuật định vị dựa hình ảnh, ứng dụng trợ giúp dẫn đường cho người khiếm thị", Đại học Bách khoa Hà nội: Luận án tiến sĩ khoa học máy tính [4] Bạch Ngọc Minh (2014), “Nghiên cứu so sánh thuật tốn xử lý ảnh tính độ sâu ảnh stereo ứng dụng camera thị giác,” Tạp chí Khoa học ĐHQGHN: Khoa học tự nhiên công nghệ, tập 30, số 3, pp 26 - 48 Tiếng Anh: [5] A Blake, P Kohli, C Rother (2011), “Markov Random Fields for Vision and Image Processing,” MIT Press [6] A Celebi, O Urhan, I Hamzao and S Erturk, “Efficient Hardware Implementation of Low Bit Depth Motion Estimation Algorithms,” IEEE Signal Processing Letters, vol 6, no 16, pp 513 - 516, 2009 [7] A Gruber, E Boros and R Zabih (2011), “A Graph Cut Algorthm for Higher-order Markov Random Fields,” 2011 IEEE International Conference on Computer Vision (ICCV), vol 3, no 57, pp 1020-027 [8] A Fusiello, I Luca (2011), “Quasi-Euclidean epipolar rectification of uncalibrated images,” Machine Vision and Applications, vol 22, no 4, pp 663-670 126 [9] A L Francis, N Ayache (1991), “Trinocular stereo vision for robotics,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol 13, no 1, pp 73 - 85 [10] A Prati, R Cucchiara, P Piccinini (2012), “Real-Time Object Detection and Localization with SIFT-based Clustering,” Image and Vision Computing, vol 30, no 8, pp 573 - 587 [11] A P Whelan, J Mallon (2005), “Projective rectification from the fundamental matrix,” Image and Vision Computing, vol 23, số 7, pp 643 - 650 [12] A S Nayar, J Gluckman (2001), “Rectifying transformations that minimize resampling effects,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol 1, pp 111-117 [13] A Zhang, C Loop (1999), “Computing rectifying homographies for stereo vision,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol 1, pp 125 -131 [14] C Bishop (2006), "Pattern Recognition and Machine Learning", New York: Springer [15] C K Lin (2012), “A Stereo Matching based on Adaptive Windows,” International Journal of Electronic Commerce Studies, vol 3, no 1, pp 21 - 34 [16] C K Liang, C C Cheng, Y C Lai, L G Chen and H H Chen (2009), “Hardware Efficient Belief Propagation,” Proc of IEEE Conf Computer Vision and Pattern Recognition, pp 80 - 87 [17] C M Cheng S H Lai, S Y Yang, C H Sin (2009), “Geodesic Treebased Dynamic Programming for Fast Stereo Reconstruction,” IEEE 12th International Conference on Computer Vision Wordshops, pp 801-807 127 [18] C Rhemann, A Hosni, M Bleyer, C Rother and M Gelautz (2013), “Fast Cost volume Filtering for Viual Correspondence and Beyond,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol 2, no 35, pp 504 - 511 [19] C Sun (2012), “Closed form Stereo Image Rectification,” Vision and Scene Understanding, Dunedin, New Zealand [20] C Ttofis, T Theocharides (2012), “Towwards Accurate Hardware Stereo Correspondence: A Real-time FPGA Implementation of A Segmentatio Based Adaptive Spport Weight Algorithm,” Design Automation & Test in Europe Conference & Exhibition, vol 1, no 14, pp 703-708 [21] C W Chou, J J Tsai, H M Hang, H C Lin (2010), “A Fast Graph Cut Algorithm for Disparity Estimation,” Picture Coding Symposium, pp 326 - 329 [22] D Gasbarro, G Emilia (2017), “Review of techniques for 2D camera calibration suitable for industrial vision systems,” IOP Conf Series: Journal of Physics [23] D Geman, S Geman (1984), “Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol 6, no 6, pp 721 - 741 [24] D Juarez, A Espinosa (2016), “Embedded real-time stereo estimation via Semi-Global Matching on the GPU,” The International Conference on Computational, Universitat Autonoma de Barcelona, Barcelona, Catalonia, Spain [25] D Kumari, K Kaur (2016), “A Survey on Stereo Matching Techniques for 3D Vision in Image Processing,” Engineering and Manufacturing, vol 4, pp 40 - 49 128 [26] D Makowski, A Napieralski, P Perek (2016), “Efficient Uncalibrated Rectification Method for Stereo Vision Systems,” International Conference: Mixed design oj Intergrated Circuits and Systems, Lodz, Poland [27] D Min and K Sohn (2008), “Cost Aggregation and Occlusion Handling with WLS in Stereo Matching,” IEEE Trans on Image Processing, vol 8, no 17, pp 361 - 364 [28] D S Kim, S S Lee and B H Choi (2010), “A Real Time Stereo Depth Extraction Hardware for Intelligent Home Assistant Robot,” IEEE Transactions on Consumer Electronics, vol 3, no 56, pp 1782 - 1788 [29] D Scharstein, R Szeliski (2002), “A taxonomy and Evaluation of Dense Two Frame Stereo Correspondence Algorithms,” Computer vision, vol 47, no 1, pp - 42 [30] D Scharstein and R Szeliski, “Middlebury Stereo Datasets,” http://vision.middlebury.edu/stereo/(online) [31] E Fotiadou, I Bireps, P Daras, D Zarpalas (2012), “Anchoring Graph Cuts Toward Accurate Depth Estimation in Intergral Images,” Journal of Display Technology, vol 7, no 8, pp 405-417 [32] G Atanas, H Miska, M Georgiev (2013), “A fast and accurate recalibration technique for misaligned stereo cameras,” IEEE International conference on Image Processing, pp 24 -28 [33] G Chen, L Hong (2004), “Segment Based Stereo Matching Using Graph Cuts,” IEEE Int'l Conf.Computer Vision anf Pattern Recognition, pp 74-81 [34] G Lafruit, F Catthoor, L Jiangbo (2007), “Fast Variable Center biased Windowing for High speed Stereo on Programmable Grapics Hardwave,” In Proc IEEE Int Conf Image Process., pp 568 - 571 129 [35] H Cai, J G Lou, J Li, J Lu (2007), “An Epipolar Geomatry based Fast Disparity Estimation Algorithm for Multi view Image and Video Coding,” IEEE trans Crcuits Syst Video Technol, vol 6, no 17, pp 568 - 571 [36] H Durrant, T Bailey (2006), “Simultaneous localization and mapping (SLAM),” IEEE Robotics and Automation Magazine, vol 13, no 2, pp 99 - 110 [37] J Batlle, E M Mouaddib, J Salvi (1998), “A Robust Coded Pattern Projection for Dynamic 3D Scene Measurement,” Pattern Recognition Letters, vol 19, no 11, pp 1055 - 1065 [38] J Banks, M Bennamoun (2001), “Reliability Analysis of the Rank Transform for Stereo Matching,” IEEE Transactions on Systems, vol 6, no 31, pp 870 - 880 [39] J B Tio, F Sadjadi, E Hall (1982), “Measuring Curved Surfaces for Robot Vision,” Computer Journal, vol 15, no 12, pp 42 - 54 [40] J Besag (2010), “On the Statistical Analysis of Dirty Pictures,” Journal of the Royal Statistical Society , vol 48, no 3, pp 259 - 302 [41] J Morris, G Gimelfarb, R Kalarot (2010), “Performancer Analysis of Multi-resolution Symmetic Dynamic Programming Stereo on GPU,” 2010 25th International Conference of Image and Vision Computing New Zealand, pp 1-7 [42] J M Perez, P Sanchez, M Martinez (2009), “High Memory Throughput FPGA Architecture for High Definition Belief Propagation Stereo Matching,” 2009 3rd International Cenference on Signals, Circuits and Systems, pp - 130 [43] J Selzer, Y H Yang, C Lie (2006), “Region Tree Bsaed Stereo Using Dynamic Programming Optimization,” IEEE Conf Computer Vision and Pattern Recognition, pp 17-22 [44] J Yi, J Kim, S Lee (2005), “Real time Stereo Vision on A Reconfigurable System,” Proc Fifth Int Workshop on Systems, Architectures, Modeling, and Simulation, Samos, Greece [45] J Morel, Z Tang, P Monasse (2010), “Three-step image rectification,” In Proceedings of the British Machine Vision Conference, BMVA [46] K Kumari, D Kumari (2016), “A Survey on Stereo Matching Techniques for 3D Vision in Image Processing,” Engineering and Manufacturing, vol 4, pp 40 - 49 [47] K Q Wang, W M Zuo, Z X Li, M Huan (2011), “Real-Time Accurate Stereo Matching Using Modified Two-Pass Aggregation and WinnerTake-All Guided Dynamic Programming,” 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), pp 73-79 [48] K M Huan (2011), “Template Based Stereo Matching Using Graphcut,” International Conference on Instrumentation, Mwasurement, Computer, Communication and Control, pp 303 - 306 [49] K Reinhard, G Luc, M Pollefeys (1999), “A simple and efficient rectification method for general motion,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol 1, pp 496 - 501 [50] K Sertac (2017), “https://www.stereolabs.com/,” Introducing the ZED Mini 131 [51] K Yoon and I Kweon (2006), “Adaptive Support weight Approach for Correspondence Search,” IEE transactions on Pattern Analysis and Machine Intelligence, vol 4, no 28, pp 650 - 656 [52] K Zhang, J B Lu, Q Yang, G Lafruit, R Lauwereins and L V Gool (2011), “Real time and Accurate Stereo: A Scalable Approach with Bitwise Fast Voting on CUDA,” IEEE Transactions on Circuits and Systems for Video Technology, vol 7, no 21, pp 867 - 879 [53] L D Meaztu, A Villanueva and R Cabeza (2012), “Near Real-Time Stereo Matching Using Geodesic Diffusion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol 2, no 34, pp 410 - 416 [54] L H Wang, D X Li, M Zhang, Q Q Yang (2012), “Hybrid Stereo Matching by Dynamic Programming with Enhanced Cost Entry for Real-time Depth Generation,” 2012 International Conference on Audio, Language and Image Processing (ICALIP), pp 557-563 [55] L Nalpantidis (2010), “Stereo Vision for Robotic Applications in the Presence of Non Ideal Lighting Conditions,” Image and Vision Computing, vol 26, no 6, pp 940 - 951 [56] L Wang, N Ahuja, Q Yang (2010), “A Constant space Belief Propagation Algorithm for Stereo Matching,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1458 1465 [57] L Zhang and S M Seitz (2005), “Parameter Esti mation for MRF Stereo,” Proc IEEE Int'l Conf Computer Vision and Pattern Recognition (CVPR '05), pp 288 - 295 [58] M A Gennert (2001), “Fast Approximate Energy Minimization via Graph Cuts,” IEEE Trans Pattern Analysis and Machine Intelligence 23, vol 11, no 23, pp 1222 - 1239 132 [59] M Bahrin (2016), “Industry 4.0: A Review on Industrial Automation and Robotic,” Jurnal Teknologi (Sciences & Engineering), vol 78, no 6, pp 137 - 143 [60] M Gong and Y H Yang (2005), “Fast Unambiguous Stereo Matching Using Reliability based Dynamic Programming,” IEEE Trans Patt Anal Mach Intell, vol 6, no 27, pp 998 - 1003 [61] M Gong and Y H Yang (2005), “Near Real time Reliable Stereo Matching Using Programmable Graphics Hardware,” IEEE Computer Society Conference on Computer Vision and Pattern Recogntion, pp 924 - 931 [62] M Humenberger, C Zinner, M Weber, W Kubinger and M Vincze (2010), “A Fast Stereo Matching Algorithm Suitable for Embedded Real Time Systems,” Comput Vis Image Understanding, vol 11, no 114, pp 1180 - 1202 [63] M Liao, M Gong, R Yang, D Nister, L Wang (2006), “High Quality Real time Stereo Using Adaptive Cost Aggregation and Dynamic Programming,” Proc of the Third International Symposium on 3D Data Processing, Visualization, and Transmission, pp 798-805 [64] M Stefano (2012), Stereo vision: Algorithms and Applications, University of Bologna [65] M Tomasi, M Vanegas, F Barranco, J Divaz and E Ros (2012), “Massive Parallel Hardware Architecture for Multiscale Stereo, Optical Flow and Image Structure Computation,” IEEE Transactions on Circuits and Systems for Video Technology, vol 2, no 22, pp 282 - 294 [66] N Aouf, D Rodriguez (2010), “Robust Harris SURF Features for Robotic Vision Based Navigation,” in Annual Conference on Intelligent Transportation Systems Madeira, Portugal 133 [67] N Chang, T M Lin, T H Tsai and Y C Tseng (2007), “Real time DSP Implementation on Local Stereo Matching,” IEEE Conf Multimedia Expo, Beijing, China [68] N H Tan, N H Hamid, P Sebastian, Y V Voon (2011), “Resource Minimization in A Real-Time Depth-map Processing System on FPGA,” Tencon 2011-2011 IEEE Region 10 Conference, pp 706-710 [69] N Y Chang, T H Tsai, B H Hsu, Y C Chen, T S Chang (2010), “Algorithm and Architecture of Disparity Estimation With Mini-Census Adaptive Spport Weight,” IEEE Transactions on Circuits and Systems for Video Technology, vol 6, no 20, pp 792 - 906 [70] NIVIDIA (2008), “NVIDIA CUDA Programming Guide Version 2.1,” [71] Nvidia (2017), “http://www.nvidia.com/gtx-700-graphics-cards/gtx- 750ti/,” Nvidia [72] O D Faugeras (1993), “Three-dimensional computer vision,” The MIT Press, Cambridge, Massachusetts [73] O Veksler, R Zabih, Y Boykov (1998), “Markov Ramdom Fields with Efficient Approximations,” IEEE Int Conf, Pattern Analysis and Machine Intelligence, pp 648-655 [74] O Veksler (2003), “Fast Variable Window for Stereo Correspondence Using Integral Images,” Proc Of IEEE Conf Computer Vision and Pattern Recognition, Madison, WI, USA [75] O.Veksler (2007), “Graph Cut Based Optimization for MRFs with Truncated Convex Priors,” Proc IEEE Conf Computer Vision and Pattern Recognition [76] P Cohen, M Herniou, J Weng (1992), “Camera Calibration with Distortion Models and Accuracy Evaluation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol 14, no 10, pp 965 - 980 134 [77] P Corsonello, S Perri (2013), “Adaptive Census Transform: A Novel Hardware Oiented StereoVision Algorithm,” Image and Vision Computing, vol 117, pp 29 - 41 [78] P F Felzenszwalb and D P Huttenlocher (2004), “Efficient Belief Propagation for Early,” Proc IEEE Int Conf Computer Vision and Pattern Recognition, vol 1, no 70, pp 261 - 267 [79] P Grey (2016), Bumblebee xb3 camera, “http://www.ptgrey.com _/products/bbxb3/index.asp 1” [80] P Karl, T Matteo, D Alonso, K Javier, R Eduardo, V Hulle and M Marc (2012), “A Comparison of FPGA and GPU for Real Time Phase based Optical Flow, Stereo, and Local Image Features,” IEEE Transactions on Computers, vol 7, no 61, pp 999 - 1012 [81] P Kohli and P H Torr (2007), “Dynamic Graph Cuts for Efficient Inference in Markov Random Fields,” IEEE Trans Pattern Analysis and Machine Intelligence, vol 12, no 29, pp 2079 - 2088 [82] P N Thang (2011), “A Component-Wise Analysis of Constructible Match Cost Functions for Global Stereopsis,” IEEE transactions on Pattern Analysis and Machine Intelligence, vol 11, no 33, pp 807 - 822 [83] Q Yang, L Wang, R Yang, M Liao and D, Nister (2006), “Real-time global Stereo Matching Using Hierarchical Belif Prepagation,” In proc British Machine Visione Conference, pp 989 - 998 [84] Q Yang, L Wang, R Yang, S Wang, M Liao and D Nister (2006), “Real time Global Stereo Matching Using Hierarchical Belief Propagation,” In Proc British Machine Vision Conference, pp 989 998 [85] Q Zhang, S Li (2015), “New Stereo Matching Method Based on Improved BP Algorithm,” International Journal on Smart Sensing & Intelligent Systems, vol 8, no 1, pp 464 - 479 135 [86] R Woodfill, J Zabih (1994), “Non-Parametric Local Transforms for Computing Visual Correspondence,” Proceedingsings of Third European Conference of Computer Vision , vol 801, pp 151 - 158 [87] R Hamzah, H Ibrahim (2016), “Literature Survey on Stereo Vision Disparity Map Algorithms,” Journal of Sensors, pp - 23 [88] R Hartley (1999), “Theory and practice of projective rectification,” Internat J Comput Vision, vol 35, no 2, pp 115 -127 [89] R Lin, B Super, B Tang, T Yu (2005), “Efficient Message Representations for Belief Propagatine,” in Proc ICCV, pp - [90] R Roncella (2014), “A comparison of semiglobal and local dense matching algorithms for surface reconstruction,” in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Riva del Garda, Italy [91] R Y Tsai (1987), “A Versatile Camera Calibration Technique for High Accuracy 3D Machine Vision Metrology Using Off the Shelf TV Cameras and Lenses,” IEEE International Journal on Robotics and Automation, pp 323 - 344 [92] R Zabih, D Scharstein, O Vekser, V Kolmogorov, A Agarwala, M Tappen, C.Rother, R Szeliski (2008), “A comparative Study of Energy Minimization Methodsfor Markov Random Fields with Smoothnessbased Priors,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp 1068-1080 [93] R.Kalarot and J Morris (2010), “Comparison of FPGA and GPU implementations of Real time Stereo Vision,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp - 15 136 [94] S A Fezza, S Ouddane (2010), “Fast Stereo Matching via Graph Cuts,” International Workshop on Systems Signal Processing and Application, pp 115 - 118 [95] S B Kang, R Szeliski (2004), “Extracting View-Dependent Depth Maps from a Collection of Image,” Int'l J Computer Vision, pp 139163 [96] S Darabiha, J MacLean, J Rose (2006), “Reconfigurable Hardware Implementation of Phase Correlation Stereo Algorihm,” Machine Vision Appl, vol 2, no 17, pp 116 - 132 [97] S Gotchev, G B Akar, T Capin, D Strohmeier, A Boev (2011), “Three Dimensional Media For Mobile Devices,” Proceedings of the IEEE, vol 4, no 99, pp 708 - 742 [98] S Grauer, C Kambhamettu (2009), “Hierarchical Belief Propagation To Reduce Search Space Using CUDA for Stereo and Motion Estimation,” Workshop on Application of Computer Vision, pp - [99] S H Lee, S Sharma (2011), “Real time Disparity Estimation Algorithm for Stereo Camera Systems,” IEEE Transactions on Consumer Electronics, vol 3, no 57, pp 1018 - 1027 [100] S Livation, F Banno, G Muscato (2012), “3D Integration of Robot Vision and Laser Data with Semiautomatic Calibration in Augmented Reality Stereoscopic Visual Interface,” IEEE Transaction on Industrial Infomatics, vol 1, no 8, pp 69 - 77 [101] S Perri, P Corsonello, G Cocorullo (2013), “Adaptive Census Transform: A novel hardware oriented stereovision algorithm,” Computer Vision and Image Understanding, vol 117, pp 29 - 41 [102] S Park, H Jeong (2007), “Real time Stereo Vision FPGA Chip with Low Error Rate,” Proceeding of the 2007 International Conference on Multimedia and Ubiquitous Engineerring, pp 751-756 [103] S Park, C Chen, H.Jeong (2007), “VLSI Architecture for MRF based Stereo Matching,” 7th International Workshop, pp 55 - 64 137 [104] Sun, N Zheng, H Shum (2003), “Stereo matching using belief propagation,” IEEE Trans PAMI, pp 787 -800 [105] T Emanuele, V Alessandro, A Fusiello (2000), “A compact algorithm for rectification of stereo pairs,” Machine Vision and Applications, vol 12, no 1, pp 16 - 22 [106] W R Szeliski (2010) “Computer Vision: Algorithms Applications,” Springer-Verlag New York Inc, pp 1458-1465 and [107] Y An, S Zhang, B Li (2017), “High accuracy, high speed 3D structured light imaging techniques and potential applications to intelligent robotics,” International Journal of Intelligent Robotics and Applications, vol 1, no 1, pp 86 -103 [108] Y C Tseng, T S Chang (2010), “Architecture Design of Belief Propagation for Real Time Disparity Estimation,” IEEE Transaction on Circuits and Systems for Video Technology, vol 11, no 20, pp 1555 1565 [109] Y C Tseng, N Chang, T S Chang (2007), “Low Memory Cost Block based Belief Propagation for Stereo Correspondence,” IEEE International Conference on Multimedia and Expo, pp 1415 - 1418 [110] Y Ohta, T Kanade (1985), “Stereo by Intra and Inter scanline Search Using Dynamic Programming,” IEEE Trans Pattern Anal Mach Intell, pp 139 - 145 [111] Y R Horng, Y.C Tseng, T S Chang (2011), “VLSI Architecture for Real Time HD1080p View Synthesis Engine,” IEEE Transactions on Circuits and Systems for Video Tecnology, pp 1329 - 1340 [112] Y S Heo, K M Lee, S U Lee (2010), “Robust Stereo Matching Using Adaptive Normalized Cross Correlation,” IEEE Tran on Pattern Analysis and Machine Intelligence, vol 4, no 32, pp 807-822 [113] Z Wang, S Chen (2012), “Acceleration Strate in Generalized Belief Propagation,” IEEE Transactions on Industrial Information, vol 1, no 8, pp 41 - 48 P-1 ... là: Nghiên cứu giải pháp nâng cao tốc độ xử lý tín hiệu hệ thống thị giác nổi Luận án đề xuất giải pháp để giải toán nâng cao tốc độ xử lý tín hiệu giảm dung lƣợng nhớ yêu cầu hệ thống thị giác. .. Sơ đồ khối hệ thống thị giác Hiện nay, theo [64], hệ thống thị giác nhƣ Hình 1.1 đƣợc chia làm hai loại hệ thống thị giác chủ động hệ thống thị giác thụ động Hệ thống chủ động hệ thống sử dụng... 12 Quá trình xử lý thông tin ảnh từ ảnh stereo camera để xác định đƣợc đồ độ sâu vật hệ thống thị giác đƣợc gọi trình xử lý tín hiệu hệ thống thị giác Để nâng cao tốc độ xử lý tín hiệu ngồi việc

Định dạng
Số trang	151
Dung lượng	4,16 MB