Điều khiển tối ưu hợp tác phân tán bẩy sự kiện cho đa hệ cơ khí phi holonom tự hành kết nối vật lý và ràng buộc ngõ vào báo cáo tổng kết đề tài nghiên cứu khoa học cấp bộ

BỘ KHOA HỌC VÀ CÔNG NGHỆ QUỸ PHÁT TRIỂN KHOA HỌC VÀ CÔNG NGHỆ QUỐC GIA BÁO CÁO TỔNG HỢP KẾT QUẢ THỰC HIỆN ĐỀ TÀI NGHIÊN CỨU CƠ BẢN TRONG KHOA HỌC TỰ NHIÊN Tên đề tài: Điều khiển tối ưu hợp tác phân tán bẩy kiện cho đa hệ khí phi holonom tự hành kết nối vật lý ràng buộc ngõ vào Mã số đề tài: 107.04-2019.25 TP HCM, 2022 Mẫu M4a Mã số hồ sơ Ngày nhận báo cáo (Do Cơ quan điều hành Quỹ ghi) PHẦN I THÔNG TIN CHUNG Tên đề tài: Điều khiển tối ưu hợp tác phân tán bẩy kiện cho đa hệ khí phi holonom tự hành kết nối vật lý ràng buộc ngõ vào Mã số: 107.04-2019.25 Danh sách chủ nhiệm, thành viên tham gia thực đề tài TT Họ tên Đơn vị công tác PGS.TS Nguyễn Tấn Trường Đại học Cơng nghiệp Luỹ TP Hồ Chí Minh TS Trần Hữu Tồn Trường Đại học Cơng nghiệp TP Hồ Chí Minh Chức danh đề tài Chủ nhiệm đề tài Thành viên chủ chốt Viện học tin học ứng TS Trần Trọng Toàn dụng, Viện hàn lâm khoa học Thư ký đề tài cơng nghệ Việt Nam Tổ chức chủ trì: Trường Đại học Công Nghiệp Tp HCM Thời gian thực hiện: 5.1 Theo hợp đồng: 24 tháng, từ 05/09/2019 đến 05/09/2021 5.2 Gia hạn (nếu có): 12 tháng 5.3 Thực thực tế: 32 tháng, từ 05/09/2019 đến 05/06/2022 Tổng kinh phí phê duyệt đề tài: 684.000.000 đồng (bằng chữ: Sáu trăm tám mươi bốn triệu đồng chẵn) PHẦN II NỘI DUNG VÀ KẾT QUẢ NGHIÊN CỨU Mục tiêu Đề tài bao gồm lý thuyết phương pháp tính tốn lĩnh vực điều khiển hợp tác phân tán đa hệ khí phi holonom tự hành với kết nối vật lý ràng buộc ngõ vào Phép toán biến đổi cho luật điều khiển truyền thẳng đề xuất để tích hợp động học động lực học dễ dàng thiết kế điều khiển tối ưu hợp tác phân tán Từ dựa vào lý thuyết qui hoạch động thích nghi bền vững chế bẩy kiện, phương pháp xấp xỉ nghiệm trực tuyến điều khiển hợp tác phân tán cung cấp cho độ phức tạp tính tốn gánh nặng truyền thơng giảm thiểu Nội dung phạm vi nghiên cứu Phạm vi nội dung nghiên cứu đề tài thể sau: - Nghiên cứu tổng quan điều khiển phân tán hợp tác đa hệ khí phi holonom tự hành, thuộc lớp hệ phi tuyến truyền ngược nghiêm ngặt với kết nối vật lý ràng buộc ngõ vào, ý thêm kết nối truyền thông, nhiễu tham số bất định - Dựa vào lý thuyết đồ thị đề xuất cấu hình hợp tác phân tán đối tượng hệ mơ hình hố hệ thống - Thiết kế luật điều khiển truyền thẳng để tích hợp mơ hình động học động lực học thành hệ phi tuyến nhất, loại bỏ kết nối bất định dễ dàng cho thiết kế luật điều khiển tối ưu phân tán bước - Biến đổi toán tối ưu hợp tác phân tán cho đa hệ khí (đa hệ phi tuyến truyền ngược nghiêm ngặt) thành phương trình vi phân phi tuyến bậc cao Hamilton-Jacobi-Isaacs (HJI) thiết kế phương pháp để xấp xỉ trực tuyến nghiệm phương trình HJI - Tăng cường hiệu luật điều khiển cách khai thác chế kích hoạt kiện - Chứng minh tính ổn định hội tụ hệ thống sử dụng lý thuyết Lyapunov biến đổi toán học nâng cao Cách tiếp cận phương pháp nghiên cứu Trên sở lý thuyết điều khiển phân tán hệ phi tuyến truyền ngược nghiêm ngặt tiếp cận đến kết công bố nhất, đề tài phân tích ưu nhược điểm phương pháp, tiếp tục nghiên cứu phát triển vấn đề Có hai hướng tiếp cận sau: - Nghiên cứu lý thuyết động học động lực học chứa tham số bất định hệ phi tuyến truyền ngược nghiêm ngặt từ áp dụng cho hệ phi holonom di động ràng buộc ngõ vào nhiễu; Khảo sát kết nối vật lý truyền thông đối tượng hệ thống Khi giả thiết phù hợp phạm vi giải toán đặt - Khai thác lý thuyết qui hoạch động thích nghi bền vững (robust adaptive dynamic programming), lĩnh vực hẹp học tăng cường bền vững (robust reinforcement learning), để thiết kế phương pháp xấp xỉ nghiệm phương trình vi phân phi tuyến bậc cao Hamilton-Jacobi-Isaacs liên quan đến luật điều khiển mong muốn Theo hướng tiếp cận trên, phương pháp nghiên cứu bao gồm: - Phân tích thiết kế hệ thống thực cách sử dụng lý thuyết hệ thống động nhiều vật lý thuyết điều khiển đại Trong ổn định hội tụ chứng minh lý thuyết Lapunov tốn học Ngồi ra, mơ số sử dụng để kiểm chứng tính hiệu phương pháp đề xuất - So sánh kết đạt với kết nghiên cứu khác lĩnh vực, đối tượng Kết nghiên cứu (Mô tả kết nghiên cứu đạt Đánh giá kết nghiên cứu bao gồm tính mới, giá trị khoa học, giá trị thực tiễn khả ứng dụng kết nghiên cứu) a) Tính nghiên cứu: - Bộ quan sát trạng thái dựa vào ngõ thiết kế nhằm hạn chế số lượng cảm biến hồi tiếp cho cấu chấp hành hệ khí phi holonom Tính ổn định áp dụng luật điều khiển quan sát trạng thái cho hệ kín phát biểu chứng minh nghiêm ngặt - Luật điều khiển truyền thẳng tích hợp mơ hình động học động lực học cho hệ phi tuyến truyền ngược nghiêm ngặt Luật loại bỏ hoàn toàn thủ tục nhận dạng hệ thống làm giảm chi phí tính tốn khác với luật có Luật điều khiển truyền thẳng tích hợp đề xuất từ hệ phi tuyến truyền ngược nghiêm ngặt áp dụng cho mơ hình động học động lực học cho đối tượng phi holonom di động kết nối vật lý - Luật điều khiển tối ưu hợp tác phân tán với ràng buộc ngõ vào loại bỏ nhiễu dựa vào chế kích hoạt kiện Chất lượng tối ưu điểm khác biệt phương pháp so với phương pháp có Ngồi chế kích hoạt kiện làm giảm đáng kể chi phí tính tốn chi phí truyền thơng so với phương pháp điều khiển lấy mẫu theo chu kỳ Định lý ổn định hội tụ áp dụng luật phát biểu chứng minh nghiêm ngặt b) Giá trị thực tiễn: Nghiên cứu ứng dụng thực tế việc điều khiển tối ưu bền vững trực tiếp cho loại cấu chấp hành hệ khí phi holonom di động sử dụng rộng rãi trong cơng nghiệp, hệ phi tuyến phức tạp sử dụng quan sát trạng thái khơng có cảm biến vị trí Luật điều khiển tối ưu phân tán chứng minh ổn định chặt chẽ áp dụng trực tiếp cho đối tượng phi holonom tự hành thực tế c) Khả ứng dụng: - Từ kết nghiên cứu đề tài thực nghiệm loại cấu chấp hành, ta ứng dụng luật điều khiển nhúng quan sát trạng thái nhúng robot di động thực tế công nghiệp logistic - Kết đề tài làm sở khoa học để nghiên cứu phát triển thuật toán điều khiển nâng cao cho hệ phi tuyến có trễ hệ phi tuyến có mơ hình động học hồn tồn khơng biết d) Kết cơng bố: Cơng trình đăng tạp chí ISI uy tín Quỹ Nafosted ban hành, theo Quyết định số 151/QĐ-HĐQL-NAFOSTED ngày 09 tháng năm 2019 Hội đồng Quản lý Quỹ Phát triển khoa học công nghệ Quốc gia [Luy Nguyen Tan, Thanh Pham Cong, Duy Pham Cong, Neural Network Observers and Sensorless Robust Optimal Control for Partially Unknown PMSM with Disturbances and Saturating Voltages, vol 36, no 10, pp 12045 - 12056, 2021, DOI: 10.1109/TPEL.2021.3071465, ISI(SCI), Q1, IF=6.15 (2021)] Các đóng góp cơng trình tóm tắt sau: 1.1 Để làm sở thiết kế luật điều khiển tối ưu hợp tác phân tán bẩy kiện cho đa hệ khí phi holonom tự hành kết nối vật lý ràng buộc ngõ vào, cần thiết phải khảo sát loại cấu chấp hành động đồng nam châm vĩnh cửu hệ phí phi holonom tự hành với ngõ vào bị ràng buộc không đủ trạng thái để thiết kế điều khiển hồi tiếp trạng thái 1.2 Bộ quan sát trạng thái dựa vào mạng thần kinh nhân tạo thiết kế Trong thơng tin biết trước động học hệ thống không cần thiết Luật cập nhật số trực tuyến trọng số dựa vào lý thuyết ổn định Lyapunov bảo đảm hệ kín ổn định 1.3 Dưới ảnh hưởng nhiễu, vị trí góc quay tốc độ quay trục cấu chấp hành trích xuất từ sức điện động ngược khơng xác, quan sát thứ hai thiết kế Sai số bám góc quay tốc độ quay chứng minh ổn định bị chặn lân cận zero 1.4 Dựa vào kỹ thuật học tăng cường tích phân (Intergral Reinforcement Learning-IRL) thuật toán điều khiển tối ưu bền vững với ngõ vào bị bão hồ đề xuất Mơ thực nghiệm chứng tỏ thuật toán hiệu Cơng trình đăng tạp chí ISI uy tín Quỹ Nafosted ban hành, theo Quyết định số 151/QĐ-HĐQL-NAFOSTED ngày 09 tháng năm 2019 Hội đồng Quản lý Quỹ Phát triển khoa học công nghệ Quốc gia [Luy Nguyen Tan, Huu-Toan Tran, Trong-Toan Tran, Event-Triggered Observers and Distributed H∞ Control of Physically Interconnected Nonholonomic Mechanical Agents in Harsh Conditions, pp 1-14, 2022, DOI: 10.1109/TSMC.2022.3177043, ISI(SCI), Q1, IF=13.45 (2021)] [Tan-Luy Nguyen, Huu-Toan Tran, Trong-Toan Tran, and Cong-Thanh Pham, Reinforcement Learning-Based Event-Triggered Robust Optimal Control for Mobile Euler-Lagrange Systems with Dead-Zone and Saturation Actuators, International Journal of Mechanical Engineering and Robotics Research, vol 10, no 3, pp 116-127, 2021, DOI: 10.18178/ijmerr.10.3.116-127] Các đóng góp cơng trình tóm tắt sau: 2.1 Khảo sát hệ phí phi holonom di động kết nối vật lý hoạt động môi trường phức tạp: - Cơ cấu chấp hành bị ràng buộc (bị bão hoà bị ảnh hường vùng chết hộp số) - Bánh xe chạy bề mặt trơn trượt nên hệ khí bị trượt ngang trượt dọc - Tham số bất định (khối lượng tải, lực quán tính, lực ly tâm quay vòng, lực ma sát) ảnh hưởng lên hệ thống - Nhiễu ngồi gồm nhiễu mơ men nhiễu cảm biến ảnh hưởng lên hệ - Không đủ trạng thái hồi tiếp thiết kế điều khiển 2.2 Thiết kế quan sát trạng thái kích hoạt kiện dựa vào ngõ luật điều khiển truyền thẳng tích hợp mơ hình động học động lực học cho đối tượng phi holonom di động kết nối vật lý Bộ quan sát luật điều khiển cập nhật tham số có kiện xảy Điều làm giảm chi phí tính tốn chi phí truyền thơng Tính ổn định áp dụng luật cập nhật cho hệ kín phát biểu chứng minh nghiêm ngặt 2.3 Luật điều khiển tối ưu hợp tác phân tán với ràng buộc ngõ vào loại bỏ nhiễu dựa vào chế kích hoạt kiện Chất lượng tối ưu điểm khác biệt phương pháp so với phương pháp có Định lý ổn định hội tụ áp dụng luật phát biểu chứng minh nghiêm ngặt So sánh với luật điều khiển phân tán theo chu kỳ lấy mẫu, luật điều khiển dựa vào kích hoạt kiện hiệu độ phức tạp tính tốn chi phí truyền thông Thay đổi so với thuyết minh ban đầu Không thay đổi so với thuyết minh ban đầu PHẦN III KẾT QUẢ CÔNG BỐ VÀ ĐÀO TẠO Kết cơng bố STT Cơng trình khoa học ISSN/ ISBN Tình trạng (Đã in/chấp nhận đăng) Cơng trình cơng bố quốc tế tạp chí thuộc danh mục ISI uy tín Luy Nguyen Tan, Huu-Toan Tran, Trong-Toan Tran, Event-Triggered Observers and Distributed H∞ Control of Physically Interconnected Nonholonomic Mechanical Agents in Harsh Conditions, pp 1-14, 2022, DOI: 1.1 10.1109/TSMC.2022.3177043, ISI(SCI), Q1, IF=13.45 2168-2216 (Thuộc ISI Uy tín Quỹ Nafosted ban hành, theo Quyết định số 151/QĐ-HĐQL-NAFOSTED ngày 09 tháng năm 2019 Hội đồng Quản lý Quỹ Phát triển khoa học công nghệ Quốc gia) Chấp nhận đăng (online ngày 01/06/2022) Luy Nguyen Tan, Thanh Pham Cong, Duy Pham Cong, Neural Network Observers and Sensorless Robust Optimal Control for Partially Unknown PMSM with Disturbances and Saturating Voltages, vol 36, no 10, 1.2 pp 12045 - 12056, 2021, DOI: 0885-8993 Đã in and 2278-0149 Đã in 10.1109/TPEL.2021.3071465, ISI(SCI), Q1, IF=6.15 (Thuộc ISI Uy tín Quỹ Nafosted ban hành, theo Quyết định số 151/QĐ-HĐQL-NAFOSTED ngày 09 tháng năm 2019 Hội đồng Quản lý Quỹ Phát triển khoa học công nghệ Quốc gia) Cơng trình cơng bố quốc tế tạp chí khơng thuộc ISI Tan-Luy Nguyen, Huu-Toan Tran, Trong-Toan Tran, and Cong-Thanh Pham, Reinforcement Learning-Based Event-Triggered Robust Optimal Control for Mobile 2.1 Euler-Lagrange Saturation Systems Actuators, with Dead-Zone International Journal of Mechanical Engineering and Robotics Research, vol 10, no 3, pp 116-127, 2021, SCOPUS, Q3 Cơng trình cơng bố tạp chí quốc gia có uy tín Nguyễn Tấn Luỹ, Trần Hữu Tồn, Đặng Quang Minh, Thuật tốn điều khiển tối ưu bền vững kích hoạt kiện 3.1 cho hệ phi tuyến truyền ngược nghiêm ngặt với vùng 1859-0551 Đã in chết nhiễu ngoài, Chuyên san Đo lường, điều khiển Tự động hoá, Quyển 22, Số 2-3, Trang 3-14, 2019 Báo cáo khoa học đăng kỷ yếu hội nghị quốc tế Tan-Luy Nguyen, Huu-Toan Tran, Trong-Toan Tran, and Cong-Thanh Pham, Reinforcement Learning-Based 4.1 Event-Triggered Robust Optimal Control for Mobile Euler-Lagrange Saturation Systems Actuators, with Dead-Zone International Journal and of Mechanical Engineering and Robotics Research, 2020 2278-0149 Đã in 3rd International Conference on Control, Robotics and Informatics, Ho Chi Minh City, Vietnam, 2020 (Bài báo hội nghị mở rộng, phản biện chấp nhận đăng tạp chí "International Journal of Mechanical Engineering and Robotics Research, vol 10, no 3, pp 116-127, Ho Chi Minh City, Vietnam, 2021, DOI: 10.18178/ijmerr.10.3.116-127, SCOPUS, Q3, Bài đăng xem mục 2.1) Báo cáo khoa học đăng kỷ yếu hội nghị quốc gia Sách chuyên khảo Bằng sáng chế Kết cơng bố khác (nếu có) Tổng hợp kết công bố ISI: - Số báo ISI đề tài đăng ký: 02 - Số báo ISI đề tài công bố/chấp nhận công bố: 02 Ghi chú: - Gửi kèm tồn văn cơng bố tạp chí, hội nghị; sách chuyên khảo - Cột cơng trình khoa học: o Liệt kê báo đăng tạp chí theo thứ tự o Liệt kê báo đăng hội nghị theo thứ tự Kết đào tạo Thời gian tham STT Họ tên Cơng trình công bố gia đề tài (số tháng) Nghiên cứu sinh Học viên cao học 10 This article has been accepted for inclusion in a future issue of this journal Content is final as presented, with the exception of pagination 10 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS (a) Fig (c) Full-state-feedback-based control performance (a) Horizontal states (b) Vertical states (c) Heading states (a) Fig (b) (b) (c) Observer-based control performance (a) Horizontal states (b) Vertical states (c) Heading states A PPENDIX A P ROOF OF T HEOREM Choosing the Lyapunov function candidate Li = Fig Event-triggering instants of each agent for full-feedback-state control N N 1 1 x˜ i Pi x˜ i + δi,h δi,h 2 i=1 i=1 h=q,v # $% & # $% & Li,1 + N i=1 # Li,2 ˜ i W ˜ i} Trace{W $% (59) & Li,3 Fig Event-triggering instants of each agent for estimated state control VI C ONCLUSION This article presents a method to design ET observers and distributed H∞ controllers for physically interconnected NMA systems in harsh conditions Strict assumptions of polynomialtype nonlinearities for physical interconnection functions are relaxed by the method The observer, virtual controller, and an ADP-distributed H∞ control algorithm are created by adopting event-triggering mechanisms to reduce computational complexity and communication Thanks to the observer and the virtual controller, physically interconnected agent systems are transformed into isolated subsystems, and thanks to the algorithm, distributed H∞ optimal control laws and disturbance rejection laws are obtained The proposed designs guarantee the boundedness of closed-loop signals and the exclusion of Zeno behavior The future work will consider problems of ET optimal control with time delay and output feedback The proof is considered in two cases The systems dynamics are within interevent intervals and at event-triggering instants Case 1: Taking the time derivative of Li,1 along with (10) and noting (9), we have ˙ x˜ i Pi x˜ i + x˜ i Pi x˙˜ i i=1 N ˆ i ) + x˜ i Pi (ξi − ψi ) ˜ i φi ( − x˜ i Qi x˜ i + x˜ i Pi W = N L˙ i,1 = i=1 (60) Taking the time derivative of Li,2 along with (13) and (18), yields L˙ i,2 = − # N i=1 h=q,v ⎛ ⎝ χδi,h (bi + ci )ˆzi,h − $% A1 j∈Ni ⎞ aij zˆj,h ⎠ & Authorized licensed use limited to: NASATI Downloaded on June 04,2022 at 08:50:18 UTC from IEEE Xplore Restrictions apply This article has been accepted for inclusion in a future issue of this journal Content is final as presented, with the exception of pagination TAN et al.: ET OBSERVERS AND DISTRIBUTED H∞ CONTROL 11 (a) (b) (c) Fig Evolution of sampling errors and triggering thresholds (a) Robot (b) Robot (c) Robot Fig Approximate optimal value functions Fig 11 Distributed H∞ control performance (a) Robot trajectories in a plane (b) Control torques From (14), (16), the communication graph configuration, and the triggering condition (20), the first sum of (61) can be rewritten as follows: M Mh χ zˆ∗,h h zˆ∗,h A1 = − h=q,v (a) =− (b) z ˆ χ Qh zˆ − z ˜ ∗,h ∗,h ∗,h h=q,v ≤− N χ Qi,h (1 − κi )ˆz i,h zˆ i,h (62) i=1 h=q,v where a∗,h = [a 1,h , , aN,h ] , a = {z, z, zˆ , zˆ , κ, χ }, Mh = M (L + C) ⊗ Inh , and Qh = Mh h = diag(Q1,h , , QN,h ), h = q, v Using Young and Cauchy–Schwartz inequalities, the second sum of (61) is changed to Q (c) A2 ≤ zˆ∗,q q g∗,q zˆ∗,v Q Q ≤ g∗,q zˆ∗,q q zˆ∗,q + zˆ∗,v q zˆ∗,v N 1 Qi,h zˆ ≤ i,h zˆ i,h (d) Fig 10 Evolution of NN weights (a) Robot (b) Robot (c) Robot (d) Robott + N ⎛ ⎝ δi,q (bi + ci )gi,q zî,v − i=1 + N j∈Ni $% # A2 ⎛ ⎝ ˆ δi,h Fi,h ⎞ aij gj,q zˆj,v ⎠ where g∗,q = diag(g1,q , , gN,q ) The time derivative of Li,3 along with (20) gives L˙ i,3 = By substituting (60)–(63) into (59), one has & + (bi + ci ) gi,h ui,h − ψi,h + ξi,h i=1 h=q,v − j∈Ni aij gj,h uj,h L˙ i ≤ − N ⎞ − ψj,h + ξj,h (63) i=1 h=q,v ⎠ (61) − i=1 N λmin (Qi ) ˜xi + i=1 h=q,v N (bi,ξ + bi,π¯ ) Pi ˜xi i=1 χ Qi,h (1 − κi ) − Qi,h zˆ i,h zˆ i,h Authorized licensed use limited to: NASATI Downloaded on June 04,2022 at 08:50:18 UTC from IEEE Xplore Restrictions apply This article has been accepted for inclusion in a future issue of this journal Content is final as presented, with the exception of pagination 12 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS + Fˆ i,h + (bi + ci ) gi,h ui,h − ψi,h + ξi,h δi,h N + 16αi b2i,φ m2i Ci ˜xi 2 αi ˜ i + αi b2i,φ m2i ˜xi Ci ˜xi W + 16 + 2αi b2i,W Ci ˜xi + 16αi Ci ˜xi 2 αi ˜ Ci ˜xi Wi (71) + 16 i=1 h=q,v − aij gj,h uj,h − ψj,h + ξj,h (64) j∈Ni By choosing χ > 1, < κi ≤ (χ − 1/χ ), one obtains L˙ i ≤ − N i=1 + N λmin (Qi )( ˜xi − Di ) + ⎛ δi ⎝Fˆ i i=1 N λmin (Qi ) i=1 D2i After reducing the terms in (71), we have Li,3 + (bi + ci ) gi ui − ψi + ξi ⎞ aij gj uj − ψj + ξj ⎠ − (65) where bi,ξ + bi,π¯ Pi (66) Di = λmin (Qi ) Assume that we have the ET-distributed optimal tracking N ˙ control laws ui , which make i=1 δi δi < (closed-loop dynamics (19) is stable) If ˜xi ≥ 2Di , then L˙ i < That means the estimation error is UUB Case 2: Taking the first difference of (59) at t = tki , one has + + x˜ i Pi x˜ i − x˜ i Pi x˜ i N Li,2 = Li,3 i=1 N + + δi,h δi,h − δi,h δi,h (67) (68) i=1 h=q,v N ' ( ' ( ˜ i+ W ˜ i W ˜ i+ − Tr W ˜i Tr W = (69) From (64), it is shown that the estimation errors are asymptotically stable Thus, Li,1 ≤ and Li,2 ≤ We then complete Li,3 Using (11) for (69), it is obtained that N ' ( ' ( ˜ i mi x˜ i φi + 2tr W ˜ i Ci x˜ i W î = αi 2tr W i=1 where λi,W˜ = αi (2mi b2i,φ A¯ − i + αi (2bi,φ mi Ci bi,W + b2i,φ m2i + 2 Ci b2i,W ) ˜xi + 16αi (b2i,φ m2i + 1) Ci ˜xi + b2i,W Ci ) With (65), ˜xi is bounded, then λi,W˜ ≤ bi,W˜ for some positive constants bi,W˜ With < αi < 4, if ˜ i surpasses a stable boundary, i.e., the approximation error W ) ˜ i ≥ 8λ ˜ /( Ci αi (4 − αi )), then Li,3 < Therefore, W i,W the UUB condition is satisfied This completes the proof A PPENDIX B P ROOF OF T HEOREM For t ∈ i ), [tki , tk+1 it follows from (14) that (bi + ci )z˙î = δ˙i + aij z˙ˆj (73) j∈Ni i=1 Li,3 (72) i=1 j∈Ni Li,1 = N 2 αi αi ˜ i + λ ˜ ˜xi 1− Ci W − ≤ i,W ( ' ˆ + 2αi tr φi x˜ i m C x ˜ W i i i i ( ' + αi tr φi x˜ i mi mi x˜ i φi ( ' ˆ i x˜ i Ci Ci x˜ i W î + αi tr W i ), and note that Taking the derivative of (16) in t ∈ [tki , tk+1 ˙zî = z˙ˆ + z˙˜ , z˙ˆ = 0, one obtains i i i z˙˜i = z˙î ≤ (bi + ci )z˙î (74) Combining (73) and (74) and using the Lipchitz condition (23) with recalling that Theorem guarantees that the error dynamics of δ˙i in (10) is UUB stability yields z˙˜i ≤ Bi bi,W ˜zi + Bi,ξ + ni,η (75) where ni,η is small positive and Bi,ξ is a positive constant deduced from (65) The rest of the proof can follow from [37] R EFERENCES (70) ˆ ˜ where mi = A¯ − i Ci Ci By using Wi = Wi − Wi and Young’s 2 inequality 2ab < (1/l)a + lb , Li,3 is bounded by N 2 ˜ i + 2mi b2i,φ xi αi Ci ˜xi W Li,3 ≤ A¯ − i ˜ i=1 2 2 ˜ i − 2 Ci ˜xi W ˜ i + Ci ˜xi W 2 ˜ i + 2b2i,W Ci ˜xi + Ci ˜xi W + 2αi bi,φ mi Ci bi,W ˜xi [1] Y Li, F Qu, and S Tong, “Observer-based fuzzy adaptive finite-time containment control of nonlinear multiagent systems with input delay,” IEEE Trans Cybern., vol 51, no 1, pp 126–137, Jan 2021 [2] Z Zuo, B Tian, M Defoort, and Z Ding, “Fixed-time consensus tracking for multiagent systems with high-order integrator dynamics,” IEEE Trans Autom Control, vol 63, no 2, pp 563–570, Feb 2018 [3] N Ahmed, J Cortes, and S Martinez, “Distributed control and estimation of robotic vehicle networks: Overview of the special issue,” IEEE Control Syst Mag., vol 36, no 2, pp 36–40, Apr 2016 [4] Y Wang, Y Song, H Gao, and F L Lewis, “Distributed fault-tolerant control of virtually and physically interconnected systems with application to high-speed trains under traction/braking failures,” IEEE Trans Intell Transp Syst., vol 17, no 2, pp 535–545, Feb 2016 [5] L N Tan, “Event-triggered distributed H∞ constrained control of physically interconnected large-scale partially unknown strict-feedback systems,” IEEE Trans Syst., Man, Cybern., Syst., vol 51, no 4, pp 2444–2456, Apr 2021 [6] K Sun, S Sui, and S Tong, “Fuzzy adaptive decentralized optimal control for strict feedback nonlinear large-scale systems,” IEEE Trans Cybern., vol 48, no 4, pp 1326–1339, Apr 2018 Authorized licensed use limited to: NASATI Downloaded on June 04,2022 at 08:50:18 UTC from IEEE Xplore Restrictions apply This article has been accepted for inclusion in a future issue of this journal Content is final as presented, with the exception of pagination TAN et al.: ET OBSERVERS AND DISTRIBUTED H∞ CONTROL [7] S Tong, K Sun, and S Sui, “Observer-based adaptive fuzzy decentralized optimal control design for strict-feedback nonlinear large-scale systems,” IEEE Trans Fuzzy Syst., vol 26, no 2, pp 569–584, Apr 2018 [8] M Chen, “Disturbance attenuation tracking control for wheeled mobile robots with skidding and slipping,” IEEE Trans Ind Electron., vol 64, no 4, pp 3359–3368, Apr 2017 [9] S Li, L Ding, H Gao, C Chen, Z Liu, and Z Deng, “Adaptive neural network tracking control-based reinforcement learning for wheeled mobile robots with skidding and slipping,” Neurocomputing, vol 283, pp 20–30, Mar 2018 [10] R R Selmic and F L Lewis, “Deadzone compensation in motion control systems using neural networks,” IEEE Trans Autom Control, vol 45, no 4, pp 602–613, Apr 2000 [11] Z Liu, F Wang, and Y Zhang, “Adaptive visual tracking control for manipulator with actuator fuzzy dead-zone constraint and unmodeled dynamic,” IEEE Trans Syst., Man, Cybern., Syst., vol 45, no 10, pp 1301–1312, Oct 2015 [12] W He, A O David, Z Yin, and C Sun, “Neural network control of a robotic manipulator with input deadzone and output constraint,” IEEE Trans Syst., Man, Cybern., Syst., vol 46, no 6, pp 759–770, Jun 2016 [13] D Wang and C B Low, “Modeling and analysis of skidding and slipping in wheeled mobile robots: Control design perspective,” IEEE Trans Robot., vol 24, no 3, pp 676–687, Jun 2008 [14] S Roy, S B Roy, and I N Kar, “Adaptive-robust control of Euler–Lagrange systems with linearly parametrizable uncertainty bound,” IEEE Trans Control Syst Technol., vol 26, no 5, pp 1842–1850, Sep 2018 [15] S Roy and S Baldi, “On reduced-complexity robust adaptive control of switched Euler–Lagrange systems,” Nonlinear Anal Hybrid Syst., vol 34, pp 226–237, Nov 2019 [16] S Roy, E B Kosmatopoulos, and S Baldi, “On vanishing gains in robust adaptation of switched systems: A new leakage-based result for a class of Euler–Lagrange dynamics,” Syst Control Lett., vol 144, Oct 2020, Art no 104773 [17] D A Aligia, G A Magallan, and C H D Angelo, “EV traction control based on nonlinear observers considering longitudinal and lateral tire forces,” IEEE Trans Intell Transp Syst., vol 19, no 8, pp 2558–2571, Aug 2018 [18] C Hua, L Zhang, and X Guan, “Distributed adaptive neural network output tracking of leader-following high-order stochastic nonlinear multiagent systems with unknown dead-zone input,” IEEE Trans Cybern., vol 47, no 1, pp 177–185, Jan 2017 [19] G Wang, C Wang, and L Li, “Fully distributed low-complexity control for nonlinear strict-feedback multiagent systems with unknown deadzone inputs,” IEEE Trans Syst., Man, Cybern., Syst., vol 50, no 2, pp 421–431, Feb 2020 [20] D Wang, D Wang, and W Wang, “Necessary and sufficient conditions for containment control of multi-agent systems with time delay,” Automatica, vol 103, pp 418–423, May 2019 [21] R S Sutton and A G Barto, Reinforcement Learning—An Introduction Cambridge, MA, USA: MIT Press, 1998 [22] Y Huang, “Neuro-observer based online finite-horizon optimal control for uncertain non-linear continuous-time systems,” IET Control Theory Appl., vol 11, no 3, pp 401–410, 2017 [23] D Liu, Y Huang, D Wang, and Q Wei, “Neural-network-observerbased optimal control for unknown nonlinear systems using adaptive dynamic programming,” Int J Control, vol 86, no 9, pp 1554–1566, 2013 [24] H Jiang and H He, “Data-driven distributed output consensus control for partially observable multiagent systems,” IEEE Trans Cybern., vol 49, no 3, pp 848–858, Mar 2019 [25] J Mao, H R Karimi, and Z Xiang, “Observer-based adaptive consensus for a class of nonlinear multiagent systems,” IEEE Trans Syst., Man, Cybern Syst., vol 49, no 9, pp 1893–1900, Sep 2019 [26] P Tabuada, “Event-triggered real-time scheduling of stabilizing control tasks,” IEEE Trans Autom Control, vol 52, no 9, pp 1680–1685, Sep 2007 [27] L Ding, Q Han, X Ge, and X Zhang, “An overview of recent advances in event-triggered consensus of multiagent systems,” IEEE Trans Cybern., vol 48, no 4, pp 1110–1123, Apr 2018 [28] V Narayanan and S Jagannathan, “Event-triggered distributed approximate optimal state and output control of affine nonlinear interconnected systems,” IEEE Trans Neural Netw Learn Syst., vol 29, no 7, pp 2846–2856, Jul 2018 13 [29] Y Zhu, D Zhao, H He, and J Ji, “Event-triggered optimal control for partially unknown constrained-input systems via adaptive dynamic programming,” IEEE Trans Ind Electron., vol 64, no 5, pp 4101–4109, May 2017 [30] X Yang and H He, “Adaptive critic learning and experience replay for decentralized event-triggered control of nonlinear interconnected systems,” IEEE Trans Syst., Man, Cybern., Syst., vol 50, no 11, pp 4043–4055, Nov 2020 [31] X Yang and H He, “Event-driven H∞ -constrained control using adaptive critic learning,” IEEE Trans Cybern., vol 51, no 10, pp 4860–4872, Oct 2021 [32] Z Sun, L Dai, Y Xia, and K Liu, “Event-based model predictive tracking control of nonholonomic systems with coupled input constraint and bounded disturbances,” IEEE Trans Autom Control, vol 63, no 2, pp 608–615, Feb 2018 [33] Y Yang, C Xu, D Yue, X Zhong, X Si, and J Tan, “Event-triggered ADP control of a class of non-affine continuous-time nonlinear systems using output information,” Neurocomputing, vol 378, pp 304–314, Feb 2020 [34] Q Wang, J Chen, B Xin, and X Zeng, “Distributed optimal consensus for Euler–Lagrange systems based on event-triggered control,” IEEE Trans Syst., Man, Cybern., Syst., vol 51, no 7, pp 4588–4598, Jul 2021 [35] D Wang, Z Wang, Z Wang, and W Wang, “Design of hybrid event-triggered containment controllers for homogeneous and heterogeneous multiagent systems,” IEEE Trans Cybern., vol 51, no 10, pp 4885–4896, Oct 2021 [36] Q Liu, M Ye, J Qin, and C Yu, “Event-triggered algorithms for leader– follower consensus of networked Euler–Lagrange agents,” IEEE Trans Syst., Man, Cybern., Syst., vol 49, no 7, pp 1435–1447, Jul 2019 [37] Q Zhang, D Zhao, and Y Zhu, “Event-triggered H∞ control for continuous-time nonlinear system via concurrent learning,” IEEE Trans Syst., Man, Cybern., Syst., vol 47, no 7, pp 1071–1081, Jul 2017 [38] C Mu and K Wang, “Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism,” Nonlinear Dyn., vol 95, no 1, pp 2639–2657, 2019 [39] F Tatari, K G Vamvoudakis, and M Mazouchi, “Optimal distributed learning for disturbance rejection in networked non-linear games under unknown dynamics,” IET Control Theory Appl., vol 13, no 17, pp 2838–2848, Nov 2019 [40] W Ren and R W Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,” IEEE Trans Autom Control, vol 50, no 5, pp 655–661, May 2005 [41] A M Bloch, M Reyhanoglu, and N H McClamroch, “Control and stabilization of nonholonomic dynamic systems,” IEEE Trans Autom Control, vol 37, no 11, pp 1746–1757, Nov 1992 [42] R Fierro and F L Lewis, “Control of a nonholonomic mobile robot using neural networks,” IEEE Trans Neural Netw., vol 9, no 4, pp 589–600, Jul 1998 [43] G Chen, Y Song, and F L Lewis, “Distributed fault-tolerant control of networked uncertain Euler–Lagrange systems under actuator faults,” IEEE Trans Cybern., vol 47, no 7, pp 1706–1718, Jul 2017 [44] D Chwa, “Robust distance-based tracking control of wheeled mobile robots using vision sensors in the presence of kinematic disturbances,” IEEE Trans Ind Electron., vol 36, no 10, pp 6172–6183, Jul 2016 [45] Q Jiao, H Modares, S Xu, F Lewis, and K G Vamvoudakis, “Multiagent zero-sum differential graphical games for disturbance rejection in distributed control,” Automatica, vol 69, pp 24–34, Jul 2016 [46] B A Finlayson, The Method of Weighted Residuals and Variational Principles New York, NY, USA: Academic, 1990 [47] T Basar and P Bernhard, Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach Cambridge, MA, USA: Birkhäuser, 1995 [48] A V D Schaft, “l2 -gain analysis of nonlinear systems and nonlinear state-feedback H∞ control,” IEEE Trans Autom Control, vol 37, no 6, pp 770–784, Jun 1992 [49] M Abu-Khalaf and F L Lewis, “Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach,” Automatica, vol 41, no 5, pp 779–791, 2005 [50] D Vrabie and F Lewis, “Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems,” Neural Netw., vol 22, no 3, pp 237–246, 2009 [51] G Chowdhary and E Johnson, “Concurrent learning for convergence in adaptive control without persistency of excitation,” in Proc 49th IEEE Conf Decis Control (CDC), 2010, pp 3674–3679 Authorized licensed use limited to: NASATI Downloaded on June 04,2022 at 08:50:18 UTC from IEEE Xplore Restrictions apply This article has been accepted for inclusion in a future issue of this journal Content is final as presented, with the exception of pagination 14 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS [52] K G Vamvoudakis and F L Lewis, “Online solution of nonlinear two-player zero-sum games using synchronous policy iteration,” Int J Robust Nonlinear Control, vol 22, no 13, pp 1460–1483, 2011 [53] A K Das, R Fierro, V Kumar, J P Ostrowski, J Spletzer, and C J Taylor, “A vision-based formation control framework,” IEEE Trans Robot Autom., vol 18, no 5, pp 813–825, Oct 2002 [54] L N Tan, “Event-triggered distributed H∞ control of physically interconnected mobile Euler–Lagrange systems with slipping, skidding and dead zone,” IET Control Theory Appl., vol 14, no 3, pp 438–451, 2020 Luy Nguyen Tan received the B.S and M.Sc degrees in computer science and automation and control engineering and the Ph.D degree in automation from the Ho Chi Minh City University of Technology, Ho Chi Minh City, Vietnam, in 1996, 2006, and 2015, respectively He is currently with the Faculty of Electronics Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City His current research interests include adaptive dynamic programming, distributed optimal control, large-scale systems, industrial robotic systems, and human–robot interaction Huu-Toan Tran received the M.Sc degree in automation from the HCM City University of Technology, Ho Chi Minh City, Vietnam, in 2009, and the Ph.D degree in electronics science and technology from the University of Electronics Science and Technology of China, Chengdu, China, in 2015 He is currently a Lecturer with the Faculty of Electronics Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City His current research interests include control application, robot learning, and human–robot interaction Trong-Toan Tran received the B.Eng and M.Eng degrees in automation from Bauman Moscow State Technical University, Moscow, Russia, in 2006 and 2008, respectively, and the Ph.D degree in automatic control from the University of Electronics Science and Technology of China, Chengdu, China, in 2016 He was a Research Fellow with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore, from 2017 to 2018 He is currently a Research Scientist of Control and Mechatronics Engineering with the National Institute of Applied Mechanics and Informatics, Vietnam Academy of Science and Technology, Ho Chi Minh City, Vietnam His current research interests include autonomous control, unmanned aerial vehicle, deep-water technology, and industrial robotic system Authorized licensed use limited to: NASATI Downloaded on June 04,2022 at 08:50:18 UTC from IEEE Xplore Restrictions apply

Định dạng
Số trang	90
Dung lượng	22,17 MB