1. Trang chủ
  2. » Giáo án - Bài giảng

TTNT_Chapter8.ppt

79 367 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 79
Dung lượng 1,12 MB

Nội dung

Artificial Intelligence Trí Tuệ Nhân Tạo Chapter 8 – Game Playing Lê Quân Hà Outline • What are games? • Optimal decisions in games  Which strategy leads to success? • α-β pruning • Games of imperfect information • Games that include an element of chance What are and why study games? • Games are a form of multi-agent environment  What do other agents do and how do they affect our success?  Cooperative vs. competitive multi-agent environments.  Competitive multi-agent environments give rise to adversarial problems a.k.a. games • Game playing is a good problem for AI research • Game playing is non-trivial  Players need “human-like” intelligence  Games can be very complex (e.g. chess, go)  Requires decision making within limited time • Games usually are:  Well-defined and repeatable  Limited and accessible Types of Games Relation of Games to Search • Search – no adversary  Solution is (heuristic) method for finding goal  Heuristics and CSP techniques can find optimal solution  Evaluation function: estimate of cost from start to goal through given node  Examples: path planning, scheduling activities • Games – adversary  Solution is strategy (strategy specifies move for every possible opponent reply).  Time limits force an approximate solution  Evaluation function: evaluate “goodness” of game position  Examples: chess, checkers, Othello, backgammon Game setup • Two players: MAX and MIN • MAX moves first and they take turns until the game is over. Winner gets award, looser gets penalty. • Games as search:  Initial state: e.g. board configuration of chess  Successor function: list of (move,state) pairs specifying legal moves.  Terminal test: Is the game finished?  Utility function (hàm tiện ích): Gives numerical value of terminal states. E.g. win (+1), loose (-1) and draw (0) in tic-tac-toe (next) • MAX uses search tree to determine next move. Partial Game Tree for Tic-Tac-Toe • How complex would search be in this case?  Worst case: O(b d )  Tic-Tac-Toe: ~5 legal moves, max of 9 moves  5 9 = 1,953,125 states  Chess: ~35 legal moves, ~100 moves per game  35 100 ~10 154 states (but “only” ~10 40 legal states) • Common games produce enormous search trees!! Complexity of Game Playing Greedy Search for Games • Expand the search tree to the terminal states • Evaluate utility of each terminal board state • Make the initial move that results in the board configuration with the maximum value But this still ignores what the opponent is likely to do… – Computer chooses C because its utility is 9 – Opponent chooses J and wins! Minimax principle - Optimal strategies • Chooses the best move considering both its move and the opponent’s best move • Assumption: Both players play optimally !!  MAX maximizing the utility under the assumption after it moves, MIN will choose the minimizing move. • Given a game tree, the optimal strategy can be determined by using the minimax value of each node: MINIMAX-VALUE( n )= UTILITY(n) If n is a terminal max s ∈ successors(n) MINIMAX-VALUE(s) If n is a max node min s ∈ successors(n) MINIMAX-VALUE(s) If n is a min node

Ngày đăng: 16/07/2014, 05:00

Xem thêm

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN