Computer Architecture
Computer Science & Engineering
Chapter OVERVIEW: Abstracts and Technology
BK TP.HCM, Faculty of Computer Science & Engineering
07-Sep-13
CuuDuongThanCong.com
https://fb.com/tailieudientucntt

The Computer Revolution
  - Progress in computer technology
      - Based on Moore's Law
  - Makes novel applications feasible
      - Computers in automobiles
      - Cell phones
      - Human genome project
      - World Wide Web
      - Search engines
  - Computers are now used everywhere

History of Computer Development
  - First generation, 1945 - 1955
      - vacuum tubes, plug boards
  - Second generation, 1955 - 1965
      - transistors, batch systems
  - Third generation, 1965 - 1980
      - ICs and multiprogramming
  - Fourth generation, 1980 - present
      - personal computers (desktop, laptop)
      - supercomputers, data centers, clusters, etc.

Classes of Computers
  - Desktop computers
      - General purpose, variety of software
      - Subject to cost/performance tradeoff
  - Server computers
      - Network based
      - High capacity, performance, reliability
      - Range from small servers to building sized
  - Embedded computers
      - Hidden as components of systems
      - Stringent power/performance/cost constraints

The Processor Market

What You Will Learn
  - How programs are translated into the machine language
      - And how the hardware executes them
  - The hardware/software interface
  - What determines program performance
      - And how it can be improved
  - How hardware designers improve performance
  - What parallel processing is

Understanding Performance
  - Algorithm
      - Determines number of operations executed
  - Programming language, compiler, architecture
      - Determine number of machine instructions executed per operation
  - Processor and memory system
      - Determine how fast instructions are executed
  - I/O system (including OS)
      - Determines how fast I/O operations are executed

Below Your Program
  - Application software
      - Written in high-level language (HLL)
  - System software
      - Compiler: translates HLL code to machine code
      - Operating System: service code
          - Handling input/output
          - Managing memory and storage
          - Scheduling tasks & sharing resources
  - Hardware
      - Processor, memory, I/O controllers

Levels of Program Code
  - High-level language
      - Level of abstraction closer to problem domain
      - Provides for productivity and portability
  - Assembly language
      - Textual representation of instructions
  - Hardware representation
      - Binary digits (bits)
      - Encoded instructions and data

Components of a Computer
  - Same components for all kinds of computer
      - Desktop, server, embedded
  - Input/output includes
      - User-interface devices
          - Display, keyboard, mouse
      - Storage devices
          - Hard disk, CD/DVD, flash
      - Network adapters
          - For communicating with other computers

CPI Example
  - Alternative compiled code sequences using instructions in classes A, B, C
  - Sequence 1: IC = 5
      - Clock Cycles = 2×1 + 1×2 + 2×3 = 10
      - Avg CPI = 10/5 = 2.0
  - Sequence 2: IC = 6
      - Clock Cycles = 4×1 + 1×2 + 1×3 = 9
      - Avg CPI = 9/6 = 1.5

Performance Summary
  - Performance depends on
      - Algorithm: affects IC, possibly CPI
      - Programming language: affects IC, CPI
      - Compiler: affects IC, CPI
      - Instruction set architecture: affects IC, CPI, Tc

Power Trends
  - In CMOS IC technology

Reducing Power
  - Suppose a new CPU has
      - 85% of capacitive load of old CPU
      - 15% voltage and 15% frequency reduction
  - The power wall
      - We can't reduce voltage further
      - We can't remove more heat
  - How else can we improve performance?

Uniprocessor Performance

Multiprocessors
  - Multicore microprocessors
      - More than one processor per chip
  - Requires explicitly parallel programming
      - Compare with instruction level parallelism
          - Hardware executes multiple instructions at once
          - Hidden from the programmer
      - Hard to do
          - Programming for performance
          - Load balancing
          - Optimizing communication and synchronization

Manufacturing ICs
  - Yield: proportion of working dies per wafer

AMD Opteron X2 Wafer
  - X2: 300mm wafer, 117 chips, 90nm technology
  - X4: 45nm technology

Integrated Circuit Cost
  - Nonlinear relation to area and defect rate
      - Wafer cost and area are fixed
      - Defect rate determined by manufacturing process
      - Die area determined by architecture and circuit design

SPEC CPU Benchmark
  - Programs used to measure performance
      - Supposedly typical of actual workload
  - Standard Performance Evaluation Corp (SPEC)
      - Develops benchmarks for CPU, I/O, Web, ...
  - SPEC CPU2006
      - Elapsed time to execute a selection of programs
          - Negligible I/O, so focuses on CPU performance
      - Normalize relative to reference machine
      - Summarize as geometric mean of performance ratios
          - CINT2006 (integer) and CFP2006 (floating-point)

CINT2006 for Opteron X4 2356

SPEC Power Benchmark
  - Power consumption of server at different workload levels
      - Performance: ssj_ops/sec
      - Power: Watts (Joules/sec)

SPECpower_ssj2008 for X4

Pitfall: MIPS as a Performance Metric
  - MIPS: Millions of Instructions Per Second
      - Doesn't account for
          - Differences in ISAs between computers
          - Differences in complexity between instructions
  - CPI varies between programs on a given CPU

Concluding Remarks
  - Cost/performance is improving
      - Due to underlying technology development
  - Hierarchical layers of abstraction
      - In both hardware and software
  - Instruction set architecture
      - The hardware/software interface
  - Execution time: the best performance measure
  - Power is a limiting factor
      - Use parallelism to improve performance
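The arithmetic behind three of the slides above (CPI Example, Reducing Power, and the SPEC geometric-mean summary) can be sketched in a few lines of Python. This is only an illustration under stated assumptions. The per-class cycle counts (A = 1, B = 2, C = 3) and the instruction mixes are inferred from the multipliers shown in the CPI Example. The power model P = C × V² × f is the usual CMOS dynamic-power relation and is assumed here, since the formula did not survive in the slide text. The SPEC ratios passed to `geometric_mean` are made-up numbers, not measured results. All function names are hypothetical.

```python
from math import prod

# Cycles per instruction for classes A, B, C.
# Assumption: inferred from the x1, x2, x3 multipliers in the CPI Example.
CLASS_CPI = {"A": 1, "B": 2, "C": 3}


def average_cpi(counts):
    """Return (instruction count, clock cycles, average CPI) for one sequence."""
    ic = sum(counts.values())
    cycles = sum(CLASS_CPI[cls] * n for cls, n in counts.items())
    return ic, cycles, cycles / ic


def relative_power(cap_ratio, volt_ratio, freq_ratio):
    """New/old power ratio, assuming dynamic power P = C * V^2 * f."""
    return cap_ratio * volt_ratio ** 2 * freq_ratio


def geometric_mean(ratios):
    """n-th root of the product of n performance ratios."""
    return prod(ratios) ** (1 / len(ratios))


# Instruction mixes inferred from the CPI Example slide.
seq1 = average_cpi({"A": 2, "B": 1, "C": 2})
seq2 = average_cpi({"A": 4, "B": 1, "C": 1})

# 85% capacitive load, 15% lower voltage, 15% lower frequency.
power = relative_power(0.85, 0.85, 0.85)

# Illustrative (made-up) performance ratios against a reference machine.
spec = geometric_mean([2.0, 8.0])

print("Sequence 1 (IC, cycles, CPI):", seq1)
print("Sequence 2 (IC, cycles, CPI):", seq2)
print("New power / old power:", round(power, 2))
print("Geometric mean of ratios:", spec)
```

Under the assumed power model the new CPU draws 0.85⁴, about 52%, of the old CPU's power. Sequence 2 executes more instructions than sequence 1 but needs fewer clock cycles, which is why instruction count alone is not a performance measure.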
...

Opening the Box

...
  - A program takes 10 s on A, 15 s on B
      - Execution Time_B / Execution Time_A = 15 s / 10 s = 1.5
      - So A is 1.5 times faster than B
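The A-versus-B comparison can be checked with a minimal Python sketch. It assumes the usual definition that performance is the reciprocal of execution time, so "X is n times faster than Y" means n = time_Y / time_X. The function name `speedup` is hypothetical.

```python
def speedup(time_x, time_y):
    """How many times faster X is than Y, with performance = 1 / execution time."""
    return time_y / time_x


# Execution times from the example: 10 s on A, 15 s on B.
ratio = speedup(10.0, 15.0)
print(f"A is {ratio} times faster than B")
```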