SCHAUM’S OUTLINE OF
Principles of COMPUTER SCIENCE

CARL REYNOLDS
Department of Computer Science, Rochester Institute of Technology

PAUL TYMANN
Department of Computer Science, Rochester Institute of Technology

Schaum’s Outline Series
McGRAW-HILL
New York | Chicago | San Francisco | Lisbon | London | Madrid | Mexico City | Milan | New Delhi | San Juan | Seoul | Singapore | Sydney | Toronto

Copyright © 2008 by The McGraw-Hill Companies, Inc. All rights reserved. Manufactured in the United States of America. Except as permitted under the United States Copyright Act of 1976, no part of this publication may be reproduced or distributed in any form or by any means, or stored in a database or retrieval system, without the prior written permission of the publisher.

ISBN 0-07-151037-0 (eBook). The material in this eBook also appears in the print version of this title: ISBN 0-07-146051-9.

All trademarks are trademarks of their respective owners. Rather than put a trademark symbol after every occurrence of a trademarked name, we use names in an editorial fashion only, and to the benefit of the trademark owner, with no intention of infringement of the trademark. Where such designations appear in this book, they have been printed with initial caps.

McGraw-Hill eBooks are available at special quantity discounts to use as premiums and sales promotions, or for use in corporate training programs. For more information, please contact George Hoare, Special Sales, at george_hoare@mcgraw-hill.com or (212) 904-4069.
DOI: 10.1036/0071460519

Carl Reynolds teaches courses in database, operating systems, programming, and programming language theory in the Computer Science Department at the Rochester Institute of Technology (RIT). He has taught at the college level for 10 years and has also taught in the computer industry. Before coming to RIT, Reynolds spent 19 years in the computer industry working in technical and training capacities for both hardware and software suppliers, and additional years with a Dow Jones Industrial manufacturer creating expert systems for machine control. His interests include genetic algorithms, expert systems, and image processing.

Paul Tymann is Professor and Chair of the Computer Science Department at the Rochester Institute of Technology. He has taught both basic and advanced programming techniques for over 15 years. More recently he has been involved with the development of a new bioinformatics program at RIT. Prior to entering academia, Professor Tymann worked in industry developing control software for point-of-sale terminals. In recent years he has worked in the area of bioinformatics and has completed joint software development projects at the University of Rochester and Rutgers University.

CONTENTS

CHAPTER 1 Introduction to Computer Science
    What Is Computer Science?
    Computing History
    A Roadmap
    Review Questions

CHAPTER 2 Algorithms
    Definition of Algorithm
    Example—Designing a Staircase
    Example—Finding the Greatest Common Denominator
    Representing Algorithms with Pseudocode
    Characterizing Algorithms
    Analyzing Algorithms
    Algorithms as Technology
    Formal Models of Computation
    Church–Turing Thesis
    Unsolvable Problems
    Summary
    Review Questions

CHAPTER 3 Computer Organization
    Von Neumann Architecture
    Data Representation
    Computer Word Size
    Integer Data Formats
    Real Number Formats
    Character Formats
    CPU/ALU
    Instruction Set
    Memory
    Input and Output (I/O)
    Summary
    Review Questions

CHAPTER 4 Software
    Generations of Languages
    Compilers and Interpreters
    Virtual Machines
    Procedural Programming
    Object-Oriented Programming
    Scripting Languages
    Functional Languages
    Language Design
    Language Syntax and Semantics
    Summary
    Review Questions

CHAPTER 5 Programming in Java
    Introduction
    Java Types
    Arrays
    Java Operators
    Java Identifiers
    Basic Control Structures
    Object-Oriented Programming
    Classes and Objects
    Object State and Behavior
    Inheritance
    Instance and Static Variables and Methods
    Polymorphism
    Interfaces
    Error Handling
    Input and Output
    Scanner
    PrintWriter
    Summary
    Review Questions

CHAPTER 6 Operating Systems
    Capabilities of the Hardware
    Operating Systems History
    Single-user OS → Network OS
    Multiprocessor Operating Systems
    Real-Time Operating Systems
    Embedded Systems
    Management of Input and Output
    Processes and Scheduling
    Threads
    Synchronization
    Semaphores
    Monitors
    Deadlock
    Scheduling
    Memory Management
    Memory Management with Multiprogramming
    Timesharing and Swapping
    Virtual Memory
    File Systems
    Summary
    Review Questions

CHAPTER 7 Networking
    Introduction
    Reference Model
    Subnet (Data-Link) Layer
    Internet (Network) Layer Protocol
    End-to-end (Transport) Layer Protocol
    Application Layer
    Putting It All Together
    The World Wide Web, HTTP, and HTML
    Summary
    Review Questions

CHAPTER 8 Database
    The Ubiquitous Database
    Database Types
    Advantages of Using a Database
    Modeling the Data Domain
    Building a Relational Database from the Data Model
    Normalization
    SQL—Structured Query Language
    DDL—Data Definition Language
    DML—Data Manipulation Language
    Stored Procedures
    Triggers
    Data Integrity
    Transaction Isolation Levels
    Accessing the Database Programmatically
    Summary
    Review Questions

CHAPTER 9 Social Issues
    Ethics Theories
    Intellectual Property
    Privacy
    Encryption
    Viruses, Worms, and Trojan Horses
    Hackers
    Can Computers Kill?
    Summary
    Review Questions

APPENDIX Answers to Review Questions

Index

CHAPTER 1 Introduction to Computer Science

WHAT IS COMPUTER SCIENCE?

Computer Science is defined in different ways by different authors. Wikipedia (http://en.wikipedia.org/wiki/Computer_science) defines computer science as the collection of a variety of disciplines related to computing, both theoretical and practical: theoretical foundations of information and computation, language theory, algorithm analysis and development, implementation of computing systems, computer graphics, databases, data communications, etc.

The US National Coordination Office for Networking and Information Technology Research and Development (NITRD) defines computer science in a similarly broad way: the systematic study of computing systems and computation. The body of knowledge resulting from this discipline contains theories for understanding computing systems and methods; design methodology, algorithms, and tools; methods for the testing of concepts; methods of analysis and verification; and knowledge representation and implementation (http://www.nitrd.gov/pubs/bluebooks/1995/section.5.html).

Another broad definition comes from the Association for Computing Machinery (ACM) Model Curriculum. It says that computer science is the "study of computers and algorithmic processes, including their principles, their hardware and software design, their applications, and their impact on society."

A famous definition of computer science by Gibbs and Tucker (Gibbs and Tucker, "A Model Curriculum for a Liberal Arts Degree in Computer Science," Comm. of the ACM, vol. 29, no. 3, March 1986) emphasizes algorithm development and analysis as the central focus of computer science.

It's also a fair question to ask, "How is computer science a science?" In contrast to physics, biology, and chemistry, computer science is not based on the study of the natural world. In that sense, computer science is more like mathematics than science. Some argue that computer science is really computer art (where "art" means practice). On the other hand, computer scientists use the scientific method to propose and test hypotheses, and some very nonobvious discoveries in computer science have important real-world implications. An example, which we will discuss later, is the discovery that some important problems simply cannot be solved by computation.

Despite many variations, essentially all definitions of computer science emphasize the study of algorithms.
Algorithms, in one form or another, are central to computer science. Computer science combines the theoretical concepts of algorithm design and analysis with the practical considerations of how to implement algorithms on a computer and solve practical problems.

An algorithm defines a detailed and unambiguous sequence of actions for solving a particular problem or for performing some task. If you have ever followed a recipe when cooking, followed a set of driving directions, or filled out an income tax form, you have worked with an algorithm. For example, at some point in time you were probably taught how to determine the greatest common divisor (GCD) of two numbers. In case you've forgotten, the GCD of two positive integers is the greatest integer that evenly divides both numbers. For example, the GCD of 42 and 30 is 6. The algorithm given below can be used to compute the GCD of two positive integers a and b:

1. If b is zero, then the GCD of a and b is a. Algorithm ends.
2. Set r to be the remainder obtained from the integer division of a and b.
3. Repeat this process using b and r.

Consider computing the GCD of 42 and 30. Let a = 42 and b = 30. We start the process at step 1 of the algorithm. Since b is not zero, we proceed to step 2. In step 2 we compute the remainder obtained when 42 is divided by 30, which is 12. Step 3 instructs us to repeat the process, this time using 30 and 12. So on this second trip through the process a is now 30 and b is now 12. Since b is not zero, we compute the remainder of 30 and 12, which is 6, and repeat the process using 12 and 6. As before, since b is not zero, we compute the remainder of 12 and 6 and get zero. We will now repeat the process using 6 and 0. This time through, since b is now zero, we conclude that the GCD of 42 and 30 is 6.

Algorithms are essential to the way computers process information because a computer program is basically an electronic form of an algorithm that tells the computer what specific steps to perform to carry out a specified task. In order to study an electronic form of an algorithm, a computer scientist must also understand the computer that will be used to execute the steps of the algorithm. The term hardware is used to describe the physical, tangible parts of a computer. A keyboard, mouse, motherboard, graphics card, and processor are all examples of computer hardware. Just as a racecar driver needs to understand the capabilities and limitations of the vehicle they are driving, a computer scientist must also understand the hardware platform on which computing algorithms will be implemented. It is not enough just to "know how to drive" in the case of the racecar driver, and it is not enough just to "know algorithms" to be a computer scientist. An algorithm that is optimal for a particular hardware platform may not be optimal on another.

Algorithms are typically expressed in a form that can be easily understood by a human being. For example, the algorithm given earlier to compute the GCD of two numbers was written using the English language so that it would be easy for you to understand. Even though you may understand more than one language, the only language that a computer understands is machine language. Machine language is a system of codes that the computer is designed to interpret. Each word in machine language represents a simple action that can be performed by the computer. For example, the machine language instruction "add" instructs the computer to add together two numbers. (In Chap. 3, on computer organization, we will explain machine language in much more detail.) The set of instructions that, when executed by a computer, carries out the steps of an algorithm is called a program.
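To make the connection between an algorithm and a program concrete, here is one way the GCD algorithm above might be expressed in Java, the language used later in this book. This is a minimal illustrative sketch (the class and method names are our own invention), not the only possible implementation.

    public class GcdExample {
        // Compute the GCD of two positive integers using the algorithm above:
        // if b is zero the answer is a; otherwise repeat with b and the remainder of a divided by b.
        static int gcd(int a, int b) {
            while (b != 0) {
                int r = a % b;   // step 2: remainder of the integer division of a by b
                a = b;           // step 3: repeat the process using b and r
                b = r;
            }
            return a;            // step 1: when b is zero, the GCD is a
        }

        public static void main(String[] args) {
            System.out.println(gcd(42, 30));   // prints 6, matching the trace above
        }
    }

Running the program follows exactly the steps traced above: 42 and 30, then 30 and 12, then 12 and 6, then 6 and 0, at which point 6 is returned.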
It is difficult for humans to work directly with machine language. Machine instruction words consist of rows of ones and zeros, typically 8, 16, 32, or 64 bits long, and sometimes varying in length. Since people have difficulty manipulating these strange codes directly, computer languages have been developed to ease the process of converting an algorithm into a form that the computer can act upon. We refer to these languages as higher-level languages, because they have been designed to allow humans to work at a "higher level" than that of the ones and zeros of the computer. Machine language, on the other hand, is often referred to as a low-level language. Java, FORTRAN, Basic, and Ada are just a few examples of high-level languages that are used by computer scientists to express the algorithms they have developed. The act of expressing an algorithm using a low-level or high-level language is referred to as programming.

Over the years, starting in the 1950s, computer scientists have created many higher-level languages. In the early days some experts thought that it should be possible to develop one language that would be best for all uses. Since then, however, computer scientists have found that language design always trades off some features and capabilities for others. As a result, today we have many good higher-level languages: some particularly suited to symbol manipulation, some particularly good for teaching programming, some good for matrix algebra applications, some for fast one-off, one-time programs, some for mission-critical, life-dependent applications, some tuned for applications in real-time automation control, and many good ones for general-purpose use. Computer scientists study the general characteristics of computer languages and formal grammars, and usually become proficient in several or many different languages.

The term software is used to describe the set of instructions, or programs, that a computer uses to execute an algorithm. Software contains the instructions that direct the operation of the hardware. The software that makes the basic functions of the computer accessible is referred to as system software. System software is responsible for controlling and managing the hardware of a computer system, and for making the computer easy to use for program developers as well as general users. Examples of system software include operating systems, display managers, virus scanners, language processors (called compilers or interpreters—to be discussed in the chapter on software), and device drivers. Programs such as word processors or spreadsheets are referred to as application software. Application software is used to accomplish specific tasks. Application software may consist of a single program, or a small collection of programs that work together to accomplish a task for a user of the computer.

Operating systems are particularly important and complex system software. They are important because the performance of the operating system has a dramatic influence on the quality of the computer user's experience and the efficiency of the computer system as a whole. In the days of simpler computing systems, in the 1960s and 1970s, a company might purchase a computer without an operating system, with the intention of writing or using its own operating system, but today one always buys an operating system when one buys a computer.
The operating system provides easy access to peripheral devices like printers and displays, a file system for storing information like data, documents, and programs, a user interface to make it easy to start application programs, a time-of-day clock, a connection to the Internet using the standard network protocols, a set of "calls" or "methods" that application programs can use to request services of the operating system, an efficient algorithm for allocating memory to the various programs active at the same time, and an efficient algorithm for sharing access to the computer among several people and/or programs at the same time. Popular operating systems today include Microsoft Windows, Mac OS, Unix, Linux (a variety of Unix), and IBM's MVS, among others. In fact, the field of operating system development is still a very active one in computer science. Not only are operating systems becoming more complex (adding firewalls and other protections, for example), but operating systems are also becoming more diverse. As simpler devices like thermostats and dishwashers come under computer control, computer scientists have created specialized "embedded systems" operating systems for those requirements.

Even into the 1980s many, if not most, computers were stand-alone—not connected to one another. During the 1970s and 1980s computer scientists explored the advantages of computing networks and proposed a number of different physical connections among computers, as well as different networking protocols. At the time there was hot competition among different vendors of computers, each with a different standard, and each hoping to "lock in" customers by selling its particular networking products. IBM offered Systems Network Architecture (SNA), Digital Equipment promoted DECnet, Hewlett Packard offered Distributed Systems (DS), and Xerox offered Xerox Network Systems (XNS). Even General Motors got into the act, with its Manufacturing Automation Protocol (MAP). None was directly compatible with any other, but all offered "bridges" to other systems.

Today the problems for computer scientists in networking are different. For the most part, the world has agreed on the IEEE 802 standards and the TCP/IP protocols for the Internet. The problems now have to do with expanding the number of Internet addresses without disrupting the operation of the older "installed base," adapting to new and much faster physical connections such as optical fiber, increasing the speed of wireless connections, which are by nature slower and more susceptible to interference, managing larger data transfers such as movies, which also require strict real-time performance so the movie doesn't stop midaction, and providing low-power, low-cost protocols for the ad hoc connection of hundreds or thousands of digital sensors.

Supporting almost all applications today is database technology. The dominant database model is the relational database, first offered for commercial use in the 1980s. Computer scientists develop algorithms for storing and retrieving information quickly from absolutely enormous reservoirs of data. How is it, for example, that Google can call up almost instantly more than 400,000 images of a "red barn" from over 1.5 billion images in its database?
There is a great deal to know about creating a good database, accessing a database from a program, growing a database, and managing a database. Application programmers and database administrators need to understand databases at this level in order to use them efficiently. Even computer scientists focused on other specialties need to know about databases today. For instance, some of the newer operating systems use database technology in their file systems for the storage of all information, not just information formally devoted to a particular database. The benefits to the operating system include speed, space savings, and data security.

At a deeper level, computer scientists develop algorithms for sharing access to a database among many users simultaneously. For instance, a site like Amazon.com may serve more than 100,000 users at once, and it's important that each user's choices and purchases be kept distinct from one another. Likewise, when you reserve an airplane seat on-line, it's important that two people on-line at the same time are not promised space in the same seat! Computer scientists also develop algorithms for making backup copies of the database to protect against the possibility of data loss due to equipment failure. For a site like Amazon, such an algorithm must allow backup without first stopping the operation of the primary database, for the site must be up at all times! Algorithms to provide such service reliably and efficiently are very challenging to perfect.
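The airline-seat example hints at why shared access is hard. The small Java sketch below is a toy illustration invented for this discussion (real database systems rely on transactions and locking, topics taken up in the chapter on databases): without the synchronized keyword, two simultaneous requests could both be told that the same seat is free.

    import java.util.HashSet;
    import java.util.Set;

    // Toy model of seat reservations shared by many simultaneous users (hypothetical example).
    public class SeatReservations {
        private final Set<String> taken = new HashSet<>();

        // synchronized makes the check-and-reserve step atomic, so two callers
        // can never both be promised the same seat.
        public synchronized boolean reserve(String seat) {
            if (taken.contains(seat)) {
                return false;   // someone already holds this seat
            }
            taken.add(seat);
            return true;        // this caller gets the seat
        }

        public static void main(String[] args) {
            SeatReservations flight = new SeatReservations();
            System.out.println(flight.reserve("14C"));  // true: the first request succeeds
            System.out.println(flight.reserve("14C"));  // false: the seat is already taken
        }
    }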
It should not be hard to convince you that computers have dramatically changed the way in which human beings live their lives. Technologies such as the Internet and the World Wide Web put a vast amount of information at our fingertips. Instant messenger systems, electronic mail, and cell phones have revolutionized the way in which human beings communicate. Computer surveillance systems are being used by police forces to make the world a safer place to live. While all of these technologies are primarily being used for the betterment of humankind, it is also possible to use these technologies to inflict harm, obtain unauthorized access to information, or to spy on people. Coupled with the ability to develop these technologies is a need to address their social and ethical uses. It is just as important, perhaps sometimes even more important, to ask questions about the potential impact of a technology on society as it is to build the technology. As more and more people come to depend on computing technology in their daily lives, computer science must also consider the study of the social issues of the technologies that it produces.

There is a common misconception that computer science is nothing more than the study of computer hardware and programming. It should be clear to you now that computer science is much more than simply writing programs. It includes the study of computer hardware, computer languages, operating systems, networking, databases, and the social consequences of computing. In order to be effective, a computer scientist must understand and master each of these areas. Further, computer science is a young discipline that has been evolving rapidly since its beginnings in the 1940s. In the next section we will briefly explore the history of computing from both a hardware and a software perspective.

COMPUTING HISTORY

Though computer science is a relatively young field that only began in earnest in the 1940s, interest in computing and computing devices started much earlier. The abacus, a simple counting device invented in Babylonia in the fourth century BC, is considered by many to be the first computing device.

In 1614 the Scottish lord John Napier, inventor of logarithms, invented a calculating device consisting of a series of rods (often called "bones") that reduced the complex process of multiplication and division into the relatively simple tasks of addition and subtraction. Some regard his inventions as the first attempt at mechanical computation.

Blaise Pascal is often credited with the invention of the first mechanical calculator, the Pascaline, in 1642 (there is evidence that Leonardo da Vinci may have beaten Pascal by 150 years). According to Pascal's memoirs, he developed the machine to help his father with his work as a tax collector. Pascal's device could only add and subtract, but it was possible to perform multiplication and division operations using a series of additions or subtractions. What is noteworthy about Pascal's machine is that it was capable of calculating with eight figures, and that subtraction was performed using complement techniques. Subtracting using complementary addition is the same technique that is used to implement subtraction in most modern computers.

Figure 1-1 The Pascaline, photograph by Yves Serra (http://pagesperso-orange.fr/yves.serra/)
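As a brief aside (an illustration added here, not a description of the Pascaline's mechanism): most modern computers subtract by adding the two's complement of a number, which is formed by flipping every bit and adding one. The Java fragment below sketches the idea.

    public class ComplementSubtraction {
        public static void main(String[] args) {
            int a = 42, b = 30;
            int negB = (~b) + 1;        // two's complement of b: invert the bits, then add 1
            int difference = a + negB;  // adding the complement performs the subtraction
            System.out.println(difference);   // prints 12
            System.out.println(a - b);        // same result using ordinary subtraction
        }
    }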
In the early 1800s inventors were just beginning to build the power-driven machinery that would fuel the industrial revolution. One of these inventors, Joseph Marie Jacquard, invented a loom in 1801 that revolutionized the weaving industry. Although it was not the first mechanical loom, Jacquard's loom was revolutionary in that it could be used to weave complex and intricate patterns automatically. The key idea behind the loom was that the pattern to be woven into the cloth was encoded by holes punched in a card. A group of these cards, literally strung together, provided the information required to control the actions of the loom. The Jacquard loom required fewer people and little skill to operate, and versions of the loom are still in use today. The Jacquard loom also had a profound impact on computing in that it was one of the first devices that could be programmed. The loom gave birth to the concept of punched cards, which played a fundamental role in the early days of computing.

Figure 1-2 The Jacquard Loom, photograph by Frank da Cruz (http://www.columbia.edu/acis/history/jacquard.html)

Charles Babbage, a mathematician and inventor, grew tired of calculating astronomical tables by hand, and conceived of a way to build a mechanical device to perform the calculations automatically. In 1822 Babbage started work on a computing device, the difference engine, to automatically calculate mathematical tables. During the course of his work on the difference engine, he conceived of a more sophisticated machine he called the analytical engine. The analytical engine was meant to be programmed using punched cards, and would employ features such as sequential control, branching, and looping. Although Babbage never built a complete working model of either machine, his work became the basis on which many modern computers are built. (One of Babbage's earlier difference engines was eventually constructed from drawings by a team at London's Science Museum in the 1990s. The machine weighs several tons and is roughly 10 feet wide.)

Figure 1-3 Jacquard Loom Cards, photograph by Doug Jones (http://www.cs.uiowa.edu/~jones/cards/history.html)

In his work on the analytical engine, Babbage made an important intellectual leap regarding the punched cards. In the Jacquard loom, the presence or absence of each hole in the card physically allows a colored thread to pass or stops that thread. Babbage realized that the pattern of holes could be used to represent an abstract idea such as a problem statement or the raw data required for that problem's solution. Because of the connection to the Jacquard loom, Babbage called the two main parts of his analytical engine the "Store" and the "Mill", as both terms are used in the weaving industry. The Store was where numbers were held, and the Mill was where they were "woven" into new results. In a modern computer these same parts are called the memory unit and the central processing unit (CPU). Perhaps the key concept that separated the analytical engine from its predecessors was that it supported conditional program execution. This allows the machine to determine what to do next, based upon a condition or situation that is detected at the very moment the program is running.

Augusta Ada Byron, the Countess of Lovelace, was a mathematician who worked with Charles Babbage on his analytical engine. Unlike Babbage, who was interested in building a computing device, Lovelace sought to understand and reason about methods for computing. She studied these methods, their implementations, and the properties of their implementations. Lovelace even developed a program that would have been able to compute the Bernoulli numbers. (Bernoulli numbers comprise a sequence of rational numbers that have many roles in mathematics and number theory.)
In her published analysis of the analytical engine, Lovelace outlined the fundamentals of computer programming, including looping and memory addressing. The influence of the Jacquard loom on her work was evident in her writing: "We may say most aptly that the analytical engine weaves algebraic patterns just as the Jacquard loom weaves flowers and leaves." It is because of this work that many consider Lovelace to be the world's first programmer. The US Department of Defense named the computer language Ada in honor of Lovelace's work as a programmer.

Figure 1-4 Ada Lovelace (http://www-groups.dcs.st-and.ac.uk/~history/PictDisplay/Lovelace.html)

The 1890 census of the United States proved another milestone in the history of computing when punch cards were used with automatic sorting and tabulating equipment invented by Herman Hollerith to speed the compilation of the data. His machines reduced the time required for a full compilation of census results from 10 years to a matter of months, and saved $5,000,000 in costs to the census bureau. Building on the success of his equipment with the US Census Bureau, Hollerith founded the Tabulating Machine Company in 1896. After merging with two other companies and changing its name, the company became known as the International Business Machines (IBM) Corp. The punch card remained a staple of data storage well into the 20th century.

Figure 1-5 Hollerith Tabulator & Sorter, photograph IBM Corporate Archives

The 1940s were a decade of dramatic events for the world. World War II changed the face of the world and many lives forever. Although terrible atrocities were taking place during this period, it was also a time of innovation and invention in computing. During the 1940s the first electronic computers were built, primarily to support the war. Unfortunately the clouds of war make it difficult to determine exactly who invented the computer first.

Legally, at least in the United States, John Atanasoff is credited as being the inventor of the computer. Atanasoff was a professor of mathematics and physics at Iowa State. Atanasoff was frustrated at the difficulty his graduate students were having finding solutions to large systems of simultaneous algebraic equations for solving differential equations. Like Babbage, almost 100 years earlier, Atanasoff believed that he could build a machine to solve these equations. Working with graduate student Clifford Berry, Atanasoff completed a prototype of his machine near the end of 1939. Atanasoff and Berry sought simplicity in their computer. The Atanasoff–Berry Computer (ABC) used only 300 vacuum tubes and was capable of performing arithmetic electronically. Perhaps what is most important about this particular machine is that it operated on base-2 (binary) numbers. The ABC did not implement the stored program idea, however, so it was not a general-purpose computer.

Figure 1-6 Clifford Berry with the ABC (www.scl.ameslab.gov/ABC/Progress.html)

During the same time period, Howard Aiken was working on the Mark I computer at Harvard University. As completed in 1944, the Mark I contained more than 750,000 parts, including switches, relays, rotating shafts, and clutches. The machine was huge—51 feet long and weighing several tons. It had 500 miles of wiring and three million wire connections. The machine sounded like a "roomful of ladies knitting" when it was running. Aiken showed that it was possible to build a large-scale automatic computer capable of reliably executing a program.
Figure 1-7 The Aiken/IBM Mark I Computer installed at Harvard, photograph IBM Corporate Archives

One of the people who worked with Aiken on the Mark I was Grace Murray Hopper, a freshly commissioned lieutenant in the US Naval Reserve. Hopper was involved with programming the Mark I from the very start. One of her most significant contributions to the field of computing was the concept of a compiler. Hopper was troubled by the mistake-plagued nature of code writing, and developed a piece of software that would translate an entire set of programmer's instructions, written in a high-level symbolic language, into the machine's language. The first compiler developed by Hopper was named A-0, and was written in 1952.

Grace Murray Hopper is also credited as the individual who coined the term "bug." During the summer of 1947 the Mark II computer, a successor to the Mark I, was acting strangely. At times it would produce the correct answer, and at other times the same program would produce erroneous results. Hopper traced the problem down to a faulty relay within the computer. When she physically examined the relay to correct the problem, she discovered that a moth had been trapped in the relay, causing it to malfunction. Once she removed the moth from the relay, the machine functioned normally. The "bug" was taped onto a page of the laboratory's notebook with the inscription "First actual bug found."

Figure 1-8 The first computer bug (http://www.history.navy.mil/photos/images/h96000/h96566kc.htm)

After World War II ended, the Allies discovered that Konrad Zuse, a German engineer, had been developing computers for use by the Germans. Zuse's first computer, the Z1, was built between 1936 and 1938. The machine contained all of the parts of a modern computer; however, it was not reliable. Its mechanical construction was very complex and error-prone. Zuse's Z3 was the first fully functional program-controlled computer in the world. The Z3 was finished in 1941 and predated Aiken's Mark I. Zuse's accomplishments are all the more incredible given the material and worker shortages in Germany during World War II. Zuse couldn't even obtain paper tape, so he had to make his own by punching holes in discarded movie film. Zuse also invented what might be the first high-level computer language, "Plankalkul", though it, too, was unknown outside Germany.

The work done by the code breakers at Bletchley Park (between London and Birmingham, UK) during World War II provided the Allies with information that literally turned the tide of the war. Computers played a vital role in the work of the code breakers and made it possible for them to break the Enigma and Lorenz ciphers. Colossus, a computer developed at Bletchley Park to break ciphers, became operational in 1943. Colossus was one of the first major computers to employ vacuum tubes, and was capable of reading information stored on paper tape at a rate of 5000 characters per second. Colossus also featured limited programmability.

When the Allies invaded North Africa in 1942, they discovered that the firing tables they used to aim their artillery were off. This resulted in requests for new ballistics tables that exceeded the ability to compute them. John Mauchly and J. Presper Eckert used this opportunity to propose the development of an electronic high-speed vacuum tube computer. Even though many experts predicted that, given the number of vacuum tubes in the machine, it would only run for five minutes without stopping, they were able to obtain the funding to build the machine.
Under a cloak of secrecy, they started work on the machine in the spring of 1943. They completed their work on the machine in 1946. The result was the Electronic Numerical Integrator Analyzer and Computer (ENIAC), a machine that weighed 30 tons and was built using 17,468 vacuum tubes and 6000 switches. The machine was more than 1000 times faster than any machine built to date. Unlike modern computers, reprogramming ENIAC required a rewiring of the basic circuits in the machine. ENIAC heralded the dawning of the computer age.

Soon after ENIAC became functional, Mauchly and Eckert formed the Electronic Control Corporation (ECC) and received contracts from the government to design and build a computer for the Bureau of the Census. ECC developed financial difficulties and as a result sold its patents to, and became part of, the Remington Rand Corporation. In 1951 Remington Rand delivered the Universal Automatic Computer (UNIVAC) to the census bureau. UNIVAC was the fastest computer of the time and was the only commercially available general-purpose computer. It contained only 5000 vacuum tubes and was more compact than its predecessors. UNIVAC computers were sold to government agencies, the A.C. Nielsen Company (market researchers), and Prudential Insurance. By 1957 Remington Rand had sold over 40 machines.

Probably what made UNIVAC most famous was its use by CBS to predict the results of the 1952 presidential election. Opinion polls predicted that Adlai Stevenson would beat Dwight D. Eisenhower by a landslide. UNIVAC's analysis of early returns, however, showed a clear victory for Eisenhower. Newscasters Walter Cronkite and Charles Collingwood questioned the validity of the computer's forecast, so they postponed announcing UNIVAC's prediction until very late.

For many years, Mauchly and Eckert were considered the inventors of the electronic computer. In fact they applied for, and received, a patent for their work in 1947. After purchasing ECC, Remington Rand owned the rights to their patent and was collecting royalties from firms building computers. In a legal battle, initiated by Honeywell's refusal to pay royalties, a judge ruled the original patent invalid. Part of his decision to invalidate the patent was based on the fact that Mauchly had visited John Atanasoff's laboratory in 1941, and used the knowledge he gained during the visit to build ENIAC. The results of this lawsuit legally established John Atanasoff as the inventor of the modern computer.

After the war, commercial development of computers continued, resulting in the development of many new machines that provided improved performance in terms of computing capability and speed. Computers at this time were large, cumbersome devices that were capable of performing simple operations. These machines were very expensive to build and maintain. The only organizations that could afford to purchase and run the equipment were the government and large corporations. Not surprisingly, many individuals working in the computing field felt that the use of computers would be limited. In a 1950 article, Business Week noted, "Salesmen will find the market limited. The UNIVAC is not the kind of machine that every office could use." And though the story is probably apocryphal, the lore of computing attributes the following prediction to Thomas Watson, the founder of IBM, in 1943: "I think there is a world market for maybe five computers."
In the early 1950s, a group of scientists working at Bell Laboratories in New Jersey was studying the behavior of crystals as semiconductors in an attempt to replace vacuum tubes. Its work resulted in the development of the transistor, which changed the way computers and many electronic devices were built. Transistors switch and modulate electric current in much the same way as a vacuum tube. Using transistors instead of vacuum tubes in computers resulted in machines that were much smaller and cheaper, and that required considerably less electricity to operate. The transistor is one of the most important inventions of the 20th century.

While computer companies such as IBM and Honeywell focused on the development of mainframe computers, Digital Equipment Corporation (DEC) focused on the development of smaller computers. DEC's PDP series of computers were small and designed to serve the computing needs of laboratories. The PDP-8 was one of the first computers purchased by end users. Because of their low cost and portability, these machines could be purchased to fill a specific need. The PDP-8 is generally regarded as the first minicomputer.

The invention of the integrated circuit caused the trend toward smaller, cheaper, and faster computers to accelerate. Popular Electronics featured an article on a kit that home hobbyists could purchase that would enable them to build a computer at home. This machine, offered first in 1974, was the Altair 8800, manufactured by a company named MITS. It ushered in the personal computer era. These initial machines were designed to be built at home, which was fine for the home hobbyist but limited the availability of the machine. The first programming language for the Altair was Altair BASIC, the first product of a little company called Microsoft. In 1981 IBM introduced its personal computer, or PC, which changed the face of computing forever. It was now possible for individuals to simply purchase these machines and use them at home.

A ROADMAP

After reading this chapter, you should realize that there is much more to computer science than simply writing programs. Like any professional, a computer scientist must have an understanding of all of the subdisciplines of the field. Some of the major disciplines of computer science are algorithms, programming, programming languages, computer hardware, networking, operating systems, database systems, distributed computing, and the ethical issues surrounding the use of computer technology.

There are two major schools of thought when it comes to the education of computer scientists. The depth-first approach is to study one particular topic in depth. For example, many computer science degree programs start out with a course in programming. After taking such a course, students will be proficient programmers, but clearly they will not have enough knowledge of the other subdisciplines of the field to be considered computer scientists. A second approach is to cover many of the subdisciplines of computer science, but only to the depth required to teach a basic understanding of the principles of each discipline. After obtaining an overall view of the field, students will then study certain subdisciplines in depth. This is referred to as the breadth-first approach, and is the approach we chose to use in this book.

The organization of this text follows the description of computing given in the first section of this chapter. It begins with a discussion of algorithms, how they are developed, and how they may be compared. We also introduce a formal model of computation.
After reading this chapter you will have a basic understanding of algorithm development and will be able to develop algorithms to solve simple problems.

After studying algorithms, the text will focus on the basics of computer hardware. In this chapter you will learn what the major components of the computer are and how they work together. You will also learn about the binary number system and see how it can be used to encode information at the hardware level.

The next two chapters will focus on programming. We will first study software in general and discuss how high-level languages can be constructed to provide models in which algorithms can be expressed, and ultimately expressed in a way that the hardware can work with. In the next chapter we will focus on programming using the programming language Java. The goal of this chapter is not to make you an expert programmer, but instead to introduce you to the basics of programming using a language that is readily available and in wide use.

After learning the fundamentals of programming we will focus on operating systems, networking, and databases. The topics covered in these chapters will address common techniques used to manage computer hardware, provide access to network resources, and manage and store data. Almost every modern computer application uses the technologies discussed in these chapters.

The last chapter in the book will discuss some of the social issues of computing. In this chapter we will discuss intellectual property rights and conflicts, privacy of data, "hacking," and viruses. We will also discuss our professional responsibilities when lives depend on the systems on which we work.

REVIEW QUESTIONS

1.1 Write an algorithm for your morning routine, from the time the alarm clock rings until you leave the house for work or school.

1.2 Find or invent an algorithm to calculate the square root of any number. Apply the algorithm to the number 2046, finding its square root to several decimal places. Do not use a computer or calculator!

1.3 Perl is a computer language that is often used for quick, one-off programming jobs, like converting text in a document from one format to another. Ada is a language used for Department of Defense applications where human life may be at stake. What differences would you expect to find when you compare Perl with Ada?

1.4 Why might a computer scientist with a primary interest in databases also need to know about networking?

1.5 The acronym API stands for Application Programming Interface. What do you suppose API means with respect to an operating system?