Computing Networks Computing Networks from cluster to cloud computing Pascale Vicat-Blanc Sébastien Soudan Romaric Guillier Brice Goglin First published 2011 in Great Britain and the United States by ISTE Ltd and John Wiley & Sons, Inc Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced, stored or transmitted, in any form or by any means, with the prior permission in writing of the publishers, or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address: ISTE Ltd 27-37 St George’s Road London SW19 4EU UK John Wiley & Sons, Inc 111 River Street Hoboken, NJ 07030 USA www.iste.co.uk www.wiley.com © ISTE Ltd 2011 The rights of Pascale Vicat-Blanc, Sébastien Soudan, Romaric Guillier, Brice Goglin to be identified as the authors of this work have been asserted by them in accordance with the Copyright, Designs and Patents Act 1988 Library of Congress Cataloging-in-Publication Data Reseaux de calcul English Computing networks : from cluster to cloud computing / Pascale Vicat-Blanc [et al.] p cm Includes bibliographical references and index ISBN 978-1-84821-286-2 Computer networks I Vicat-Blanc, Pascale II Title TK5105.5.R448613 2011 004.6 dc22 2011006658 British Library Cataloguing-in-Publication Data A CIP record for this book is available from the British Library ISBN 978-1-84821-286-2 Printed and bound in Great Britain by CPI Antony Rowe, Chippenham and Eastbourne Table of Contents Introduction Chapter From Multiprocessor Computers to the Clouds 1.1 The explosion of demand for computing power 1.2 Computer clusters 1.2.1 The emergence of computer clusters 1.2.2 Anatomy of a computer cluster 1.3 Computing grids 1.3.1 High-performance computing grids 1.3.2 Peer-to-peer computing grids 1.4 Computing in a cloud 1.5 Conclusion Chapter Utilization of Network Computing Technologies 2.1 Anatomy of a distributed computing application 2.1.1 Parallelization and distribution of an algorithm 2.1.1.1 Embarrassingly parallel applications 2.1.1.2 Fine-grained parallelism 2.1.2 Modeling parallel applications 2.1.3 Example of a grid application 2.1.4 General classification of distributed applications 13 21 21 24 24 24 26 29 30 32 36 39 39 41 42 43 44 44 47 Computing Networks 2.1.4.1 Widely distributed computing 2.1.4.2 Loosely coupled computing 2.1.4.3 Pipeline computing 2.1.4.4 Highly synchronized computing 2.1.4.5 Interactive and collaborative computing 2.1.4.6 Note 2.2 Programming models of distributed parallel applications 2.2.1 Main models 2.2.2 Constraints of fine-grained-parallelism applications 2.2.3 The MPI communication library 2.3 Coordination of distributed resources in a grid 2.3.1 Submission and execution of a distributed application 2.3.2 Grid managers 2.4 Conclusion 48 49 50 50 51 51 52 52 53 54 57 57 59 60 Chapter Specificities of Computing Networks 63 3.1 Typology of computing networks 3.1.1 Cluster networks 3.1.2 Grid networks 3.1.3 Computing cloud networks 3.2 Network transparency 3.2.1 The advantages of transparency 3.2.2 Foundations of network transparency 3.2.3 The limits of TCP and IP in clusters 3.2.4 Limits of TCP and network transparency in grids 3.2.5 TCP in a high bandwidth-delay product network 3.2.6 Limits of the absence of communication control 3.3 Detailed analysis of characteristics expected from protocols 3.3.1 Topological criteria 63 65 65 67 68 68 69 72 75 75 76 78 78 Table of Contents 3.3.1.1 Number of sites involved 3.3.1.2 Number of users involved 3.3.1.3 Resource-localization constraints 3.3.2 Performance criteria 3.3.2.1 Degree of inter-task coupling 3.3.2.2 Sensitivity to latency and throughput 3.3.2.3 Sensitivity to throughput and its control 3.3.2.4 Sensitivity to confidentiality and security 3.3.2.5 Summary of requirements 3.4 Conclusion Chapter The Challenge of Latency in Computing Clusters 4.1 Key principles of high-performance networks for clusters 4.2 Software support for high-performance networks 4.2.1 Zero-copy transfers 4.2.2 OS-bypass 4.2.3 Event notification 4.2.4 The problem of address translation 4.2.5 Non-blocking programming models 4.2.5.1 Case 1: message-passing 4.2.5.2 Case 2: remote access model 4.3 Description of the main high-performance networks 4.3.1 Dolphins SCI 4.3.2 Myricom Myrinet and Myri-10G 4.3.3 Quadrics QsNet 4.3.4 InfiniBand 4.3.5 Synthesis of the characteristics of high-performance networks 4.4 Convergence between fast and traditional networks 4.5 Conclusion 78 79 79 80 80 81 83 84 84 85 87 88 90 90 90 91 93 95 96 97 99 99 100 104 105 107 108 111 Chapter The Challenge of Throughput and Distance 5.1 Obstacles to high rate 5.2 Operating principle and limits of TCP congestion control 113 113 115 Computing Networks 5.2.1 Slow Start 5.2.2 Congestion avoidance 5.2.3 Fast Retransmit 5.2.4 Analytical model 5.3 Limits of TCP over long distances 5.4 Configuration of TCP for high speed 5.4.1 Hardware configurations 5.4.2 Software configuration 5.4.3 Parameters of network card drivers 5.5 Alternative congestion-control approaches to that of standard TCP 5.5.1 Use of parallel flows 5.5.2 TCP modification 5.5.2.1 Slow Start modifications 5.5.2.2 Methods of congestion detection 5.5.2.3 Bandwidth-control methods 5.5.3 UDP-based approaches 5.6 Exploration of TCP variants for very high rate 5.6.1 HighSpeed TCP 5.6.2 Scalable 5.6.3 BIC-TCP 5.6.4 H-TCP 5.6.5 CUBIC 5.7 Conclusion 116 117 117 119 120 122 123 124 126 126 127 129 129 130 131 132 133 133 134 134 135 135 136 Chapter Measuring End-to-End Performances 139 6.1 Objectives of network measurement and forecast in a grid 6.1.1 Illustrative example: network performance and data replication 6.1.2 Objectives of a performance-measurement system in a grid 6.2 Problem and methods 6.2.1 Terminology 6.2.2 Inventory of useful characteristics in a grid 6.2.3 Measurement methods 139 140 143 144 145 149 152 Table of Contents 6.2.3.1 Active method 6.2.3.2 Passive method 6.2.3.3 Measurement tools 6.3 Grid network-performance measurement systems 6.3.1 e2emonit 6.3.2 PerfSONAR 6.3.3 Architectural considerations 6.3.4 Sensor deployment in the grid 6.3.5 Measurement coordination 6.4 Performance forecast 6.4.1 The Network Weather Service tool 6.4.2 Network-cost function 6.4.3 Formulating the cost function 6.4.4 Estimate precision 6.5 Conclusion 152 152 154 155 155 155 156 160 161 164 164 166 167 169 170 Chapter Optical Technology and Grids 171 7.1 Optical networks and switching paradigms 7.1.1 Optical communications 7.1.1.1 Wavelength multiplexing 7.1.1.2 Optical add-drop multiplexers 7.1.1.3 Optical cross-connect 7.1.2 Optical switching paradigms 7.1.2.1 Optical packet switching 7.1.2.2 Optical burst switching 7.1.2.3 Optical circuit switching 7.1.3 Conclusion 7.2 Functional planes of transport networks 7.2.1 Data plane 7.2.2 Control plane 7.2.2.1 Routing 7.2.2.2 Signaling 7.2.3 Management plane 7.2.4 Conclusion 7.3 Unified control plane: GMPLS/automatic switched transport networks 172 172 173 174 175 176 176 177 177 179 179 181 182 182 182 182 184 184 10 Computing Networks 184 185 187 188 Chapter Bandwidth on Demand 189 7.3.1 7.3.2 7.3.3 7.3.4 Label-switching Protocols: OSPF-TE/RSVP-TE/LMP/PCEP GMPLS service models Conclusion 8.1 Current service model: network neutrality 8.1.1 Structure 8.1.2 Limits and problems 8.1.3 Conclusion 8.2 Peer model for bandwidth-delivery services 8.2.1 UCLP/Ca*net 8.2.2 GLIF 8.2.3 Service-oriented peer model 8.2.4 Conclusion 8.3 Overlay model for bandwidth-providing services 8.3.1 GNS-WSI 8.3.2 Carriocas 8.3.3 StarPlane 8.3.4 Phosphorus 8.3.5 DRAGON 8.3.6 Conclusion 8.4 Bandwidth market 8.5 Conclusion 190 191 192 193 194 194 194 195 196 196 196 197 198 198 198 199 200 201 Chapter Security of Computing Networks 203 9.1 Introductory example 9.2 Principles and methods 9.2.1 Security principles 9.2.2 Controlling access to a resource 9.2.3 Limits of the authentication approach 9.2.4 Authentication versus authorization 9.2.5 Decentralized approaches 9.3 Communication security 9.4 Network virtualization and security 203 205 206 207 209 210 211 212 213 Table of Contents 11 9.4.1 Classic network-virtualization approaches 9.4.2 The HIP protocol 9.5 Conclusion 213 215 216 Chapter 10 Practical Guide for the Configuration of High-speed Networks 217 10.1 Hardware configuration 10.1.1 Buffer memory 10.1.2 PCI buses 10.1.3 Computing power: CPU 10.1.4 Random access memory: RAM 10.1.5 Disks 10.2 Importance of the tuning of TCP parameters 10.3 Short practical tuning guide 10.3.1 Computing the bandwidth delay product 10.3.2 Software configuration 10.3.3 Other solutions 10.4 Use of multi-flow 10.5 Conclusion 218 218 218 219 220 220 221 222 223 224 225 226 228 229 Bibliography 235 Acronyms and Definitions 251 Index 263 Conclusion: From Grids to the Future Internet " " " " " ... Specificities of Computing Networks 63 3.1 Typology of computing networks 3.1.1 Cluster networks 3.1.2 Grid networks 3.1.3 Computing cloud networks ... growing computing power demand These technologies embody the revolution of network computing: computer clusters, computing grids and computing clouds A quick definition of these is as follows: – a computing. .. – computer clusters; – computing grids; and – computing and storage clouds 24 Computing Networks In the following sections, we analyze the specificities of these different network computing technologies