[...]... choosing the right design to solve your problem with MapReduce extremely important and can yield performance gains of several orders of magnitude. Before we dive into some design patterns in the chapters following this one, we’ll talk a bit about how and why design patterns and MapReduce together make sense, and a bit of a history lesson of how we got here.

Design Patterns

Design patterns have been making... solving, come unique design patterns.

What is a MapReduce design pattern? It is a template for solving a common and general data manipulation problem with MapReduce. A pattern is not specific to a domain such as text processing or graph analysis, but it is a general approach to solving a problem. Using design patterns is all about using tried and true design principles to build better software. Designing good...

Preface

Welcome to MapReduce Design Patterns! This book will be unique in some ways and familiar in others. First and foremost, this book is obviously about design patterns, which are templates or general guides to solving problems. We took a look at other design patterns books that have been written in the past as inspiration, particularly Design Patterns: Elements of Reusable Object-Oriented...

added by Pig or Hive will matter, in which case you should use Java MapReduce. Pig and Hive are likely to influence MapReduce design patterns more than anything else. New feature requests in Pig and Hive will likely translate down into something that could be a design pattern in MapReduce. Likewise, as more design patterns are developed for MapReduce, some of the more popular ones will become first-class...
have patterns of their own and experts will start documenting more as they solve more problems. Hive has the benefit of building off of decades of SQL patterns, but not all patterns in SQL are smart in Hive and vice versa. Perhaps as these platforms gain more popularity, cookbook and design pattern books will be written for them.

CHAPTER 2 Summarization Patterns...

part, the MapReduce design patterns in this book are intended to be platform independent. MapReduce, being a paradigm published by Google without any actual source code, has been reimplemented a number of times, both as a standalone system (e.g., Hadoop, Disco, Amazon Elastic MapReduce) and as a query language within a larger system (e.g., MongoDB, Greenplum DB, Aster Data). Even if design patterns...

concepts conveyed in the book useful in modern languages such as Ruby and Python. The patterns in this book should be usable with systems other than Hadoop. You’ll just have to use the code examples as a guide to developing your own code.

MapReduce History

How did we get to the point where a MapReduce design patterns book is a good idea? At a certain point, the community’s momentum and widespread...

opers with a common language in which to communicate verbally and through code. Simply saying “abstract factory” is easier than explaining what an abstract factory is over and over. Also, when looking at a stranger’s code that implements an abstract factory, you already have a general understanding of what the code is trying to accomplish. MapReduce design patterns...
arbitrary and is very important to what the MapReduce job is accomplishing. The key is what the data will be grouped on and the value is the information pertinent to the analysis in the reducer. Plenty of detail will be provided in the design patterns in this book to explain what and why the particular key/value is chosen. One major differentiator between MapReduce design patterns is the semantics of this pair...

MapReduce design patterns in an ecosystem with Hive and Pig. However, we would like to take this opportunity early in the book to explain why MapReduce design patterns are still important. Pig and Hive are higher-level abstractions of MapReduce. They provide an interface that has nothing to do with “map” or “reduce,” but the systems interpret the higher-level language into a series of MapReduce jobs. Much...
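
The role of the key/value pair described in the excerpt can be illustrated with a small sketch. This is not code from the book, and it does not use Hadoop: it is an in-memory Python stand-in, and every name in it (`map_record`, `reduce_group`, `run_mapreduce`, the sample records) is hypothetical. It assumes each input record is a `(user, minutes)` tuple, so the key chosen by the mapper is the user to group on and the value is the number the reducer sums.

```python
from collections import defaultdict


def map_record(record):
    """Emit one (key, value) pair: the key is what we group on,
    the value is what the reducer needs for its analysis."""
    user, minutes = record  # assumed record layout, for illustration only
    yield user, minutes


def reduce_group(key, values):
    """Receive every value that shares a key and summarize them."""
    return key, sum(values)


def run_mapreduce(records, mapper, reducer):
    """Simulate the map, group-by-key (shuffle), and reduce phases in memory."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    # Sort by key so the output order is deterministic.
    return [reducer(key, values) for key, values in sorted(groups.items())]


records = [("alice", 30), ("bob", 12), ("alice", 5)]
result = run_mapreduce(records, map_record, reduce_group)
print(result)
```

Changing only what `map_record` emits as the key (for example, a date instead of a user) changes what the job groups on, which is the sense in which the semantics of the pair distinguish one pattern from another.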
Donald Miner and Adam Shook
MapReduce Design Patterns
ISBN: 978-1-449-32717-0
Copyright. ix
1. Design Patterns and MapReduce
Design Patterns
MapReduce