www.it-ebooks.info

Working with Microsoft® FAST™ Search Server 2010 for SharePoint®

Mikael Svenson
Marcus Johansson
Robert Piddocke

Published with the authorization of Microsoft Corporation by:
O’Reilly Media, Inc.
1005 Gravenstein Highway North
Sebastopol, California 95472

Copyright © 2012 by Mikael Svenson, Marcus Johansson, Robert Piddocke

All rights reserved. No part of the contents of this book may be reproduced or transmitted in any form or by any means without the written permission of the publisher.

ISBN: 978-0-7356-6222-3

1 2 3 4 5 6 7 8 9 LSI 7 6 5 4 3 2

Printed and bound in the United States of America.

Microsoft Press books are available through booksellers and distributors worldwide. If you need support related to this book, email Microsoft Press Book Support at mspinput@microsoft.com. Please tell us what you think of this book at http://www.microsoft.com/learning/booksurvey.

Microsoft and the trademarks listed at http://www.microsoft.com/about/legal/en/us/IntellectualProperty/Trademarks/EN-US.aspx are trademarks of the Microsoft group of companies. All other marks are property of their respective owners.

The example companies, organizations, products, domain names, email addresses, logos, people, places, and events depicted herein are fictitious. No association with any real company, organization, product, domain name, email address, logo, person, place, or event is intended or should be inferred.

This book expresses the author’s views and opinions. The information contained in this book is provided without any express, statutory, or implied warranties. Neither the authors, O’Reilly Media, Inc., Microsoft Corporation, nor its resellers, or distributors will be held liable for any damages caused or alleged to be caused either directly or indirectly by this book.

Acquisitions and Developmental Editor: Russell Jones
Production Editor: Holly Bauer
Editorial Production: Online Training Solutions, Inc.
Technical Reviewer: Thomas Svensen
Copyeditor: Jaime Odell, Online Training Solutions, Inc.
Indexer: Judith McConville
Cover Design: Twist Creative • Seattle
Cover Composition: Karen Montgomery
Illustrator: Jeanne Craver, Online Training Solutions, Inc.

Contents at a Glance

Foreword
Introduction

PART I WHAT YOU NEED TO KNOW
CHAPTER 1 Introduction to FAST Search Server 2010 for SharePoint
CHAPTER 2 Search Concepts and Terminology
CHAPTER 3 FS4SP Architecture
CHAPTER 4 Deployment
CHAPTER 5 Operations

PART II CREATING SEARCH SOLUTIONS
CHAPTER 6 Search Configuration
CHAPTER 7 Content Processing
CHAPTER 8 Querying the Index
CHAPTER 9 Useful Tips and Tricks
CHAPTER 10 Search Scenarios

Index

What do you think of this book? We want to hear from you! Microsoft is interested in hearing your feedback so we can continually improve our books and learning resources for you. To participate in a brief online survey, please visit: microsoft.com/learning/booksurvey

Contents

Foreword
Introduction

PART I WHAT YOU NEED TO KNOW

Chapter 1 Introduction to FAST Search Server 2010 for SharePoint
    What Is FAST?
    Past
    Present
    Future
    Versions
    SharePoint Search vs. Search Server Versions, and FS4SP
    Features at a Glance
    Explanation of Features
    What Should I Choose?
    Evaluating Search Needs
    Decision Flowchart
    Features Scorecard
    Conclusion

Chapter 2 Search Concepts and Terminology
    Overview
    Relevancy
    SharePoint Components
    Content Processing
    Content Sources
    Crawling and Indexing
    Metadata
    Index Schema
    Query Processing
    QR Server
    Refiners (Faceted Search)
    Query Language
    Search Scopes
    Security Trimming
    Claims-Based Authentication
    Conclusion

Chapter 3 FS4SP Architecture
    Overview
    Server Roles and Components
    FS4SP Architecture
    Search Rows, Columns, and Clusters
    FS4SP Index Servers
    FS4SP Query Result Servers/QR Server
    Conclusion

Chapter 4 Deployment
    Overview
    Hardware Requirements
    Storage Considerations
    FS4SP and Virtualization
    Software Requirements
    Installation Guidelines
    Before You Start
    Software Prerequisites
    FS4SP Preinstallation Configuration
    FS4SP Update Installation
    FS4SP Slipstream Installation
    Single-Server FS4SP Farm Configuration
    Deployment Configuration
    Multi-Server FS4SP Farm Configuration
    Manual and Automatic Synchronization of Configuration Changes
    Certificates and Security
    Creating FAST Content SSAs and FAST Query SSAs
    Enabling Queries from SharePoint to FS4SP
    Creating a Search Center
    Scripted Installation
    Advanced Filter Pack
    IFilter
    Replacing the Existing SharePoint Search with FS4SP
    Development Environments
    Single-Server Farm Setup
    Multi-Server Farm Setup
    Physical Machines
    Virtual Machines
    Booting from a VHD
    Production Environments
    Content Volume
    Failover and High Availability
    Query Throughput
    Freshness
    Disk Sizing
    Server Load Bottleneck Planning
    Conclusion

Chapter 5 Operations
    Introduction to FS4SP Operations
    Administration in SharePoint
    Administration in Windows PowerShell
    Other Means of Administration
    Basic Operations
    The Node Controller
    Indexer Administration
    Search Administration
    Search Click-Through Analysis
    Link Analysis
    Server Topology Management
    Modifying the Topology on the FS4SP Farm
    Modifying the Topology on the SharePoint Farm
    Changing the Location of Data and Log Files
    Logging
    General-Purpose Logs
    Functional Logs
    Performance Monitoring
    Identifying Whether an FS4SP Farm Is an Indexing Bottleneck
    Identifying Whether the Document Processors Are the Indexing Bottleneck
    Identifying Whether Your Disk Subsystem Is a Bottleneck
    Backup and Recovery
    Prerequisites
    Backup and Restore Configuration
    Full Backup and Restore
    Conclusion

PART II CREATING SEARCH SOLUTIONS

Chapter 6 Search Configuration
    Overview of FS4SP Configuration
    SharePoint Administration
    Windows PowerShell Administration
    Code Administration
    Other Means of Administration
    Index Schema Management
    The Index Schema
    Crawled and Managed Properties
    Full-Text Indexes and Rank Profiles
    Managed Property Boosts
    Static Rank Components
    Collection Management
    Windows PowerShell
    .NET
    Scope Management
    SharePoint
    Windows PowerShell
    .NET
    Property Extraction Management
    Built-in Property Extraction
    Keyword, Synonym, and Best Bet Management
    Keywords
    Site Promotions and Demotions
    FQL-Based Promotions
    User Context Management
    SharePoint
    Windows PowerShell
    Adding More Properties to User Contexts
    Conclusion

Chapter 7 Content Processing
    Introduction
    Crawling Source Systems
    Crawling Content by Using the SharePoint Built-in Connectors
    Crawling Content by Using the FAST Search Specific Connectors
    Choosing a Connector

[...]

... familiar with C#, consider reading John Sharp’s Microsoft Visual C# 2010 Step by Step (Microsoft Press, 2010). If you are not yet familiar with SharePoint and Windows PowerShell, in addition to the numerous references you’ll find cited in the book, you should read Bill English’s Microsoft SharePoint 2010 Administrator’s Companion (Microsoft Press, 2010). Working with Microsoft FAST Search Server 2010 for SharePoint...

... Engineer at Microsoft

Introduction

Microsoft FAST Search Server 2010 for SharePoint (FS4SP) is Microsoft's flagship enterprise search product and one of the most capable enterprise search platforms available. It provides a feature-rich alternative to the limited out-of-the-box search experience in Microsoft SharePoint 2010 and can be extended to meet complex information...
operations. Part II, “Creating Search Solutions,” covers configuration, indexing, searching, useful tips and tricks, and example search scenarios. Part I is relevant for anyone working with FS4SP. Part II is primarily relevant for people creating and setting up search solutions.

Finding Your Best Starting Point in This Book

The two parts of Working with Microsoft FAST Search Server 2010 for SharePoint are intended...

■ Understand the roots of FAST Search & Transfer and FAST Enterprise Search
■ Understand the differences, advantages, and disadvantages of FSIS, FSIA, and FS4SP
■ Compare and choose the FAST product that best fits your business needs

This chapter provides an introduction to FAST, and specifically to Microsoft FAST Search Server 2010 for SharePoint (FS4SP). It includes a brief history of FAST Search & Transfer, which...

... Server 2010 for SharePoint (FS4SP)

Important: FSIA and FSIS have been removed from the product list and are no longer officially for sale to new customers. We will still explain all the product offerings because we expect elements from FSIS to move into FS4SP in later versions.

FSIS

FAST Search Server 2010 for Internet... licensed

FSIA

FAST Search for Internal Applications (FSIA) was FAST ESP 5.3 with SP3 but licensed for internal use. As such, FSIA was nothing else than the pre-Microsoft ESP but without the complicated and often confusing features and performance-based license that were used before Microsoft moved FAST over to server-based licenses. This product and its features will not likely reappear in any form because...

... retrieval, so he pushed FAST and its technology into the enterprise search market. In 2000, FAST developed FAST DataSearch (FDS), which it supported until version 4. After that, it rebranded the product suite as FAST Enterprise Search Platform (ESP), which was released on January 27, 2004. FAST ESP released updates until version 5.3, which is the present version. FAST ESP later became FAST Search for Internet Sites...

... people working on enterprise search as FAST did before the acquisition. In fact, Microsoft made FAST its flagship search product and split the FAST ESP 5.3 product into two search offerings: FSIS and FSIA. ESP 5.3 was also used as the basis for FS4SP.

Microsoft is actively developing and integrating FAST while continuing to support existing customers. FAST...

Chapter 1 Introduction to FAST Search Server 2010 for SharePoint...

... information retrieval requirements. If your organization is looking for a fully configurable and scalable search solution, FS4SP may be right for you.

Working with Microsoft FAST Search Server 2010 for SharePoint provides a thorough introduction to FS4SP. The book introduces the core concepts of FS4SP in addition to some of the key concepts of enterprise search. It then dives deeper into deployment, operations, ...

Working with Microsoft FAST Search Server 2010 for SharePoint uses a lot of XML, ...