1. Trang chủ
  2. » Thể loại khác

John wiley sons interscience discovering knowledge in data an introduction to data mining 2005

241 274 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 241
Dung lượng 4,75 MB

Nội dung

DISCOVERING KNOWLEDGE IN DATA An Introduction to Data Mining DANIEL T LAROSE Director of Data Mining Central Connecticut State University A JOHN WILEY & SONS, INC., PUBLICATION DISCOVERING KNOWLEDGE IN DATA DISCOVERING KNOWLEDGE IN DATA An Introduction to Data Mining DANIEL T LAROSE Director of Data Mining Central Connecticut State University A JOHN WILEY & SONS, INC., PUBLICATION Copyright ©2005 by John Wiley & Sons, Inc All rights reserved Published by John Wiley & Sons, Inc., Hoboken, New Jersey Published simultaneously in Canada No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400, fax 978-646-8600, or on the web at www.copyright.com Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008 Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose No warranty may be created or extended by sales representatives or written sales materials The advice and strategies contained herein may not be suitable for your situation You should consult with a professional where appropriate Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages For general information on our other products and services please contact our Customer Care Department within the U.S at 877-762-2974, outside the U.S at 317-572-3993 or fax 317-572-4002 Wiley also publishes its books in a variety of electronic formats Some content that appears in print, however, may not be available in electronic format Library of Congress Cataloging-in-Publication Data: Larose, Daniel T Discovering knowledge in data : an introduction to data mining / Daniel T Larose p cm Includes bibliographical references and index ISBN 0-471-66657-2 (cloth) Data mining I Title QA76.9.D343L38 2005 006.3 12—dc22 2004003680 Printed in the United States of America 10 Dedication To my parents, And their parents, And so on For my children, And their children, And so on 2004 Chantal Larose CONTENTS PREFACE xi INTRODUCTION TO DATA MINING What Is Data Mining? Why Data Mining? Need for Human Direction of Data Mining Cross-Industry Standard Process: CRISP–DM Case Study 1: Analyzing Automobile Warranty Claims: Example of the CRISP–DM Industry Standard Process in Action Fallacies of Data Mining What Tasks Can Data Mining Accomplish? Description Estimation Prediction Classification Clustering Association Case Study 2: Predicting Abnormal Stock Market Returns Using Neural Networks Case Study 3: Mining Association Rules from Legal Databases Case Study 4: Predicting Corporate Bankruptcies Using Decision Trees Case Study 5: Profiling the Tourism Market Using k-Means Clustering Analysis References Exercises 4 10 11 11 12 13 14 16 17 18 19 21 23 24 25 DATA PREPROCESSING 27 Why Do We Need to Preprocess the Data? Data Cleaning Handling Missing Data Identifying Misclassifications Graphical Methods for Identifying Outliers Data Transformation Min–Max Normalization Z-Score Standardization Numerical Methods for Identifying Outliers References Exercises 27 28 30 33 34 35 36 37 38 39 39 vii ... www.ccsu.edu/datamining CHAPTER INTRODUCTION TO DATA MINING WHAT IS DATA MINING? WHY DATA MINING? NEED FOR HUMAN DIRECTION OF DATA MINING CROSS-INDUSTRY STANDARD PROCESS: CRISP–DM CASE STUDY 1: ANALYZING... xi INTRODUCTION TO DATA MINING What Is Data Mining? Why Data Mining? Need for Human Direction of Data Mining Cross-Industry Standard Process: CRISP–DM Case Study 1: Analyzing Automobile Warranty.. .DISCOVERING KNOWLEDGE IN DATA An Introduction to Data Mining DANIEL T LAROSE Director of Data Mining Central Connecticut State University A JOHN WILEY & SONS, INC., PUBLICATION DISCOVERING KNOWLEDGE

Ngày đăng: 23/05/2018, 15:25

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN