Data Mining Concepts and Techniques phần 4 potx
... include data cube–based data aggregation and attribute- oriented induction. From a data analysis point of view, data generalization is a form of descriptive data mining. Descriptive data mining ... primitive-level data. For instance, if “IBM-ThinkPad-R40/P4M” or “Symantec-Norton-Antivirus-2003” each 212 Chapter 4 Data Cube Computation and Data Generalization Example...
Ngày tải lên: 08/08/2014, 18:22
... Statistical Data Mining 666 11.3.3 Visual and Audio Data Mining 667 11.3 .4 Data Mining and Collaborative Filtering 670 11 .4 Social Impacts of Data Mining 675 11 .4. 1 Ubiquitous and Invisible Data Mining ... Time-Series, and Sequence Data 46 7 8.1 Mining Data Streams 46 8 8.1.1 Methodologies for Stream Data Processing and Stream Data Systems 46 9...
Ngày tải lên: 08/08/2014, 18:22
... substructures. 9. Metadata mining. Metadata are data about data. Metadata provide semi-structured data about unstructured data, ranging from text and Web data to multimedia data- bases. It is useful for data ... caffeine and thesal in Figure 9. 14( a) and 9. 14( b) will be good matches. If we relax the query further, the struc- ture in Figure 9. 14( c) could also be an answ...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 2 ppsx
... 97 2.7 Summary Data preprocessing is an important issue for both data warehousing and data mining, as real-world data tend to be incomplete, noisy, and inconsistent. Data preprocessing includes data cleaning, ... for smeared data. 2.3 Data Cleaning 63 Sorted data for price (in dollars): 4, 8, 15, 21, 21, 24, 25, 28, 34 Partition into (equal-frequency) bins: Bin 1: 4...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 3 docx
... Generalization b 3 b 2 b 1 b 0 B C c 3 c 2 c 1 61 45 29 62 46 30 63 47 31 64 48 32 c 0 a 0 a 1 A a 2 a 3 1 5 9 13 14 15 16 2 3 4 28 44 60 24 40 56 20 36 52 Figure 4. 3 A 3-D array for the dimensions A, B, and C, organized into 64 chunks. Each chunk ... processing, and data mining. We also introduce on-line analytical mining (OLAM), a powerful paradigm...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 5 ppt
... is Gini income ∈ {low,medium} (D) = 10 14 Gini(D 1 ) + 4 14 Gini(D 2 ) = 10 14 1− 6 10 2 − 4 10 2 + 4 14 1− 1 4 2 − 3 4 2 = 0 .45 0 = Gini income ∈ {high} (D). Similarly, ... income, we first use Equation (6.5) to obtain SplitInfo A (D) = − 4 14 ×log 2 4 14 − 6 14 ×log 2 6 14 − 4 14 ×log 2 4 14 . = 0.926. From Example 6.1, we...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 6 ppt
... functions (Hanson and Burr [HB88]), dynamic adjustment of the network topology (Me´zard and Nadal [MN89], Fahlman and Lebiere [FL90], Le Cun, Denker, and Solla [LDS90], and Harp, Samad, and Guha [HSG90] ), and ... in Cooper and Herskovits [CH92], Buntine [Bun 94] , and Heckerman, Geiger, and Chick- ering [HGC95]. Algorithms for inference on belief networks can be found in...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 7 ppsx
... efficiently. 8 Mining Stream, Time-Series, and Sequence Data Our previous chapters introduced the basic concepts and techniques of data mining. The techniques studied, however, were for simple and structured ... structured data sets, such as data in relational databases, transactional databases, and data warehouses. The growth of data in various complex forms (e.g....
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 9 pot
... multimedia data mining focuses on image data mining. Mining text data and mining the World Wide Web are studied in the two subsequent 638 Chapter 10 Mining Object, Spatial, Multimedia, Text, and Web Data where ... closely linked to image analysis and scientific data mining, and thus many image analysis techniques and scien- tific data analysis methods can be ap...
Ngày tải lên: 08/08/2014, 18:22
Data Mining Concepts and Techniques phần 10 pot
... constraint-based mining) , the integration of data mining with data warehousing and database systems, the standardization of data mining languages, visualization methods, and new meth- ods for handling ... in time-series databases. In Proc. 19 94 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD’ 94) , pages 41 9 42 9, Minneapolis, MN, May 19 94. [FS93] U. Fayyad and P. S...
Ngày tải lên: 08/08/2014, 18:22