... Reference Data in Enterprise Databases: Binding Corporate Data to the Wider World
Malcolm Chisholm
Data Mining: Concepts and Techniques
Jiawei Han and Micheline Kamber
Understanding SQL and Java ... Statistical Data Mining 666
11.3.3 Visual and Audio Data Mining 667
11.3.4 Data Mining and Collaborative Filtering 670
11.4 Social Impacts of Data Mining 675
11.4.1 Ubiquitous and Invisible Data Mining ... object-relational databases and
specific application-oriented databases, such as spatial databases, time-series databases,
text databases, and multimedia databases. The challenges and techniques of mining...
... the original data.
PCA is computationally inexpensive, can be applied to ordered and unordered
attributes, and can handle sparse data and skewed data. Multidimensional data
of more than two dimensions ... (inclusive).
2.3 Data Cleaning 65
2.3.3 Data Cleaning as a Process
Missing values, noise, and inconsistencies contribute to inaccurate data. So far, we have
looked at techniques for handling missing data and ... 97
2.7 Summary
Data preprocessing is an important issue for both data warehousing and data mining,
as real-world data tend to be incomplete, noisy, and inconsistent. Data preprocessing
includes data cleaning,...
... processing, and data
mining. We also introduce on-line analytical mining (OLAM), a powerful paradigm that
integrates OLAP with data mining technology.
3.5.1 Data Warehouse Usage
Data warehouses and data ... summarized data in a data warehouse sets a solid foundation for successful data
mining.
Moreover, we also believe that data mining should be a human-centered process.
Rather than asking a data mining ... Warehouse and OLAP Technology: An Overview
3.5 From Data Warehousing to Data Mining
“How do data warehousing and OLAP relate to data mining?” In this section, we study the
usage of data warehousing...
... include data cube–based data aggregation and attribute-
oriented induction.
From a data analysis point of view, data generalization is a form of descriptive data
mining. Descriptive data mining ... Cercone, and Han [CCH91] and
further extended by Han, Cai, and Cercone [HCC93], Han and Fu [HF96], Carter and
Hamilton [CH98], and Han, Nishio, Kawano, and Wang [HNKW98].
4.3 Attribute-Oriented Induction—An ... mining describes data in a concise and summarative manner
and presents interesting general properties of the data. This is different from predic-
tive data mining, which analyzes data in order to...
... R1,
R1: IF age = youth AND student = yes THEN buys
computer = yes.
The “IF”-part (or left-hand side) of a rule is known as the rule antecedent or precondition.
The “THEN”-part (or right-hand side) is the rule ... scalability.
While both SLIQ and SPRINT handle disk-resident data sets that are too large to fit into
memory, the scalability of SLIQ is limited by the use of its memory-resident data structure.
SPRINT removes ... even for real-world data. RainForest has techniques, however, for handling
the case where the AVC-group does not fit in memory. RainForest can use any attribute
selection measure and was shown to...
... functions
(Hanson and Burr [HB88]), dynamic adjustment of the network topology (Mézard
and Nadal [MN89], Fahlman and Lebiere [FL90], Le Cun, Denker, and Solla [LDS90],
and Harp, Samad, and Guha ... data in preparation for classification and prediction can involve
data cleaning to reduce noise or handle missing values, relevance analysis to remove
irrelevant or redundant attributes, and data ... described in
Preparata and Shamos [PS85]. References on case-based reasoning (CBR) include the
texts Riesbeck and Schank [RS89] and Kolodner [Kol93], as well as Leake [Lea96] and
Aamodt and Plaza [AP94]....
... faster than the
Basic ODM Concepts 1-1
1 Basic ODM Concepts
Oracle9i Data Mining (ODM) embeds data mining within the Oracle9i database.
The data never leaves the database — the data, data ... SQL/MM for Data Mining. JDM has also influenced these
standards. Oracle9i Data Mining will comply with the JDM standard when that
standard is published.
1.2.2 Data Mining Server
The Data Mining ... main components:
■ Oracle9i Data Mining API
■ Data Mining Server (DMS)
1.2.1 Oracle9i Data Mining API
The Oracle9i Data Mining API is the component of Oracle9i Data Mining that
allows users to...
... of techniques to
apply in a particular situation depends on the nature of the data mining task,
the nature of the available data, and the skills and preferences of the data
miner.
Data mining ... By data mining, of
course!
How Data Mining Was Applied
Most data mining methods learn by example. The neural network or decision
tree generator or what have you is fed thousands and thousands ... that, on a technical level, the data mining effort is working and
the data is reasonably accurate. This can be quite comforting. If the data and
the data mining techniques applied to it are powerful...