Introduction | |
Introduction | p. 3 |
What is Data Mining? | p. 5 |
What is Needed to Do Data Mining | p. 5 |
Business Data Mining | p. 7 |
Data Mining Tools | p. 8 |
Summary | p. 8 |
Data Mining Process | p. 9 |
CRISP-DM | p. 9 |
Business Understanding | p. 11 |
Data Understanding | p. 11 |
Data Preparation | p. 12 |
Modeling | p. 15 |
Evaluation | p. 18 |
Deployment | p. 18 |
SEMMA | p. 19 |
Steps in SEMMA Process | p. 20 |
Example Data Mining Process Application | p. 22 |
Comparison of CRISP & SEMMA | p. 27 |
Handling Data | p. 28 |
Summary | p. 34 |
Data Mining Methods as Tools | |
Memory-Based Reasoning Methods | p. 39 |
Matching | p. 40 |
Weighted Matching | p. 43 |
Distance Minimization | p. 44 |
Software | p. 50 |
Summary | p. 50 |
Job Application Data Set | p. 51 |
Association Rules in Knowledge Discovery | p. 53 |
Market-Basket Analysis | p. 55 |
Market Basket Analysis Benefits | p. 56 |
Demonstration on Small Set of Data | p. 57 |
Real Market Basket Data | p. 59 |
The Counting Method Without Software | p. 62 |
Conclusions | p. 68 |
Fuzzy Sets in Data Mining | p. 69 |
Fuzzy Sets and Decision Trees | p. 71 |
Fuzzy Sets and Ordinal Classification | p. 75 |
Fuzzy Association Rules | p. 79 |
Demonstration Model | p. 80 |
Computational Results | p. 84 |
Testing | p. 84 |
Inferences | p. 85 |
Conclusions | p. 86 |
Rough Sets | p. 87 |
A Brief Theory of Rough Sets | p. 88 |
Information System | p. 88 |
Decision Table | p. 89 |
Some Exemplary Applications of Rough Sets | p. 91 |
Rough Sets Software Tools | p. 93 |
The Process of Conducting Rough Sets Analysis | p. 93 |
Data Pre-Processing | p. 94 |
Data Partitioning | p. 95 |
Discretization | p. 95 |
Reduct Generation | p. 97 |
Rule Generation and Rule Filtering | p. 99 |
Apply the Discretization Cuts to Test Dataset | p. 100 |
Score the Test Dataset on Generated Rule set (and measuring the prediction accuracy) | p. 100 |
Deploying the Rules in a Production System | p. 102 |
A Representative Example | p. 103 |
Conclusion | p. 109 |
Support Vector Machines | p. 111 |
Formal Explanation of SVM | p. 112 |
Primal Form | p. 114 |
Dual Form | p. 114 |
Soft Margin | p. 114 |
Non-linear Classification | p. 115 |
Regression | p. 116 |
Implementation | p. 116 |
Kernel Trick | p. 117 |
Use of SVM - A Process-Based Approach | p. 118 |
Support Vector Machines versus Artificial Neural Networks | p. 121 |
Disadvantages of Support Vector Machines | p. 122 |
Genetic Algorithm Support to Data Mining | p. 125 |
Demonstration of Genetic Algorithm | p. 126 |
Application of Genetic Algorithms in Data Mining | p. 131 |
Summary | p. 132 |
Loan Application Data Set | p. 133 |
Performance Evaluation for Predictive Modeling | p. 137 |
Performance Metrics for Predictive Modeling | p. 137 |
Estimation Methodology for Classification Models | p. 140 |
Simple Split (Holdout) | p. 140 |
The k-Fold Cross Validation | p. 141 |
Bootstrapping and Jackknifing | p. 143 |
Area Under the ROC Curve | p. 144 |
Summary | p. 147 |
Applications | |
Applications of Methods | p. 151 |
Memory-Based Application | p. 151 |
Association Rule Application | p. 153 |
Fuzzy Data Mining | p. 155 |
Rough Set Models | p. 155 |
Support Vector Machine Application | p. 157 |
Genetic Algorithm Applications | p. 158 |
Japanese Credit Screening | p. 158 |
Product Quality Testing Design | p. 159 |
Customer Targeting | p. 159 |
Medical Analysis | p. 160 |
Predicting the Financial Success of Hollywood Movies | p. 162 |
Problem and Data Description | p. 163 |
Comparative Analysis of the Data Mining Methods | p. 165 |
Conclusions | p. 167 |
Bibliography | p. 169 |
Index | p. 177 |
Table of Contents provided by Ingram. All Rights Reserved. |