• No results found

Data Mining and Statistics for Decision Making. Wiley Series in Computational Statistics

N/A
N/A
Protected

Academic year: 2021

Share "Data Mining and Statistics for Decision Making. Wiley Series in Computational Statistics"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

Brochure

More information from http://www.researchandmarkets.com/reports/2171080/

Data Mining and Statistics for Decision Making. Wiley Series in

Computational Statistics

Description: Data Mining and Statistics for Decision Making

Stéphane Tufféry, Universitie of Paris–Dauphine, France

Data mining is the process of automatically searching large volumes of data for models and patterns using computational techniques from statistics, machine learning and information theory; it is the ideal tool for such an extraction of knowledge. Data mining is usually associated with a business or an organization's need to identify trends and profiles, allowing, for example, retailers to discover patterns on which to base

marketing objectives.

This book looks at both classical and modern methods of data mining, such as clustering, discriminate analysis, decision trees, neural networks and support vector machines along with illustrative examples throughout the book to explain the theory of these models. Recent methods such as bagging and boosting, decision trees, neural networks, support vector machines and genetic algorithm are also discussed along with their advantages and disadvantages.

Key Features:

- Presents a comprehensive introduction to all techniques used in data mining and statistical learning. - Includes coverage of data mining with R as well as a thorough comparison of the two industry leaders, SAS and SPSS.

- Gives practical tips for data mining implementation as well as the latest techniques and state of the art theory.

- Looks at a range of methods, tools and applications, such as scoring to web mining and text mining and presents their advantages and disadvantages.

- Supported by an accompanying website hosting datasets and user analysis.

Business intelligence analysts and statisticians, compliance and financial experts in both commercial and government organizations across all industry sectors will benefit from this book.

Contents: Preface. Foreword.

Foreword from the French language edition. List of trademarks.

1. Oveview of data mining. 1.1 What is data mining?

1.2 What is data mining used for? 1.3 Data mining and statistics.

1.4 Data mining and information technology. 1.5 Data mining and protection of persona; data.

(2)

1.6 Implementation of data mining.

2. The development of a data mining study. 2.1 Defining the aims.

2.2 Listing the existing data. 2.3 Collecting the data.

2.4 Exploring and preparing the data. 2.5 Population segmentation.

2.6 Drawing up and validating predictive models.

2.7 Synthesizing predictive models of different segments. 2.8 Iteration of the preceding steps.

2.9 Deploying the models. 2.10 Training the model users. 2.11 Monitoring the models. 2.12 Enriching the models. 2.13 Remarks.

2.14 Life cycle of a model. 2.15 Costs of a pilot project.

3. Data Exploration and preparation. 3.1 The different types of data.

3.2 Examining the distribution of variables. 3.3 Detection of rare or missing values. 3.4 Detection of aberrant values. 3.5 Detection of extreme values. 3.6 Tests of normality.

3.7 Homoscedasticity and heteroscedasticity. 3.8 Detection of the most discriminating variables. 3.9 Transformation of variables.

3.10 Choosing ranges of values of binned variables. 3.11 Creating new variables.

3.12 Detecting interactions. 3.13 Automatic variable selection.

(3)

3.14 Detection of collinearity. 3.15 Sampling.

4. Using commercial data.

4.1 Data used in commercial applications. 4.2 Special data.

4.3 Data used by business sector. 5. Statistical and data mining software.

5.1 Types of data mining and statistical software. 5.2 Essential characteristics of the software. 5.3 The main software packages.

5.4 Comparison of R, SAS and IBM SPSS. 5.5 How to reduce processing time. 6. An outline of data mining methods. 6.1 Classification of the methods. 6.2 Comparison of the methods. 7. Factor analysis.

7.1 Principal component analysis.

7.2 Variants of principal component analysis. 7.3 Correspondence analysis.

7.4 Multiple correspondence analysis. 8. Neural networks.

8.1 General information on neural networks. 8.2 Structure of a neural network.

8.3 Choosing the learning sample.

8.4 Some empirical rules for network design. 8.5 Data normalization.

8.6 Learning algorithms. 8.7 The main neural networks. 9. Cluster analysis.

9.1 Definition of clustering. 9.2 Applications of clustering. 9.3 Complexity of clustering.

(4)

9.4 Clustering structures.

9.5 Some methodological considerations. 9.6 Comparison of factor analysis and clustering. 9.7 Within–cluster and between–cluster sum of squares. 9.8 Measurements of clustering quality.

9.9 Partitioning methods.

9.10 Agglomerative hierarchical clustering. 9.11 Hybrid clustering methods.

9.12 Neural clustering.

9.13 Clustering by similarity aggregation. 9.14 Clustering of numeric variables. 9.15 Overview of clustering methods. 10. Association analysis.

10.1 Principles. 10.2 Using taxonomy.

10.3 Using supplementary variables. 10.4 Applications.

10.5 Example of use.

11. Classification and prediction methods. 11.1 Introduction.

11.2 Inductive and transductive methods.

11.3 Overview of classification and prediction methods. 11.4 Classification by decision tree.

11.5 Prediction by decision tree.

11.6 Classification by discriminant analysis. 11.7 Prediction by linear regression. 11.8 Classification by logistic regression. 11.9 Developments in logistic regression. 11.10 Bayesian methods.

11.11 Classification and prediction by neural networks. 11.12 Classification by support vector machines.

(5)

11.13 Prediction by genetic algorithms.

11.14 Improving the performance of a predictive model. 11.15 Bootstrapping and ensemble methods.

11.16 Using classification and prediction methods. 12. An application of data mining: scoring.

12.1 The different types of score.

12.2 Using propensity scores and risk scores. 12.3 Methodology.

12.4 Implementing a strategic score. 12.5 Implementing an operational score. 12.6 Scoring solutions used in a business.

12.7 An example of credit scoring (data preparation).

12.8 An example of credit scoring (modeling by logistic regression).

12.9 An example of credit scoring (modeling by DISQUAL discriminant analysis). 12.10 A brief history of credit scoring.

13. Factors for success in a data mining project. 13.1 The subject.

13.2 The people. 13.3 The data. 13.4 The IT systems. 13.5 The business culture.

13.6 Data mining: eight common misconceptions. 13.7 Return on investment.

14. Text mining.

14.1 Definition of text mining. 14.2 Text sources used. 14.3 Using text mining. 14.4 Information retrieval. 14.5 Information extraction. 14.6 Multi–type data mining. 15. Web mining.

(6)

15.2 Global analyses. 15.3 Individual analyses. 15.4 Personal analysis.

Appendix A. Elements of statistics. Appendix B. Further reading. Index.

Ordering: Order Online - http://www.researchandmarkets.com/reports/2171080/

Order by Fax - using the form below

Order by Post - print the order form below and send to Research and Markets,

Guinness Centre, Taylors Lane, Dublin 8, Ireland.

(7)

Page 1 of 2

Fax Order Form

To place an order via fax simply print this form, fill in the information below and fax the completed form to 646-607-1907 (from USA) or +353-1-481-1716 (from Rest of World). If you have any questions please visit

http://www.researchandmarkets.com/contact/

Order Information

Please verify that the product information is correct.

Product Format

Please select the product format and quantity you require:

* Shipping/Handling is only charged once per order.

Contact Information

Please enter all the information below in BLOCK CAPITALS

Product Name: Data Mining and Statistics for Decision Making. Wiley Series in Computational Statistics

Web Address: http://www.researchandmarkets.com/reports/2171080/

Office Code: SC

Quantity

Hard Copy (Hard

Back): USD 110 + USD 29 Shipping/Handling

Title: Mr Mrs Dr Miss Ms Prof

First Name: Last Name:

Email Address: * Job Title: Organisation: Address: City:

Postal / Zip Code: Country:

Phone Number: Fax Number:

(8)

Page 2 of 2

Payment Information

Please indicate the payment method you would like to use by selecting the appropriate box.

Please fax this form to:

(646) 607-1907 or (646) 964-6609 - From USA

+353-1-481-1716 or +353-1-653-1571 - From Rest of World

Pay by credit card: You will receive an email with a link to a secure webpage to enter yourcredit card details.

Pay by check: Please post the check, accompanied by this form, to: Research and Markets,

Guinness Center, Taylors Lane, Dublin 8, Ireland.

Pay by wire transfer: Please transfer funds to:

Account number 833 130 83

Sort code 98-53-30

Swift code ULSBIE2D

IBAN number IE78ULSB98533083313083 Bank Address Ulster Bank,

27-35 Main Street, Blackrock, Co. Dublin, Ireland. If you have a Marketing Code please enter it below:

Marketing Code:

Please note that by ordering from Research and Markets you are agreeing to our Terms and Conditions at http://www.researchandmarkets.com/info/terms.asp

References

Related documents

To estimate the e¤ect of strategic sale on banks’costs, in a regression of the logarithm of banks’ costs on a translog function of banks’ outputs and prices of inputs

March 12 General Spring Faculty Meeting – 3:15 .p.m, Watkins Auditorium, University Center March 13 Promotion and/or Tenure recommendations to Academic Affairs from Deans

The aim of this qualitative study is to analyse the process of imple- mentation and assess the impact of the implementation of a flexible cloud-based project management

The first (second) column shows loan growth impulse responses following a permanent one percentage point increase (decrease) in capital requirements, estimated on banks

Entry level jobs deloitte, entry level jobs 10 dollars an hour, student visa australia work restrictions, online part time jobs in kolkata for school students, entry level

Penyajian data dilakukan oleh peneliti dengan mendeskripsikan kumpulan informasi tersusun dari hasil reduksi data yang telah dianalisis untuk penarikan kesimpulan

For the key aspects of your financial life, Ellison Kibler & Associates can bring you access to the banking convenience of Bank of America and the investment insights of

It consists of several sub-programs: registration and reaction to local residents’ information and denunciations received on pager 002 ‘City without Drugs’; collaboration