
Reserve it at BN.com & pick it up in 60 minutes at your local store.
Enter a zip code
Textbook (Hardcover - 1st ed. 2001. Corr. 3rd printing)
Textbook Information
During the past decade there has been an explosion in computation and information technology. With it has come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics.Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of color graphics. It should be a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees and boostingthe first comprehensive treatment of this topic in any book. Trevor Hastie, Robert Tibshirani, and Jerome Friedman are professors of statistics at Stanford University. They are prominent researchers in this area: Hastie and Tibshirani developed generalized additive models and wrote a popular book of that title. Hastie wrote much of the statistical modeling software in S-PLUS and invented principal curves and surfaces. Tibshirani proposed the Lasso and is co-author of the very successful An Introduction to the Bootstrap. Friedman is the co-inventor of many data-mining tools including CART, MARS, and projection pursuit. FROM THE REVIEWS: TECHNOMETRICS "[This] is a vast and complex book. Generally, it concentrates on explaining why and how the methods work, rather than how to use them. Examples and especially the visualizations are principle features...As a source for the methods of statistical learning...it will probably be a long time before there is a competitor to this book."
Reader Rating:
See Detailed Ratings
December 18, 2005: This book is a very interesting book to learn the main statistical approach of data mining. It's clear and full of examples. If you go a Stanford data mining website you will find all the courses and exercises linked to the book. An important book to have in your own data mining library
Reader Rating:
See Detailed Ratings
September 23, 2002: Nature demands balance in the world. For all the pretty pictures and color graphics the counter entry is a lack of substance. The text is extremely lacking in detail and the problems are poorly posed. The book is "fast and loose". I quote from the preface, "we emphasize the methods and their conceptual underpinnings rather than their theoretical properties," but there is not enough substance to get a good feel for the techniques. I would only recommend this text if you already know all of these techniques and need a general reference.