[下载]Data Mining: Ebook and Software

hanszhu

12138

收藏 2005-01-11

Data Mining: Concepts and Techniques

Jiawei Han and Micheline Kamber, Simon Fraser University

Note: This manuscript is based on a forthcoming book by Jiawei Han and Micheline Kamber, c2000 (c) Morgan Kaufmann Publishers.

------------------------------------------------

https://bbs.pinggu.org/thread-28773-1-1.html

[此贴子已经被作者于2006-1-11 12:15:39编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

全部回复

hanszhu

2005-1-11 07:44:00

The book is organized as follows.

Chapter 1 provides an introduction to the multidisciplinary field of data mining. It discusses the evolutionary path of database technology which led up to the need for data mining, and the importance of its application potential. The basic architecture of data mining systems is described, and a brief introduction to the concepts of database systems and data warehouses is given. A detailed classification of data mining tasks is presented, based on the different kinds of knowledge to be mined. A classification of data mining systems is presented, and major challenges in the field are discussed.

Chapter 2 is an introduction to data warehouses and OLAP (On-Line Analytical Processing). Topics include the concept of data warehouses and multidimensional databases, the construction of data cubes, the implementation of on-line analytical processing, and the relationship between data warehousing and data mining.

Chapter 3 describes techniques for preprocessing the data prior to mining. Methods of data cleaning, data integration and transformation, and data reduction are discussed, including the use of concept hierarchies for dynamic and static discretization. The automatic generation of concept hierarchies is also described.

Chapter 4 introduces the primitives of data mining which define the specification of a data mining task. It describes a data mining query language (DMQL), and provides examples of data mining queries. Other topics include the construction of graphical user interfaces, and the specification and manipulation of concept hierarchies.

Chapter 5 describes techniques for concept description, including characterization and discrimination. An attribute-oriented generalization technique is introduced, as well as its different implementations including a generalized relation technique and a multidimensional data cube technique. Several forms of knowledge presentation and visualization are illustrated. Relevance analysis is discussed. Methods for class comparison at multiple abstraction levels, and methods for the extraction of characteristic rules and discriminant rules with interestingness measurements are presented. In addition, statistical measures for descriptive mining are discussed.

Chapter 6 presents methods for mining association rules in transaction databases as well as relational databases and data warehouses. It includes a classification of association rules, a presentation of the basic Apriori algorithm and its variations, and techniques for mining multiple-level association rules, multidimensional association rules, quantitative association rules, and correlation rules. Strategies for finding interesting rules by constraint-based mining and the use of interestingness measures to focus the rule search are also described.

Chapter 7 describes methods for data classification and predictive modeling. Major methods of classification and prediction are explained, including decision tree induction, Bayesian classification, the neural network technique of backpropagation, k-nearest neighbor classifiers, case-based reasoning, genetic algorithms, rough set theory, and fuzzy set approaches. Association-based classification, which applies association rule mining to the problem of classification, is presented. Methods of regression are introduced, and issues regarding classifier accuracy are discussed.

Chapter 8 describes methods of clustering analysis. It first introduces the concept of data clustering and then presents several major data clustering approaches, including partition-based clustering, hierarchical clustering, and model-based clustering. Methods for clustering continuous data, discrete data, and data in multidimensional data cubes are presented. The scalability of clustering algorithms is discussed in detail.

Chapter 9 discusses methods for data mining in advanced database systems. It includes data mining in object-oriented databases, spatial databases, text databases, multimedia databases, active databases, temporal databases, heterogeneous and legacy databases, and resource and knowledge discovery in the Internet information base.

Finally, in Chapter 10, we summarize the concepts presented in this book and discuss applications of data mining and some challenging research issues.

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-1-11 07:46:00

<<Data Mining: Concepts and Techniques>>

Preface

Our capabilities of both generating and collecting data have been increasing rapidly in the last several decades. Contributing factors include the widespread use of bar codes for most commercial products, the computerization of many business, scientific and government transactions and managements, and advances in data collection tools ranging from scanned texture and image platforms, to on-line instrumentation in manufacturing and shopping, and to satellite remote sensing systems. In addition, popular use of the World Wide Web as a global information system has flooded us with a tremendous amount of data and information. This explosive growth in stored data has generated an urgent need for new techniques and automated tools that can intelligently assist us in transforming the vast amounts of data into useful information and knowledge.

This book explores the concepts and techniques of data mining, a promising andourishing frontier in database systems and new database applications. Data mining, also popularly referred to as knowledge discovery in databases (KDD), is the automated or convenient extraction of patterns representing knowledge implicitly stored in large databases, data warehouses, and other massive information repositories.

Data mining is a multidisciplinary field, drawing work from areas including database technology, artificial intelligence, machine learning, neural networks, statistics, pattern recognition, knowledge based systems, knowledge acquisition, information retrieval, high performance computing, and data visualization. We present the material in this book from a database perspective. That is, we focus on issues relating to the feasibility, usefulness, efficiency, and scalability of techniques for the discovery of patterns hidden in large databases. As a result, this book is not intended as an introduction to database systems, machine learning, or statistics, etc., although we do provide the background necessary in these areas in order to facilitate the reader's comprehension of their respective roles in data mining. Rather, the book is a comprehensive introduction to data mining, presented with database issues in focus. It should be useful for computing science students, application developers, and business professionals, as well as researchers involved in any of the disciplines listed above.

Data mining emerged during the late 1980's, has made great strides during the 1990's, and is expected to continue toourish into the new millennium. This book presents an overall picture of the field from a database researcher's point of view, introducing interesting data mining techniques and systems, and discussing applications and research directions. An important motivation for writing this book was the need to build an organized framework for the study of data mining | a challenging task owing to the extensive multidisciplinary nature of this fast developing field. We hope that this book will encourage people with different backgrounds and experiences to exchange their views regarding data mining so as to contribute towards the further promotion and shaping of this exciting and dynamic field.

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

fi03xyc

2005-1-11 13:04:00

hao

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

sailjeff

2005-1-11 13:05:00

thanks a lot

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

黑桃皇后

2005-1-11 20:17:00

好，

[em07][em07]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

点击查看更多内容…

guoguo99

2005-1-12 09:51:00

好东西, 支持一下

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

zwen

2005-1-12 22:58:00

多谢了

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

xuqifa1975

2005-1-22 17:55:00

谢谢

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

winslow

2005-5-8 02:49:00

it is me, winslow.

you know i like dm. could you let me download the book?

i can pay you later.

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-5-8 09:23:00

[推荐]

Mining the Web: Discovering Knowledge from Hypertext Data

[此贴子已经被作者于2006-1-11 12:17:23编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-5-8 09:48:00

[下载]Data Mining: An Overview from Database Perspective (1997)

Ming-Syan Chen, Jiawei Han, Philip S. Yu

Abstract: Mining information and knowledge from large databases has been recognized by many researchers as a key research topic in database systems and machine learning, and by many industrial companies as an important area with an opportunity of major revenues. Researchers in many di#erent #elds have shown great interest in data mining. Several emerging applications in information providing services, such as data warehousing and on-line services over the Internet, also call for various data mining

14218.rar
大小:(850.9 KB)

只需: 1 个论坛币马上下载

本附件包括：

Data Mining, An Overview From Database Perspective.pdf

[此贴子已经被作者于2005-5-8 13:15:41编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-5-8 10:42:00

[下载]

Data Mining: Concepts, Models, Methods, and Algorithms Mehmed Kantardzic ISBN: 0-471-22852-4 Paperback 360 pages October 2002, Wiley-IEEE Press

CDN $96.99

Data mining describes the often complex and sophisticated tools used in automatic data analysis such as analyzing a customer's previous buying habits
Emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning
Describes representative state-of-the-art methods and algorithms originating from different disciplines
Offers guidance on how and when to use a particular software tool from among the hundreds offered when faced with a data set to mine
A Wiley-IEEE Press Publication
To view the solutions manual, visit ftp://ftp.wiley.com/public/sci_tech_med/data_mining/

“...clear and well understandable...recommended as basic guidance...practitioners will profit from the author's long experience..." (Zentralblatt Math, Vol. 1027, 2004)

“...reviews state-of-the-art techniques for analyzing enormous quantities of raw data...” (Quarterly of Applied Mathematics, Vol. LXI, No. 3, September 2003)

"…this is a comprehensive textbook that describes the process and methodologies of data mining in an unbiased manner…serves as an excellent starting point for anyone wishing to learn about data mining.” (Journal of Proteome Research, May/ June 2003)

"...a valuable book.... I truly enjoyed reading the book and I am glad to recommend it to anyone working in this fascinating field." (IIE Transactions)

"...detailed, well illustrated, and easy to understand...comprehensive…a good book..." (Mathematical Reviews 2003h)

"...this is probably the first data-mining book that I would select from my bookshelf as reading material for a statistician..." (Technometrics, Vol. 45, No. 3, August 2003)

14225.rar
大小:(8.39 MB)

只需: 25 个论坛币马上下载

本附件包括：

John.Wiley.And.Sons.Data.Mining-Concepts.Models.Methods.and.Algorithms.chm

[UserName=winslow][/UserName]

[此贴子已经被作者于2005-5-8 21:10:52编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-5-8 11:40:00

[下载]Managing Data Mining Technologies in Organizations

Managing Data Mining Technologies in Organizations: Techniques and Applications
by Parag Pendharkar (ed)	ISBN:1591400570
Idea Group Publishing © 2003 (288 pages)
This book details the state-of-the-art data mining research, which reflects in a potpourri of chapters that demonstrate diverse use of techniques and their applications for data mining.

14227.rar
大小:(4.86 MB)

只需: 25 个论坛币马上下载

本附件包括：

Idea Group - Managing Data Mining Technologies in Organizations - Techniques and Applications.chm

[UserName=winslow][/UserName]

[此贴子已经被作者于2005-5-8 21:12:50编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-5-8 13:55:00

[下载]Wincross(Non-English Version)

14236.zip
大小:(1.23 MB)

只需: 20 个论坛币马上下载

本附件包括：

CIARE10S.CR_
CIARE101.CR_
D54CORRE.CR_
D54IDEAL.CR_
DYNAU54S.CR_
DYNAUD54.CR_
FLAT.CR_
KRBESSEL.DA_
LASTFORM.IC_
LASTSYST.IC_
LEGGIMI.TXT
MIDEXAMP.CR_
setup1.ex_
SETUP.EXE
SETUP.LST
SETUPKIT.DL_
TEST1.CS_
TEST2.CS_
TEST3.CS_
TEST4.CS_
TEST5.CS_
TEST6.CS_
THREED.VB_
TWTEXAMP.CR_
VBRUN300.DL_
VER.DL_
wincross.ex_
WINCROSS.HL_
WOFEXAMP.CR_
8OHM.CR_
CIA101IS.CR_

[此贴子已经被作者于2005-5-8 19:36:11编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

jerry

2005-5-10 10:49:00

hao

day day Up up

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

chenchen1

2005-5-12 20:53:00

good job

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

jacksn

2005-5-13 19:57:00

哎，楼主只想赚钱啊，可惜好东西买不起

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

Oswin

2005-5-15 11:43:00

thank you very much for sharing.

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-6-7 08:04:00

[下载]Principles of Data Mining

Principles of Data Mining David J. Hand, Heikki Mannila and Padhraic Smyth

	Full Contents

	List of Tables

	List of Figures

	Series Foreword

	Preface Sample Chapter - Download PDF (64 KB)

1	Introduction Sample Chapter - Download PDF (279 KB)

2	Measurement and Data

3	Visualizing and Exploring Data

4	Data Analysis and Uncertainty

5	A Systematic Overview of Data Mining Algorithms

6	Models and Patterns

7	Score Functions for Data Mining Algorithms

8	Search and Optimization Methods

9	Descriptive Modeling

10	Predictive Modeling for Classification

11	Predictive Modeling for Regression

12	Data Organization and Databases

13	Finding Patterns and Rule

14	Retrieval by Content

	Appendix: Random Variables

	References

	Index Sample Chapter - Download PDF (123 KB)

16531.rar
大小:(3.23 MB)

只需: 25 个论坛币马上下载

本附件包括：

MIT Press - Principles of Data Mining(1).pdf

[此贴子已经被作者于2005-6-7 8:23:40编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

sunww1999

2005-7-28 16:23:00

很想要第一本可是没有钱,努力赚钱中!!!

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

shanliang

2005-8-5 07:39:00

多谢了！要是钱要得再少点就好了！

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

hanszhu

2005-8-6 07:21:00

[下载]Jack E. Olsen.Data Quality.The Accuracy Dimension

Data Quality : The Accuracy Dimension (The Morgan Kaufmann Series in Data Management Systems) (Paperback) by Jack E. Olson

Concepts and technical approaches for analyzing and improving the usefullness of source data are presented in a well organized and logical sequence. Not quite a roadmap/algorithm for achieving data quality; it's balanced a little more on the conceptual side. The techniques and concepts presented are ones you _will_ want to begin using if you have any data quality issues in your source data (and who doesn't), I guarantee that. As a working ETL developer referencing disparate, low quality data, this book has had an impact on our approach, our results, and our project. Short and to the point, it's an easy study also!

Jack Olson coined the term "data profiling" and essentially founded this important new field in the area of data quality assessment. His revolutionary techniques, outlined in this book, can provide professionals with an important new set of tools for analyzing data quality. This is a must read for anyone working in the data quality field today. I also recommend it for people in related fields such as data warehousing, Enterprise Application Integration, and database design. With so many "me too" books in the computer field, it's a real joy to find a book that really does break new ground.

22000.rar
大小:(3.74 MB)

只需: 50 个论坛币马上下载

本附件包括：

Jack E. Olsen.Data Quality.The Accuracy Dimension.chm

[此贴子已经被作者于2005-8-6 7:24:01编辑过]

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

iphavor

2005-8-6 15:18:00

see seee

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

pangkf79

2006-1-3 16:23:00

等我储存够了钱就立刻来买。

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

yosiyosi8

2006-1-4 11:49:00

Thanks, good job for us.

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

QQ503

2006-4-27 11:04:00

没钱啊！兄弟

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

pla748

2006-7-5 00:15:00

不错的东西啊！

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

qingdaonyc

2006-7-17 03:43:00

Mathematical Statistics with Applications (Sixth Edition), 2002, ??

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

QQ503

2006-7-17 16:45:00

看不懂！！

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

[推荐]

扫码加我 拉你入群

[下载]Data Mining: An Overview from Database Perspective (1997)

扫码加我 拉你入群

[下载]

扫码加我 拉你入群

[下载]Managing Data Mining Technologies in Organizations

扫码加我 拉你入群

[下载]Wincross(Non-English Version)

扫码加我 拉你入群

hao

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

[下载]Principles of Data Mining

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

[下载]Jack E. Olsen.Data Quality.The Accuracy Dimension

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

扫码加我 拉你入群

分享

扫码加好友，拉您进群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群

扫码加我拉你入群