2009-11-22
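A truly classic textbook.
I've seen other people charge a lot for it, often five forum coins.
I'm offering this book here for just one forum coin.
I hope for everyone's support.

A special note: very often data mining fails to find the expected rules simply because the data was not prepared well enough, so even people with extensive mining experience are advised to reread this book.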
Attachments

Data Preparation.pdf

Size: 3.22 MB

Price: 1 forum coin

All replies
2009-11-22 10:10:26
What This Book Is About
This book is about what to do with data to get the most out of it. There is a lot more to that
statement than first meets the eye.
Much information is available today about data warehouses, data mining, KDD, OLTP,
OLAP, and a whole alphabet soup of other acronyms that describe techniques and
methods of storing, accessing, visualizing, and using data. There are books and
magazines about building models for making predictions of all types—fraud, marketing,
new customers, consumer demand, economic statistics, stock movement, option prices,
weather, sociological behavior, traffic demand, resource needs, and many more.
In order to use the techniques, or make the predictions, industry professionals almost
universally agree that one of the most important parts of any such project, and one of the
most time-consuming and difficult, is data preparation. Unfortunately, data preparation
has been much like the weather—as the old aphorism has it, “Everyone talks about it, but
no one does anything about it.” This book takes a detailed look at the problems in
preparing data, the solutions, and how to use the solutions to get the most out of the
data—whatever you want to use it for. This book tells you what can be done about it,
exactly how it can be done, and what it achieves, and puts a powerful kit of tools directly in
your hands that allows you to do it.
How important is adequate data preparation? After finding the right problem to solve, data
preparation is often the key to solving the problem. It can easily be the difference between
success and failure, between useable insights and incomprehensible murk, between
worthwhile predictions and useless guesses.
For instance, in one case data carefully prepared for warehousing proved useless for
modeling. The preparation for warehousing had destroyed the useable information content
for the needed mining project. Preparing the data for mining, rather than warehousing,
produced a 550% improvement in model accuracy. In another case, a commercial baker
achieved a bottom-line improvement approaching $1 million by using data prepared with the
techniques described in this book instead of previous approaches.

2010-3-31 10:13:02
Thanks for sharing.

2010-4-14 14:17:06
Thanks for sharing, OP. I'll grab it and have a look first. Much appreciated~

2010-4-16 16:57:25
Thanks, I'll download it and take a look.

2010-5-7 10:05:15
Thanks, OP~~~