全部版块 我的主页
论坛 计量经济学与统计论坛 五区 计量经济学与统计软件 winbugs及其他软件专版
1350 3
2017-04-08
Abstract
Working with large data sets is increasingly common in researchand industry. There are some distributed data analytics solutions likeHadoop, that offer high scalability and fault-tolerance, but they usuallylack a user interface and only developers can exploit their functionalities.In this paper, we present Radoop, an extension for the RapidMinerdata mining tool which provides easy-to-use operators for running distributedprocesses on Hadoop. We describe integration and developmentdetails and provide runtime measurements for several data transformationtasks. We conclude that Radoop is an excellent tool for big dataanalytics and scales well with increasing data set size and the numberof nodes in the cluster.

本帖隐藏的内容



二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

全部回复
2017-4-8 09:59:22
回复看看下
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2017-4-24 11:04:11
have a try
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

2017-5-2 19:41:58
看看内容
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群