R is the most popular overall tool among data miners, although Python usage is growing faster. RapidMiner continues to be the most popular suite for data mining/data science. Hadoop/Big Data tool usage grew to 29%, propelled by 3x growth in Spark. Other tools with strong growth include H2O (0xdata), Actian, MLlib, and Alteryx.
By Gregory Piatetsky, KDnuggets.
| What Analytics, Big Data, Data Mining, Data Science software did you use in the past 12 months for a real project? [2759 voters] | |
| Legend: Red: Free/Open Source tools Green: Commercial tools Fuchsia: Hadoop/Big Data tools |
|
| R (1293), 3.6% alone |
|
| RapidMiner (870), 13.7% alone |
|
| SQL (853), 0% alone |
|
| Python (837), 0% alone |
|
| Excel (631), 0% alone |
|
| KNIME (553), 6.7% alone |
|
| Hadoop (507), 0% alone |
|
| Tableau (341), 0% alone |
|
| SAS base (313), 0.6% alone |
|
| Spark (311), 0% alone |
|
| Weka (310), 0% alone |
|
| SAS Enterprise Miner (302), 3.6% alone |
|
| Microsoft SQL Server (268), 0% alone |
|
| MATLAB (243), 0% alone |
|
| scikit-learn (229), 0% alone |
|
| Unix shell/awk/gawk (221), 0% alone |
|
| IBM SPSS Statistics (213), 0% alone |
|
| IBM SPSS Modeler (197), 7.1% alone |
|
| Alteryx (155), 39.4% alone |
|
| Pig (150), 0% alone |
|
| Other programming languages (140), 0% alone |
|
| Other free analytics/data mining tools (138), 0% alone |
|
| Other Hadoop/HDFS-based tools (125), 0% alone |
|
| TIBCO Spotfire (119), 11.8% alone |
|
| Rattle (117), 0.9% alone |
|
| QlikView (116), 0% alone |
|
| Revolution Analytics (now part of Microsoft) (109), 0% alone |
|
| Microsoft Azure ML (102), 1.0% alone |
|
| Microsoft Power BI (98), 0% alone |
|
| MLlib (91), 0% alone |
|
| JMP (86), 0% alone |
|
| SAP (including former KXEN) (82), 26.8% alone |
|
| Perl (79), 0% alone |
|
| Mahout (76), 0% alone |
|
| Pentaho (74), 0% alone |
|
| Other paid analytics/data mining/data science software (66), 6.1% alone |
|
| Salford SPM/CART/Random Forests/MARS/TreeNet (64), 43.8% alone |
|
| GNU Octave (64), 0% alone |
|
| IBM Watson Analytics (57), 0% alone |
|
| Ayasdi (56), 10.7% alone |
|
| Dataiku (56), 7.1% alone |
|
| Actian (56), 7.1% alone |
|
| H2O (0xdata) (55), 0% alone |
|
| Orange (53), 0% alone |
|
| Mathematica (52), 0% alone |
|
| IBM Cognos (51), 0% alone |
|
| Dell (including StatSoft) (47), 19.1% alone |
|
| XLSTAT for Excel (42), 0% alone |
|
| Stata (36), 2.8% alone |
|
| Lexalytics (35), 28.6% alone |
|
| Vowpal Wabbit (35), 0% alone |
|
| C4.5/C5.0/See5 (35), 0% alone |
|
| Julia (31), 3.2% alone |
|
| Splunk/Hunk (30), 0% alone |
|
| Datameer (26), 0% alone |
|
| MicroStrategy (24), 0% alone |
|
| BigML (23), 0% alone |
|
| Zementis (22), 31.8% alone |
|
| Miner3D (22), 9.1% alone |
|
| Oracle Data Miner (22), 0% alone |
|
| Amazon Machine Learning (20), 5.0% alone |
|
| F# (18), 0% alone |
|
| BayesiaLab (16), 12.5% alone |
|
| Dato (former Graphlab) (15), 6.7% alone |
|
| Clojure (13), 0% alone |
|
| Alpine Data Labs (13), 0% alone |
|
| Angoss (11), 18.2% alone |
|
| Lavastorm (10), 0% alone |
|
| Lisp (10), 0% alone |
|
| Predixion Software (10), 0% alone |
|
| WordStat (9), 0% alone |
|
| Megaputer Polyanalyst/TextAnalyst (8), 0% alone |
|
| WPS: World Programming System (7), 0% alone |
|
| GoodData (6), 0% alone |
|
| MetaMind (5), 0% alone |
|
| SiSense (5), 0% alone |
|
| RapidInsight/Veera (5), 0% alone |
|
| Skytree (3), 0% alone |
|
| Birst (2), 0% alone |
|
| Ontotext (1), 0% alone |
|
| FICO Model Builder (1), 0% alone |
|
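
The counts above are raw votes, so it can help to convert them into shares of the 2759 voters. The sketch below does that for the top five tools and also estimates how many voters used a tool by itself. It assumes that "% alone" means the percentage of a tool's voters who reported using only that tool, which the table does not state explicitly. The names `share` and `alone_voters` are illustrative, not part of the poll.

```python
TOTAL_VOTERS = 2759  # total number of poll voters, from the table header

# Vote counts copied from the first five rows of the table.
votes = {"R": 1293, "RapidMiner": 870, "SQL": 853, "Python": 837, "Excel": 631}


def share(count, total=TOTAL_VOTERS):
    """Percentage of all voters who selected a tool, rounded to one decimal."""
    return round(100.0 * count / total, 1)


def alone_voters(count, pct_alone):
    """Estimated number of voters who used only this tool.

    Assumption: pct_alone is a percentage of the tool's own voters.
    """
    return round(count * pct_alone / 100.0)


shares = {tool: share(n) for tool, n in votes.items()}
for tool, pct in shares.items():
    print(f"{tool}: {pct}% of voters")

# Alteryx row: 155 votes, 39.4% alone.
print("Alteryx-only voters (estimate):", alone_voters(155, 39.4))
```

Because voters could select several tools, these shares are not expected to sum to 100%.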