全部版块 我的主页
论坛 数据科学与人工智能 数据分析与数据科学 数据分析与数据挖掘
2306 0
2014-12-10

Text Mining in WEKA Cookbook


In this page I intend to provide useful hints and tips (recipes) for working with text data in WEKA. The information is organized as a list of blog posts and references, plus additional material like code and text collections.
I suggest to read my following posts on text classification with WEKA in the publication order:
I have some other posts on WEKA, like the following ones:
All my posts related to WEKA can be found using the label WEKA.
Interesting references for working with WEKA include:
  • Use WEKA in your Java code provides an excelent introduction to how to use the classes Instances, Filter, Classifier, Clusteres, Evaluation and AttributeSelection, in your own code.
  • WEKA programmatic use describes the learning process life-cycle and, more importantly, it explains how to deal with attributes in your Java code.
  • Text Categorization with WEKA deals with transforming a directory structure of classes (directories) and documents (inside those directories) into ARFF format for further processing. The code is available at ARFF files from Text Collections.
For testing your classifiers and integrating WEKA in your own code, I provide the following stuff:
You will find most of this stuff at my tmweka Github repository.

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

相关推荐
栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群