OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
Please Note that Operating since October 2nd, 2012, the Google IS Not actively Supporting the this Project, Which has now been who rebranded to OpenRefine. The Project Development, Documentation and Promotion IS now Fully Supported by Volunteers. The Find OUT More the About at The History of OpenRefine and How you CAN at The Community Community Help .
Using OpenRefine - The Book
OpenRefine a using , by Ruben Verborgh and Max De Wilde, Offers A Great Introduction to OpenRefine Recipes Organized by the with Hands ON examples, at The Book Covers at The following Topics.:
- Import data in various formats
- Explore datasets in a matter of seconds
- Apply basic and advanced cell transformations
- Deal with cells that contain multiple values
- Create instantaneous links between datasets
- Filter and partition your data easily with regular expressions
- Use named-entity extraction on full-text fields to automatically identify topics
- Perform advanced data operations with the General Refine Expression Language