Introduction
OpenRefine is a tool for working with messy data. It lets you:
- Clean
- Transform from one format into another
- Extend with web services and external data
OpenRefine was formerly known as Freebase Gridworks and then Google Refine, and was supported by Google until 10/02/2012. It was then rebranded as OpenRefine and is now supported by a volunteer open source community on GitHub.
Working with data in a simple tabular format, you can:
- Get an overview of a data set
- Resolve inconsistencies in the formats used in a data set
- Resolve inconsistencies in where data appears
- Resolve inconsistencies in terminology used in the data
- Split data up into more granular parts
- Match local data up to other data sets
- Enhance a data set with data from other sources
Installation Guide
You can download OpenRefine from ‘http://openrefine.org/download.html’
Windows/Linux
- Download the zip file.
- Unzip the downloaded file into any directory of your choice. OpenRefine should run from wherever you put the unzipped folder.
Mac
- Download the ‘dmg’ (disk image) file and open it.
- Drag the OpenRefine application to an appropriate folder.
OpenRefine is a Java application, so you need to have a Java Runtime Environment (JRE) installed.
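To check the Java requirement before launching OpenRefine, a short script along the following lines may help. It is only a sketch: the helper names find_java and java_version_banner are made up for this illustration, and it assumes that Python 3.7 or later is available and that a usable JRE exposes a java executable on the PATH.

```python
import shutil
import subprocess


def find_java():
    """Return the path to a `java` executable on PATH, or None if absent."""
    return shutil.which("java")


def java_version_banner(java_path):
    """Run `java -version` and return whatever it prints, as text.

    The version banner is conventionally written to stderr, so both
    output streams are captured and joined.
    """
    result = subprocess.run(
        [java_path, "-version"],
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit code is reported, not raised
    )
    return (result.stderr + result.stdout).strip()


if __name__ == "__main__":
    java_path = find_java()
    if java_path is None:
        print("No java executable found on PATH; install a JRE first.")
    else:
        print(f"Found java at {java_path}")
        print(java_version_banner(java_path))
```

Finding a java executable is a hint rather than a guarantee. Some systems ship a placeholder java command that only prompts you to install Java, so read the printed banner to confirm that a real version is reported.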
Running OpenRefine
Windows
Linux
Mac
For more details on downloading, installing and running OpenRefine, visit:
https://github.com/OpenRefine/OpenRefine/wiki/Installation-Instructions