Skip to Main Content
UC Logo
Libraries | Ask the Libraries

OpenRefine Resources

OpenRefine, http://openrefine.org, is a free, powerful, and easy-to-use tool for cleaning up and transforming datasets in order to prepare them for analysis and sharing. You will learn how to leverage OpenRefine’s interface and scripting language for basi

Introduction and Installation

Introduction

OpenRefine is a tool for working with messy data;

  • Clean
  • Transform from one format into another
  • Extend with web services and external data
  • Formerly Freebase Gridworks and then Google Refine and formerly supported by Google until 10/02/2012

Rebranded as OpenRefine and supported by volunteers on a GitHub open source community.

With a simple tabular format, you can 

  • Get an overview of a data set
  • Resolve inconsistencies in a data set formats
  • Resolve inconsistencies in where data appears
  • Resolve inconsistencies in terminology used in the data
  • Split data up into more granular parts
  • Match local data up to other data sets
  • Enhance a data set with data from other sources

 

Installation Guide

You can download OpenRefine from ‘http://openrefine.org/download.html

Windows/Linux

  • Download zip file
  • unzip the downloaded file in any directory of choice. OpenRefine should run wherever you put the unzipped folder.

Mac

  • Download  ‘dmg’ (disk image) file and open it.
  • Drag the OpenRefine application to an appropriate folder.
OpenRefine is a java application, and you need to have a ‘java runtime environment’ (JRE) installed
To download and install JRE go to ‘http://java.com’ and click ‘Free Java Download

Running OpenRefine

Windows

  • Navigate to folder where OpenRefine is unzipped
  • Double-click ‘openrefine.exe’

Linux

  • Open a terminal window
  • Navigate to folder where OpenRefine is unzipped
  • Type ‘./refine’

Mac

  • Navigate to location with OpenRefine
  • Click the OpenRefine icon


For more details on downloading, installing and running OpenRefine visit
https://github.com/OpenRefine/OpenRefine/wiki/Installation-Instructions

University of Cincinnati Libraries

PO Box 210033 Cincinnati, Ohio 45221-0033

Phone: 513-556-1424

Contact Us | Staff Directory

University of Cincinnati

Alerts | Clery and HEOA Notice | Notice of Non-Discrimination | eAccessibility Concern | Privacy Statement | Copyright Information

© 2021 University of Cincinnati