Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.

GSS-Cogs/databaker

Databaker

Jupyter notebook tool for converting data that is laid out in a formatted Excel spreadsheet into a normalized form for use by databases.
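
To illustrate what "normalized form" means here, the following minimal sketch uses plain Python rather than Databaker's own API. The sample grid, the function name `normalize_grid`, and the field names `period`, `region`, and `value` are all hypothetical, chosen only for illustration. It turns a small cross-tabulated grid, of the kind found in a formatted spreadsheet, into one record per observation.

```python
# Illustrative only: plain Python, not Databaker's API.
# A hypothetical grid as it might appear in a formatted spreadsheet:
# the first row holds column headers, the first column holds row headers.
GRID = [
    ["",     "North", "South"],
    ["2014", 10,      20],
    ["2015", 11,      21],
]


def normalize_grid(grid, row_name="period", col_name="region"):
    """Return one dict per observation cell (long / normalized form)."""
    header = grid[0][1:]  # column headers, skipping the empty corner cell
    records = []
    for row in grid[1:]:
        row_label, values = row[0], row[1:]
        # Pair every value with the column header above it.
        for col_label, value in zip(header, values):
            records.append(
                {row_name: row_label, col_name: col_label, "value": value}
            )
    return records


if __name__ == "__main__":
    for record in normalize_grid(GRID):
        print(record)
```

Each output record carries its own row and column labels, so it can be loaded into a database table without reference to the original layout.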

It depends on okfn/messytables and sensiblecodeio/xypath.

Python 3.4+ is supported.

Starting up

For development

To install for development, the easiest way is to create a virtualenv and activate it:

source bin/activate

and then type

pip install -e git+https://github.com/sensiblecodeio/databaker.git#egg=databaker

This will install the code into src/databaker where you can edit and commit it.

For normal use

Install with pip:

pip install databaker
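
After installing, a quick check like the one below can confirm the environment before launching Jupyter. It is a minimal sketch that uses only the standard library. The helper name `check_environment` is hypothetical, and the minimum version is taken from the Python 3.4+ note earlier in this README.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 4)  # from the README's "Python 3.4+" statement


def check_environment(package="databaker", min_python=MIN_PYTHON):
    """Return (python_ok, package_found) without importing the package."""
    python_ok = sys.version_info[:2] >= min_python
    # find_spec returns None when a top-level package cannot be located.
    package_found = importlib.util.find_spec(package) is not None
    return python_ok, package_found


if __name__ == "__main__":
    python_ok, package_found = check_environment()
    print("Python version OK:", python_ok)
    print("databaker importable:", package_found)
```

Run it with the same interpreter that Jupyter will use, for example from inside the activated virtualenv.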

Usage

Launch a Jupyter notebook:

jupyter notebook

and then follow the tutorials as described below.

Documentation

The current documentation is in the form of Jupyter notebooks located inside the tutorial directory.

You can access these directly by creating a new Jupyter notebook and running the following in a Jupyter cell:

from databaker.tutorial import tutorial
tutorial()

which will copy the tutorials to your current directory and provide links to these copied notebooks.

Authors

Made by the Sensible Code Company on behalf of the Office for National Statistics (UK).
