Skip to content

Latest commit

 

History

History
40 lines (26 loc) · 1.4 KB

README.md

File metadata and controls

40 lines (26 loc) · 1.4 KB

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

Herman Melville- Moby Dick

This repo contains a single book in different representations:

The conversion TEI to TF is done with programs/tfFromTei.py.

The conversion TF to Pandas is done with export.ipynb.

For an example of how to use the Pandas file, see programs/pandas.ipynb.

See docs for documentation about provenance and encoding.

Requirements

If you want to reproduce or make your own computations, pip-install the following Python modules

pip install text-fabric pandas pyarrow

Also, clone this repo, preferably in your ~/github/annotation directory.

Author

See about for the authors/editors of the data.

Dirk Roorda is the author of the representation in Text-Fabric of the data, and the programs and docs.