Python ETL samples using Docker
- Homebrew: to manage macOS dependencies (such as pip and MySQL).
- http://brew.sh/
- pip: to manage Python dependencies.
- http://docs.python-guide.org/en/latest/starting/install/osx/
sudo easy_install pip
- Docker: to create containers.
- https://docs.docker.com/docker-for-mac/
- Install and start Docker.
- Install PostgreSQL using Homebrew
brew install postgres
- Install psycopg2 using pip
sudo pip install psycopg2
- Start the PostgreSQL container
postgresql_container.sh
- Run the individual Python scripts
python create_db.py
python create_table.py
python insert_data.py
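The three scripts above appear to depend on each other (database, then table, then rows), so a small wrapper can run them in order and stop at the first failure. This is only a sketch: `run_all` and its `runner` parameter are hypothetical names, not part of this repository, and the ordering is an assumption based on the sequence listed above.

```python
import subprocess
import sys

# Script names are taken from the steps above; the order is assumed to matter.
SCRIPTS = ["create_db.py", "create_table.py", "insert_data.py"]


def run_all(scripts=SCRIPTS, runner=subprocess.run):
    """Run each script with the current interpreter, stopping at the first failure.

    `runner` is a hypothetical hook (defaults to subprocess.run) so the
    function can be exercised without touching a real database.
    Returns the list of scripts that exited with code 0.
    """
    completed = []
    for script in scripts:
        result = runner([sys.executable, script])
        if result.returncode != 0:
            print(f"{script} exited with code {result.returncode}; stopping.")
            break
        completed.append(script)
    return completed
```

Calling `run_all()` from the directory that contains the scripts, with the PostgreSQL container already started, would run them one after another.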
- Always work off a new branch.
- Create pull requests against the "dev" branch.
- Create two sample data files with 10 records.
- These two files will be loaded into the database.
- Create Python scripts to generate a large amount of data (do not upload large data files to GitHub).
- Create scripts to install and start MySQL.
- Create scripts to create a database and table in MySQL.
- Create a Python ETL script.
- Create a script to automatically run the Python ETL script.
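For the data-generation item above, one possible starting point is a script that writes synthetic rows to a CSV file locally, so large files never need to be committed. This is a minimal sketch under stated assumptions: the function name `write_sample_csv` and the columns `id`, `name`, and `amount` are hypothetical placeholders, since the real table schema is not described here.

```python
import csv
import random


def write_sample_csv(path, n_rows, seed=0):
    """Write n_rows of synthetic records to path, one row at a time.

    The column layout (id, name, amount) is a made-up example schema.
    """
    rng = random.Random(seed)  # fixed seed keeps repeated runs identical
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["id", "name", "amount"])
        for i in range(1, n_rows + 1):
            name = f"user_{rng.randint(1, 1000)}"
            amount = round(rng.uniform(1, 500), 2)
            writer.writerow([i, name, amount])
    return path
```

Calling `write_sample_csv("sample.csv", 10)` would produce a 10-record file like the sample files mentioned above; a larger `n_rows` gives a bigger file without holding all rows in memory.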