A complete Open Source Data Platform with ETL, DataWarehouse and DataViz
- Docker (started) and docker-compose (just install Docker for Desktop if you are on laptop)
- Clone repo
git clone https://github.com/datafuel/DataPlatform_docker.git
- Run
cd DataPlatform_docker
- Rename .env.example to .env and replace dummy values with yours
- Run
docker-compose up
then access the services
Make sure to wait at least 30 seconds before launching any service !
- Metabase : http://localhost:3000
- Airbyte : http://localhost:8000
- dbt docs : http://localhost:4444 (after running the instructions described below)
- Adminer : http://localhost:8080
-
Run
docker-compose up
-
Wait until all services are ready then open http://localhost:8000
-
Create a File source with the values below (you can input the values of your choice for other fields)
- url : https://www.data.gouv.fr/fr/datasets/r/63352e38-d353-4b54-bfd1-f1b3ee1cabd7
- format : csv (and HTTPS: Public Web in the field below)
- storage : HTTPS
- dataset_name : covid-france
- reader_options :
{"quotechar":"\"", "sep":";"}
-
Create a Postgres destination with the values below (you can input the values of your choice for other fields)
- host : localhost
- Port : 5433
- schema : stg
- database : testdb
- password : (DWH_POSTGRES_PASSWORD in .env)
- username : (DWH_POSTGRES_USERNAME in .env)
-
Create another File source connector with the following arguments :
- url : https://www.data.gouv.fr/fr/datasets/r/70cef74f-70b1-495a-8500-c089229c0254
- format : csv (and HTTPS: Public Web in the field below)
- storage : HTTPS
- dataset_name : departements-france
- reader_options : Leave empty
- Run
docker exec dbt python metabase_setup.py
- Run
docker exec dbt bash run_dbt.sh
- Run
docker exec -d dbt bash serve_dbt_docs.sh
You can then access the docs website on http://localhost:4444
- host : Name of the service, here postgres_dwh
- database : testdb
- password : (DWH_POSTGRES_PASSWORD in .env) (make sure to use 8 characters)
- username : (DWH_POSTGRES_USERNAME in .env)
This project use dbt-metabase. gouline is the owner of the code used in /metabt (except Admin Class that was part of the metabase_configurator service).