dags/ contains Airflow dags for data orchestration to GCP
utils/ contains utils for Airflow dags
configs/ contains configs for Airflow dags
logs/ contains logs from Airflow
plugins/ contains plugins for Airflow
-
Build docker containers with command:
docker compose build
-
Run docker containers with command:
docker-compose up
-
Go to http://localhost:8080/
-
Type
admin
as login andtest
as password in UI:
- In
home
page you can see 3 dags:
- Create variable
gcp_service_account_info
and paste inval
input body of yourservice account
json file:
- To stop all docker containers press
CTRL+C
Runs ETL processes for
application*.csv
files and saves data (in.parquet
) tosteam_data_927433242395
bucket.
Data pipeline diagram:
Runs ETL processes for
PlayerCountHistory/*.csv
files and saves data (in.parquet
) tosteam_data_927433242395
bucket.
Data pipeline diagram:
Runs ETL processes for
PriceHistory/*.csv
files and saves data (in.parquet
) tosteam_data_927433242395
bucket.
Data pipeline diagram: