- Create a Python virtual environment using
python3 -m venv <path>
and activate it, then install all the requirements listed in requirements.txt using
pip3 install -r requirements.txt
- Install and run MongoDB locally
- My machine was a Mac with Apple Silicon, so I followed the docs at https://www.mongodb.com/docs/manual/tutorial/install-mongodb-on-os-x/
- After installing MongoDB using brew, start the local server using
brew services start mongodb-community@<version>
- Open the mongo shell using the command
mongosh
- Create a new database using the command
use newDB
(as used in the etl.py file, line number 43)
- Create a new collection called 'etl_output' using the command
db.createCollection("etl_output")
- Check that no data is present in it at the moment using the command
db.etl_output.find()
then exit mongosh
- Execute the script using
python3 etl.py
- Head back to the mongo shell using
mongosh
and then switch to the database using the command
use newDB
- Check your merged data using the command
db.etl_output.find()
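The final check can also be done from Python instead of mongosh. The sketch below is only an illustration, not part of etl.py: it assumes pymongo is installed in the virtual environment (it may or may not be in requirements.txt) and that the local server listens on the default address mongodb://localhost:27017. The function names `summarize_collection` and `check_etl_output` are hypothetical helpers introduced here.

```python
def summarize_collection(collection, sample_size=3):
    """Return (document_count, sample_documents) for a MongoDB collection."""
    # count_documents({}) counts every document in the collection.
    count = collection.count_documents({})
    # Fetch only a few documents so large collections are not loaded in full.
    samples = list(collection.find().limit(sample_size))
    return count, samples


def check_etl_output(uri="mongodb://localhost:27017"):
    """Connect to the local server and summarize newDB.etl_output.

    The URI is an assumption (MongoDB's default local address); the
    database and collection names come from the steps above.
    """
    # Imported here so summarize_collection works without pymongo installed.
    from pymongo import MongoClient

    # Fail after 2 seconds instead of hanging if the server is not running.
    client = MongoClient(uri, serverSelectionTimeoutMS=2000)
    try:
        return summarize_collection(client["newDB"]["etl_output"])
    finally:
        client.close()
```

Calling `check_etl_output()` before running etl.py should report a count of 0, and a non-zero count afterwards suggests the merged data was written.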
Rishabh-Hupr/mongoETLpipeline
A basic ETL job written in python targeting mongoDB for storing transformed data