Data Module and Environment Enhancement
- Initial Release of Data Module to generate data pipelines. It supports:
- Pyspark Sagemaker Processing jobs
- Schedule the data pipeline to hourly, daily, weekly or custom intervals
- The ability to specify the bucket where the data is read/written to at runtime.
- Environment module enhancement with internet option.
Contributors to this Release
Yin Song (@yinsong1986), Wei Yih Yap (@yihyap), Kian Ho (@kianho), Chen Wu (@chenwuperth), Josiah Davis (@josiahdavis), Verdi March (@verdimrc).