Skip to content

v0.2.0

Latest
Compare
Choose a tag to compare
@josiahdavis josiahdavis released this 06 Apr 06:34
· 232 commits to main since this release

Data Module and Environment Enhancement

  • Initial Release of Data Module to generate data pipelines. It supports:
    • Pyspark Sagemaker Processing jobs
    • Schedule the data pipeline to hourly, daily, weekly or custom intervals
    • The ability to specify the bucket where the data is read/written to at runtime.
  • Environment module enhancement with internet option.

Contributors to this Release

Yin Song (@yinsong1986), Wei Yih Yap (@yihyap), Kian Ho (@kianho), Chen Wu (@chenwuperth), Josiah Davis (@josiahdavis), Verdi March (@verdimrc).