Skip to content
Rajeev edited this page Oct 28, 2024 · 7 revisions

Welcome to the ReStart wiki!

Topics of Data Engineering-

  • SQL
  • NOSQL/MongoDB
  • Git Commands
  • Distributed Storage Fundamentals
  • Distributed Processing Fundamentals
  • Apache Spark
  • Azure Databricks
  • Azure Datafactory (for ingestion)
  • Azure Synapse
  • Data Modeling
  • System design
  • Deployment Part (CICD)
  • Loads of Performance tuning
  • Multi Cloud (Azure & AWS can be good options)
  • In AWS (EMR, Redshift, Athena, Glue, Lambda, S3)
  • Spark Structured Streaming
  • Kafka
  • Lakehouse
  • Open File formats
Clone this wiki locally