Data Engineering Zoomcamp

Hi everyone!

We hope you're enjoying the course! Now it's time to put everything we learned into practice

We're starting to work on our projects

Here you can find more information about it: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/projects/README.md

You'll find the links for submitting your projects in the course management platform (and also here: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/project.md)
If you're looking for a project idea and interested in Blockchain, check this:

https://bush-thrill-4c5.notion.site/Solana-on-chain-analytics-competition-1aa9ca5fcbe7806dbee4eb96570c10a6?pvs=4

Our friend Dmitry Dremov is organizing a series of analytics competitions on Solana data

You can check the past and ongoing competitions to learn more about the dataset, or play with the data yourself and see if it's interesting enough to build a project around
Hi everyone!

The content and the homework for module 6 are ready

You can start working on the module

Module: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/06-streaming

Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/06-streaming/homework.md


The homework is based on the PyFlink stream that Zach did, so you can treat the rest of the videos in module 6 as optional

Have fun and let us know in Slack if you have any problems
We're still preparing the homework for module 6, so you can continue working on module 5 in the meantime. We'll leave the homework form open for some time
Stream about streaming with Zach: https://www.youtube.com/watch?v=P2loELMUUeI
(Available for replay later)
We're starting a stream about streaming in 1 hour. May the stream be with you!
Hi everyone!

We accidentally discovered that for Q3 of homework 5, different versions of Spark give different answers

We're still trying to figure out why it's happening, but if you come across this issue, select the closest option: it will still be the correct one
Hi everyone!

We will record module 6 (streaming) as a live session. It'll be tomorrow (Monday) at 17:00 CET. We'll send a reminder one hour before the start and share the link here too.

See you tomorrow!
Hi everyone!

Today we started module 5: batch processing

Materials: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/05-batch
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/05-batch/homework.md

Have fun!

The form for submitting homework 4 will remain open for some time
Hi everyone!

We are extending module 4 by 3 days. We will start module 5 (Spark) on Thursday

Happy learning!
We're starting module 4 on dbt and analytics engineering!

Module: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/04-analytics-engineering
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/04-analytics-engineering/homework.md

Have fun!
Did somebody do the extra dlt homework that we gave at the end of the workshop? We're still waiting for the PR!
We scored homework 3 - you can see the results in the course management platform

We start module 4 next Monday, so you have a few days to catch up with the workshop

Have fun!
You asked how to get a dlt T-shirt

The answer: use dlt in your course project!

We will have 10 T-shirts, so we'll do a raffle if more than 10 people use dlt. But virtual hugs from the entire dlt team are guaranteed to everyone!
Workshop stream: https://www.youtube.com/watch?v=pgJWP_xqO1g

https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/cohorts/2025/workshops/dlt

Watch now or catch the recording later
The dlt workshop happens today at 16:30 Berlin time

We will share the link to the stream here 5-10 minutes before we start

If you can't make it, don't worry, the stream will be recorded

See you all soon!
We started module 3 yesterday

Materials: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/03-data-warehouse
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/03-data-warehouse/homework.md

Have fun!
Workflow Orchestration office hours

Stream: https://www.youtube.com/watch?v=aBQulSpOgfY
Questions: https://app.sli.do/event/aFZgu1o3nbgJTyX1AH1dtF

Join now or watch the recording later
We're starting in one hour!

You can ask your questions in advance here: https://app.sli.do/event/aFZgu1o3nbgJTyX1AH1dtF

We will share the link to the stream here 5-10 minutes before the start
Tomorrow at 17:00 Berlin time we have office hours with Will

We will quickly go over frequent problems and then answer your questions

You can ask your questions in advance here: https://app.sli.do/event/aFZgu1o3nbgJTyX1AH1dtF

The questions should be about workflow orchestration or Kestra

See you tomorrow!