Coffee & Systems - Week #1 [Stream Processing - Part 1]
The concept is simple, we pick a list of articles/talks to discuss about in a random cafe. The overall aim is to build a community of folks motivated to learn more about systems.
Reading Material(s)
This week focuses on stream processing (in large part because I have the most experience with streaming systems at work and feel I can drive the discussions).
We will read through one of the best articles (IMO) on what streaming processing is, the need for it and the challenges. We will also discuss about watermarks and how it is important to ensure the correctness of results. I have also linked an optional video (also by Tyler Akidau) on the evolution of streaming systems over time.
Streaming 101: The world beyond batch - https://www.oreilly.com/radar/the-world-beyond-batch-streaming-101/
Correct and Low Latency Real Time Streaming Joins - https://www.youtube.com/watch?v=I6MJqNAM2qU
(Optional) The evolution of massive scale data processing - https://www.youtube.com/watch?v=9J_cWustI-A
This will be a dense but beginner-friendly introduction to the topic and we might have a follow-up session to go more in-depth.
Some ground rules
The discussion material will be posted in advance and folks are expected to pre-read
Usually, a single person will drive the initial discussion(pre-decided) summarising the material, post this we can all deep-dive into healthy discussions on the same
This is not a place to discuss interview questions (design YouTube/Instagram/Twitter/Google Docs from scratch…)
The topics should be intermediate level (goldilocks principle - not so hard that folks have a hard time following, not so easy that folks are bored)
There is no fixed schedule for the learning group. We might have this every fortnight/every month or might do this weekly (depending on interest)