withsoon
Built in public ยท Updated as the industry moves

Everything for a
Big Data + AI Engineer

System Design ยท Setup Guides ยท LLM Deep Dives ยท Interview Prep ยท Cheatsheets. One platform โ€” daily work, career growth, and staying current.

๐Ÿš€ Get started

Step-by-step setup guides โ€” copy, paste, run.

All setup guides โ†’

โญ Featured

System Designadvancedโ˜… Featured

System Design: Real-Time Data Pipeline

Design a real-time data pipeline that ingests 1M events/second, processes them, and serves analytics โ€” with architecture diagram and trade-off discussion.

#system-design#interview#kafka
2026-06-05
Setup Guidebeginnerโ˜… Featured

Kafka Local Setup โ€” From Zero to Running in 10 Minutes

Run Kafka locally using Docker Compose. Create topics, produce messages, consume them, and understand what's happening under the hood.

#kafka#setup#docker
2026-06-05
System Designintermediateโ˜… Featured

Spark Architecture โ€” How it Actually Works

Driver, executors, DAG scheduler, shuffle โ€” the complete mental model for how Spark executes a job and where things go wrong.

#spark#system-design#big-data
2026-06-05
Cheatsheetbeginnerโ˜… Featured

SQL Cheatsheet โ€” Window Functions, Joins, Optimization

The most important SQL patterns for data engineers โ€” window functions, CTEs, joins, aggregations, and query optimization.

#sql#cheatsheet#data-engineering
2026-06-04
Interview Q&Aintermediateโ˜… Featured

Kafka Interview Questions โ€” Top 50 with Answers

The 50 most asked Kafka interview questions with detailed answers โ€” from basics to internals to production scenarios.

#kafka#interview#streaming
2026-06-04
Guideintermediateโ˜… Featured

RAG Pipeline โ€” Complete Production Guide

Build a production-ready RAG pipeline: chunking strategy, embedding models, vector DBs, retrieval, re-ranking, and generation.

#rag#embeddings#vector-db
2026-06-04

๐Ÿ• Recently added

System Designadvancedโ˜… Featured

System Design: Real-Time Data Pipeline

Design a real-time data pipeline that ingests 1M events/second, processes them, and serves analytics โ€” with architecture diagram and trade-off discussion.

#system-design#interview#kafka
2026-06-05
Setup Guidebeginnerโ˜… Featured

Kafka Local Setup โ€” From Zero to Running in 10 Minutes

Run Kafka locally using Docker Compose. Create topics, produce messages, consume them, and understand what's happening under the hood.

#kafka#setup#docker
2026-06-05
System Designintermediateโ˜… Featured

Spark Architecture โ€” How it Actually Works

Driver, executors, DAG scheduler, shuffle โ€” the complete mental model for how Spark executes a job and where things go wrong.

#spark#system-design#big-data
2026-06-05
Cheatsheetbeginnerโ˜… Featured

SQL Cheatsheet โ€” Window Functions, Joins, Optimization

The most important SQL patterns for data engineers โ€” window functions, CTEs, joins, aggregations, and query optimization.

#sql#cheatsheet#data-engineering
2026-06-04
Interview Q&Aintermediateโ˜… Featured

Kafka Interview Questions โ€” Top 50 with Answers

The 50 most asked Kafka interview questions with detailed answers โ€” from basics to internals to production scenarios.

#kafka#interview#streaming
2026-06-04
Guideintermediateโ˜… Featured

RAG Pipeline โ€” Complete Production Guide

Build a production-ready RAG pipeline: chunking strategy, embedding models, vector DBs, retrieval, re-ranking, and generation.

#rag#embeddings#vector-db
2026-06-04

Built in public, growing daily

Every guide, comparison, and cheatsheet is added as the industry evolves. No filler โ€” just content that actually works in production.