All EXD content: interactive simulators, benchmark datasets, vLLM configs, and episode write-ups.
EXD
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
π§ EXD β Self-Directed PhD in AI
Engineering mastery through first principles, from the top down.
A deep dive into AI engineering β fine-tuning, architectures, inference optimization, and systems thinking. Work backwards from high-level concepts to fundamentals.
πΊοΈ Episodes
| # | Title | Video | Interactive | Article |
|---|---|---|---|---|
| 01 | Intro to EXD | πΊ Watch | β | β |
| 02 | Setup & First Inference | πΊ Watch | β | π GitHub |
| 03 | Inference Benchmarking | πΊ Watch | π Simulator | π GitHub |
| 04 | Performance Tuning | πΊ Watch | π Sim v2 | π GitHub |
| 05 | Speculative Decoding | πΊ Watch | β‘ Spec Decode | π GitHub |
| 06 | Taking Stock | πΊ Watch | β | β |
| 07 | Tokenization & Embeddings | πΊ Watch | π€ Notebook | π GitHub |
π¦ Artifacts
| Type | Name | Description |
|---|---|---|
| π Dataset | benchmark-results | Performance data from inference sweeps |
| βοΈ Dataset | vllm-configs | Production vLLM configuration profiles |
| π Article | GitHub episodes | Full write-ups for each episode |
| π Notebook | episode-07-tokenization | Tokenization & embeddings deep-dive |
π Links
πΊ YouTube Channel Β· π» GitHub Β· ποΈ @EXDai Β· π Collection
π οΈ Focus Areas
- Model fine-tuning (LoRA, QLoRA, RLHF/DPO)
- Transformer architectures (attention variants, MoE)
- Inference optimization (quantization, KV cache, speculative decoding, compilation)
Work backwards. Understand everything.
models 0
None public yet