Most ML portfolios stop at the demo. I take a model from the GPU kernel up to a real API and measure where it's wrong on data it never saw. Focus: GPU kernels (Triton/CUDA), deep learning, fine-tuning & serving, LLM eval & red-teaming, RAG & agents, MLOps, Python performance, quantization.