Published onApril 7, 2026The Last Mile of LLM InferenceAIComputer-ScienceSecurity(Part 3) Sampling strategies and why inference optimizations pose a security tradeoff
Published onApril 6, 2026⭐Why Your First Token Is Always LateAIComputer-ScienceSystem-Design(Part 2) The inference side of transformers, along with systems tricks that make production LLMs fast
Published onApril 5, 2026You're Billed by the Token. Here's What That Means.AIComputer-ScienceNLP(Part 1) BPE Tokenizers under the hood, and where tokenization breaks math, spelling and code
Published onSeptember 21, 2025⭐Netflix's Livestreaming Disaster- The Engineering Challenge of Streaming at ScaleSystem-DesignComputer-NetworkingCryptographyComputer-ScienceSecurity
Published onJune 2, 2025How to Pick the Perfect Movie for Your Friends (Using Math and AI)Game-TheoryRecommendation-SystemsComputer-ScienceAIMath