Welcome to My Blog
An introduction to what you'll find here: thoughts on AI infrastructure, distributed systems, and lessons learned from building at scale.
Welcome!
I'm Chao Li, a Principal Engineer passionate about building the infrastructure that powers AI. After years of working on distributed systems and LLM infrastructure, I've decided to start writing about what I've learned.
What to Expect
On this blog, I'll be sharing:
AI Infrastructure Deep Dives
From distributed training pipelines on Ray and Kubernetes to fine-tuning LLMs with LoRA adapters, I'll break down the technical challenges and solutions I encounter in my work.
High-Performance Systems
Building systems that handle millions of events per second requires careful attention to async patterns, memory management, and system design. I'll share patterns and anti-patterns from real-world experience.
Homelab Adventures
I run my own ML infrastructure at home, from C++ backtesting engines with microsecond latency to Ray clusters on Kubernetes. Expect posts about self-hosting, hardware choices, and making it all work together.
Why Write?
Writing helps me solidify my understanding of complex topics. If these posts help others along the way, even better.
Stay tuned for more technical deep dives!
Feel free to reach out if you have questions or topics you'd like me to cover.