Engineering updates and deep dives.
Why we're building a weight-indexed inference engine for MoE models.