[P] Adaptive load balancing in Go for LLM traffic - harder than expected
Nvidia: End-to-End Test-Time Training for Long Context aka Being Able To Update A Model's Weights In Real-Time As You Use It | "TTT changes the paradigm from retrieving info to learning it on the fly...the TTT model treats the context window as a dataset & trains itself on it in real-time." [R]
[R] statistical learning in machine learning vs cognitive sciences
ISBI 2026: Results Out [D]
[P] my shot at a DeepSeek style moe on a single rtx 5090