Nvidia: End-to-End Test-Time Training for Long Context aka Being Able To Update A Model's Weights In Real-Time As You Use It | "TTT changes the paradigm from retrieving info to learning it on the fly...the TTT model treats the context window as a dataset & trains itself on it in real-time." [R]
[P] my shot at a DeepSeek style moe on a single rtx 5090
ISBI 2026: Results Out [D]
Spine surgery has massive decision variability. Retrospective ML won’t fix it. Curious if a workflow-native, outcome-driven approach could. [D]
[P] Provider outages are more common than you'd think - here's how we handle them