Tag
#efficiency
From Radar
Radar · 2026-05-06
SubQ review: great numbers, but still a test of benchmark faith
Fello AI reviews SubQ's claims: 12M token context window, 52x faster prefill than FlashAttention on 1M tokens and frontier-class benchmark positioning. The numbers are striking enough to need independent verification before they change architecture decisions.
Read →Radar · 2026-05-05
Subquadratic raises $29M for 12M-token context windows
Subquadratic has launched with $29 million in seed funding and introduced SubQ, a model built on a subquadratic architecture and sparse attention to push context windows as high as 12 million tokens. The promise is longer context, higher speed, better accuracy and lower cost. The proof still needs independent benchmarks.
Read →