AIToday

Miami-based AI startup Subquadratic claims its new model SubQ solves a longstanding computational bottleneck in LLMs, making them faster and far cheaper to run—and independent tests seem to back up the claim.

MIT Technology Review AI14h ago3 min read
Miami-based AI startup Subquadratic claims its new model SubQ solves a longstanding computational bottleneck in LLMs, making them faster and far cheaper to run—and independent tests seem to back up the claim.

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    What happened: Subquadratic emerged from stealth last month claiming to have cracked a mathematical problem that has constrained large language models for nearly a decade. The company's new model, SubQ, uses sparse attention (a method that drastically cuts the number of computations needed) instead of the dense attention mechanism that powers today's most advanced models. An independent evaluation by third-party firm Appen found that SubQ was 56 times faster than models using FlashAttention, a previous sparse-attention technique, and matched the coding performance of top models from Google DeepMind, OpenAI, and Anthropic on standard benchmarks.

  2. 2

    Why it matters: Most LLMs rely on a transformer architecture that requires multiplying every word's numerical encoding with every other word's encoding—a process that becomes exponentially more expensive as text grows longer. SubQ's sparse attention selects only the most relevant word relationships to process, which the company claims could dramatically lower costs and energy use without sacrificing performance. If SubQ's results hold up, it could reshape how companies build language models going forward and make AI applications far cheaper to operate.

  3. 3

    What to watch: Subquadratic says SubQ can process up to 12 million tokens at once in its context window, compared with one million tokens for most top models today. The company has not yet made SubQ widely available for public testing, and cost claims are difficult to verify independently at this stage. According to the CEO, running Anthropic's Opus 4.6 through a standard test cost $2600, while SubQ cost eight dollars—but until the model is available to the broader market, this comparison cannot be independently confirmed.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →