Miami-based AI startup Subquadratic claims its new model SubQ solves a longstanding computational bottleneck in LLMs, making them faster and far cheaper to run—and independent tests seem to back up the claim.

MIT Technology Review AI14h ago3 min read

Summaries like this, in your inbox every morning.

3 Key Points

1
What happened: Subquadratic emerged from stealth last month claiming to have cracked a mathematical problem that has constrained large language models for nearly a decade. The company's new model, SubQ, uses sparse attention (a method that drastically cuts the number of computations needed) instead of the dense attention mechanism that powers today's most advanced models. An independent evaluation by third-party firm Appen found that SubQ was 56 times faster than models using FlashAttention, a previous sparse-attention technique, and matched the coding performance of top models from Google DeepMind, OpenAI, and Anthropic on standard benchmarks.
2
Why it matters: Most LLMs rely on a transformer architecture that requires multiplying every word's numerical encoding with every other word's encoding—a process that becomes exponentially more expensive as text grows longer. SubQ's sparse attention selects only the most relevant word relationships to process, which the company claims could dramatically lower costs and energy use without sacrificing performance. If SubQ's results hold up, it could reshape how companies build language models going forward and make AI applications far cheaper to operate.
3
What to watch: Subquadratic says SubQ can process up to 12 million tokens at once in its context window, compared with one million tokens for most top models today. The company has not yet made SubQ widely available for public testing, and cost claims are difficult to verify independently at this stage. According to the CEO, running Anthropic's Opus 4.6 through a standard test cost $2600, while SubQ cost eight dollars—but until the model is available to the broader market, this comparison cannot be independently confirmed.

Discussion

No comments yet. Be the first to share your thoughts!

Minovative Mind releases a CLI tool that orchestrates multiple AI models to generate and modify code with built-in safeguards against errors and malicious input.

Hacker News2h ago

Meta launches AI agents that handle customer service, sales, and transactions directly—shifting the company from selling ads to controlling the commercial moment itself.

Hacker News2h ago

Konxios, a local-first AI operating system that integrates multiple AI models and services, has entered public beta, allowing developers and creators to build and run custom AI agents with privacy controls on their own machines.

Hacker News2h ago

Ratchet, a new open-source toolkit, lets users and AI agents reflash corrupted BIOS on motherboards using inexpensive USB programmers.

Hacker News2h ago

SkillsGuard, a free static security scanner, launches to detect malicious code in AI agent skill packages before they run—no account, token, or LLM endpoint required.

Hacker News2h ago

An engineer warns that using AI to automatically write incident reports risks hiding critical system failures because nobody reads them carefully enough to catch fabricated details.

Hacker News2h ago

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →

Miami-based AI startup Subquadratic claims its new model SubQ solves a longstanding computational bottleneck in LLMs, making them faster and far cheaper to run—and independent tests seem to back up the claim.

3 Key Points

Discussion

Related Articles

Minovative Mind releases a CLI tool that orchestrates multiple AI models to generate and modify code with built-in safeguards against errors and malicious input.

Meta launches AI agents that handle customer service, sales, and transactions directly—shifting the company from selling ads to controlling the commercial moment itself.

Konxios, a local-first AI operating system that integrates multiple AI models and services, has entered public beta, allowing developers and creators to build and run custom AI agents with privacy controls on their own machines.

Ratchet, a new open-source toolkit, lets users and AI agents reflash corrupted BIOS on motherboards using inexpensive USB programmers.

SkillsGuard, a free static security scanner, launches to detect malicious code in AI agent skill packages before they run—no account, token, or LLM endpoint required.

An engineer warns that using AI to automatically write incident reports risks hiding critical system failures because nobody reads them carefully enough to catch fabricated details.

Stay ahead with AI news