
OpenAI and Broadcom have announced Jalapeño, a custom chip designed specifically for large language model inference in data centers. The chip was built from scratch based on insights from OpenAI researchers and aims to be more efficient than general-purpose chips currently used for this task, with early tests suggesting better performance per watt.
What happened
OpenAI and Broadcom unveiled a new chip called Jalapeño, an application-specific integrated circuit (ASIC) built from scratch for LLM inference in large data centers. The two companies say this is the first generation of a long-term project. The chip's design was informed by OpenAI's roadmap and took nine months to develop.
Why it matters
Current data centers rely on general-purpose chips that are not tailored to the specific demands of language models. Jalapeño is intended to be more specialized for these workloads, which may help companies reduce energy costs and improve efficiency when running AI systems at scale.
What to watch
OpenAI states that early testing shows Jalapeño will deliver substantially better performance per watt than the current state of the art, though the company notes that it is still measuring performance and will present a detailed technical report in the coming months.