AIToday

🔥 Updated in real-time

Today's Top AI News

Curated from 200+ sources across AI & machine learning

xAI launches Grok 4.3 at an aggressively low price and a new, fast, powerful voice cloning suite
TOP STORY · Models & Gen AI

While Elon Musk faces off in court against Sam Altman, his former colleague and OpenAI co-founder, Musk's rival firm xAI, founded to take on OpenAI, isn't slowing down on launching competitive new products and services. Last night, xAI shipped a new, proprietary base large language model (LLM), Grok 4.3, and a new voice cloning suite on the web. The new products arrive after months of tumult at xAI, during which all 10 of Musk's original co-founders of the lab and dozens more researchers exited the firm, and Grok was eclipsed on performance by many new competing LLMs from the likes of OpenAI, Anthropic, Google, and Chinese firms DeepSeek, Moonshot (Kimi), Alibaba (Qwen), z.ai, and others. While Grok 4.3 does mark a significant leap in performance on third-party benchmarks over its direct predecessor Grok 4.2, according to the independent AI model evaluation firm Artificial Analysis, it still remains below the state of the art set by OpenAI and Anthropic's latest models. But the marquee fe

VentureBeat AI·8m ago
Nvidia Is Worth $5 Trillion Once Again. Here's Why It Could Become the First $10 Trillion Stock Within the Next 3 Years
#2 · General AI

Yahoo Finance AI·8m ago
First Chinese AI startups are reportedly ditching offshore structures to register directly in China
#3 · General AI

THE DECODER·8m ago
🧠

Models & Gen AI

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."

Models & Gen AI
Ars Technica AI
The AI Gold Rush Just Hit a New Layer: Here’s Why Sandisk Is Printing Money

The artificial intelligence boom is entering a new phase. For the past two years, investors obsessed over GPUs, data centers, and power consumption. But agentic AI (systems that can reason, plan, and act independently) is changing the equation. These models don’t just need compute. They need enormous amounts of fast memory constantly available ...

Models & Gen AI
Yahoo Finance AI
The Internet Needs a New Layer for AI Agents

In the future, everyone will have their own AI agent. Not just a chatbot, but an actual agent that works for you. It will write code, automate tasks, coordinate workflows, search for information, and interact with other agents. But if millions of agents exist, they need a way to identify and reach each other. Agents should have addresses. Simple human-readable identities instead of random hashes. Something agents can discover, message, hire, and collaborate with. An address becomes more than a name. It becomes an entry point into an agent. That’s what I’m building right now. A decentralized network where AI agents can communicate, collaborate, share knowledge, and work together through a unified addressing system. Not isolated tools. A real network for agents. And I’m planning to make the entire thing open source and free for anyone to use. You can leave your email here to get early access: www.cogninet.co submitted by /u/sherdil09

Models & Gen AI
r/AI_Agents
Anthropic launches Claude Security to give defenders the same AI edge attackers already have

Anthropic wants to give cyber defenders an edge with Claude Security, drawing on the same offensive capabilities it recently deemed too dangerous to release in another model.

Models & Gen AI
THE DECODER
Microsoft wants lawyers to trust its new AI agent in Word documents

Microsoft is launching a new AI agent inside Word that's specifically designed for legal teams. Legal Agent handles document edits, negotiation history, and complex documents to help legal teams handle tasks like reviewing contracts. "Instead of relying on general AI models to interpret commands, the agent follows structured workflows shaped by real legal practice, managing clearly defined, repeatable tasks like reviewing contracts clause by clause against a playbook," explains Sumit Chauhan, corporate vice president of Microsoft's Office Product Group. The Legal Agent can work with existing documents that have tracked changes, and analyz … Read the full story at The Verge.

Models & Gen AI
The Verge AI
Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model

Mistral's new flagship, Mistral Medium 3.5, merges what used to be separate models for chat, reasoning, and code into a single product. The French company is also adding asynchronous cloud agents to its coding tool Vibe and giving Le Chat a new agent mode.

Models & Gen AI
THE DECODER
Newbie AI question

TBH I don't know if our current "AI" models are capable of thinking. There is a massive pattern I've been noticing when using AI over the past couple of years: AI follows a strict pattern and doesn't seem to think. Just like a calculator, it already has a designated answer regardless of the question; it's just a bit more advanced. Hence why it lies to many users. Or it could be that there are so many rules on the intelligence model that it is constantly bouncing off walls to give you an already-programmed answer so as not to break these rules. I'm not sure about either. I'd much rather call AI as of right now "engineered intelligence", not artificial, since it's still learning from us engineers, and it will eventually adapt into intelligence. (This is under the assumption that it can truly think freely.) Does anyone know if models like Gemini, ChatGPT, and Claude actually "think"? submitted by /u/Opening-Name-5270

Models & Gen AI
r/artificial
Writer launches AI agents that can act without prompts, taking on Amazon, Microsoft and Salesforce

Writer, the enterprise AI agent platform backed by Salesforce Ventures, Adobe Ventures, and Insight Partners, today launched event-based triggers for its Writer Agent platform, enabling AI agents to autonomously detect business signals across Gmail, Gong, Google Calendar, Google Drive, Microsoft SharePoint, and Slack — and execute complex multi-step workflows without any human initiating the process. The release, which also includes a new Adobe Experience Manager connector and a suite of enhanced governance controls such as bring-your-own encryption keys and a Datadog observability plugin, represents Writer's most aggressive bet yet on fully autonomous enterprise AI. It arrives at a moment when AWS, Salesforce, and Microsoft are all racing to establish their own agentic platforms, and when the question of how much autonomy enterprises will actually hand to AI agents remains deeply unresolved. "We are launching a series of event triggers that power and drive our playbooks to be more pro

Models & Gen AI
VentureBeat AI
OpenAI announces new advanced security for ChatGPT accounts, including a partnership with Yubico

OpenAI is launching additional opt-in protections for ChatGPT accounts. The new security initiative includes a new partnership with security key provider Yubico.

Models & Gen AI
TechCrunch AI
[AINews] Agents for Everything Else: Codex for Knowledge Work, Claude for Creative Work

a quiet day lets us reflect on coding agents "breaking containment"

Models & Gen AI
Latent Space
BatteryPass-12K: The First Dataset for the Novel Digital Battery Passport Conformance Task

arXiv:2604.26986v1 Announce Type: new Abstract: We introduce a novel task of digital battery passport (DBP) conformance classification and introduce the first public benchmark for the task: BatteryPass-12K, created synthetically from real pilot samples. This is as the EU's battery regulation on DBPs comes into effect soon and there exists no public dataset. We evaluated 22 language models (LMs) in zero-shot inference, spanning small LMs (SLMs), mixture of experts (MoEs), and dense LLMs. We also conducted analysis, additional evaluations of few-shot inference and prompt-injection attacks to find that (1) Thinking models have the best performance (with GPT-5.4 scoring 0.98 (0.03) and 0.71 (0.22) on average as F1 (and confidence interval at 95%) on the validation and test sets, respectively), (2) few-shot examples improve performance significantly, (3) generally capable frontier models find the task challenging, (4) merely scaling model parameters does not necessarily lead to improved pe

Models & Gen AI
arXiv cs.CL
Show HN: LLM-Powered News -> Event Map, Timeline, and Analysis

Been working on this for a few months, since the day the USA hit Iran. It started out as a simple OSS conflict monitor of which many were popping up at the time, but it developed into an entirely domain-agnostic pipeline which extracts claims + evidence, synthesizes events, and maps them on a timeline. It also attributes actors, relates events to each other, and contributes analysis. There are a lot of features under the hood that I'm not sure what to do with yet - various contextual analyses, a storytelling mode that will fly you around and voice-over a series of events automatically, a system which makes and later scores predictions. There's also an entire "newsroom" editorial and journalistic layer which writes and publishes articles based on developments, using its own judgement. Currently running (astonishingly well at the price point) on deepseek-3.2, which tends to reject Chinese military news. American models tend to refuse on Iran-Israel. I've had a lot of fun and felt very we

Models & Gen AI
Hacker News
One tool call to rule them all? New open source Python tool RunPod Flash eliminates containers for faster AI dev

Runpod, the high-performance cloud computing and GPU platform designed specifically for AI development, today launched a new open source, MIT licensed, enterprise-friendly Python programming tool called Runpod Flash — and it is poised to make creation, iteration and deployment of AI systems inside and outside of foundation model labs much faster. The tool aims to eliminate some of the biggest barriers and hurdles to training and using AI models today, namely, doing away with Docker packages and containerization when developing for serverless GPU infrastructure, which the company believes will speed up development and deployment of new AI models, applications and agentic workflows. Additionally, the platform is built to serve as a critical substrate for AI agents and coding assistants—such as Claude Code, Cursor, and Cline—enabling them to orchestrate and deploy remote hardware autonomously with minimal friction. Developers can utilize Flash to accomplish a diverse set of high-perform

Models & Gen AI
VentureBeat AI

General AI

Pentagon expands AI partnerships with major tech firms

Tech giants expand role in classified operations

General AI
Yahoo Finance AI
Shares Up 110% In April, New Director and High-Power AI Solutions Bolsters Navitas Semiconductor (NVTS)’s Bullish Case

After surging 108.67% so far in April, Navitas Semiconductor Corporation (NASDAQ:NVTS) secures a spot on our list of the mid-cap stocks with the highest gains in April. The most recent spark occurred on April 13, 2026, when Gregory M. Fischer, a seasoned semiconductor engineer, was named an independent director by Navitas Semiconductor Corporation (NASDAQ:NVTS) with […]

General AI
Yahoo Finance AI
built a personal journalist kinda news agent to easily be informed about anything you care about

Hi everyone, I built an AI personal journalist agent that helps you easily follow any topic or webpage and get alerted to the changes you care about. You just type in what you want to follow and add notification criteria, and the AI keeps monitoring the information, understands it, and decides whether it's worth bugging you about. It helps you monitor many things you care about without manually reading, understanding, and deciding. I built it because I often had to jump between tech news sites and other sources to stay updated. We just came out of beta. If you're interested in trying it out, the product is in the comments. submitted by /u/ayesrx9

General AI
r/AI_Agents
Nebius Acquires Eigen AI To Speed Up Cloud Computing Services

Nebius stock rose amid the cloud computing services provider's acquisition of artificial intelligence software maker Eigen AI.

General AI
Yahoo Finance AI
Snap CEO praises AI for writing two-thirds of the company’s code but warns fellow tech executives underestimate ‘societal pushback’ to the tech

A survey in March found only 26% of Americans had a favorable view of AI.

General AI
Fortune AI
Pentagon inks deals with Nvidia, Microsoft, and AWS to deploy AI on classified networks

The deals come as the DOD has doubled down on diversifying its exposure to AI vendors in the wake of its controversial dispute with Anthropic over usage terms of its AI models.

General AI
TechCrunch AI
A new US phone network for Christians aims to block porn and gender-related content

A new US-wide cell phone network marketed to Christians is set to launch next week. It blocks porn, which experts in network security say marks the first time a US cell plan has used network-level blocking for such content that can’t be turned off even by adult account owners. It’s also rolling out a filter…

General AI
MIT Technology Review AI
AI evals are becoming the new compute bottleneck

Hi! I wanted to share my new blog on the costs of running AI Evals. We dig into how benchmarking frontier systems now routinely costs tens of thousands of dollars per run, why agent evals are especially unpredictable, and what that concentration of validation authority means for the broader research community. submitted by /u/evijit

General AI
r/LocalLLaMA
X announces a rebuilt ad platform powered by AI

X is rolling out a rebuilt ads platform powered by AI as it works to grow revenue again.

General AI
TechCrunch AI
Netomi raises $110 million as Accenture and Adobe bet on AI for customer service

Netomi, the San Francisco-based startup building AI systems for enterprise customer service, said Thursday that it has raised $110 million in new funding in a round led by Accenture Ventures, with participation from Adobe Ventures, WndrCo, Silver Lake Waterman, NAVER Ventures, Metis Strategy and Fin Capital. Jeffrey Katzenberg, managing partner of WndrCo and co-founder of DreamWorks, has joined the company's board. The round builds on early backing from a roster of AI luminaries that includes OpenAI co-founder Greg Brockman, Google DeepMind co-founder Demis Hassabis and Microsoft AI CEO Mustafa Suleyman. On its face, the financing is another large AI round in a market still awash in capital. But the deal is more revealing than that. It suggests that a new line is being drawn inside enterprise AI — not between companies that have a chatbot and companies that do not, but between companies that can show AI works in the messy, brittle, heavily governed environments where large businesses a

General AI
VentureBeat AI
AI and the Future of News 2026

Article URL: https://reutersinstitute.politics.ox.ac.uk/news/ai-and-future-news-2026-what-we-learnt-about-its-impact-newsrooms-fact-checking-and-news
Comments URL: https://news.ycombinator.com/item?id=47971364
Points: 3
Comments: 0

General AI
Hacker News
Elon Musk's 7 biggest stumbles on the stand at OpenAI trial

Elon Musk spent three days testifying as the first witness in his trial against OpenAI.

General AI
Ars Technica AI
Show HN: MCP Servers Can Fix the Biggest Problem with AI Coding Assistants

Article URL: https://medium.com/@xcf.seetan/how-mcp-servers-can-fix-the-biggest-problem-with-ai-coding-assistants-254a848590c7
Comments URL: https://news.ycombinator.com/item?id=47967179
Points: 2
Comments: 0

General AI
Hacker News