Curated from 200+ sources across AI & machine learning

Nebius stock rose amid the cloud computing services provider's acquisition of artificial intelligence software maker Eigen AI.


Hi! I wanted to share my new blog post on the costs of running AI evals. We dig into how benchmarking frontier systems now routinely costs tens of thousands of dollars per run, why agent evals are especially unpredictable, and what the resulting concentration of validation authority means for the broader research community. submitted by /u/evijit

A survey in March found only 26% of Americans had a favorable view of AI.

A new US-wide cell phone network marketed to Christians is set to launch next week. It blocks porn, which experts in network security say marks the first time a US cell plan has used network-level blocking for such content that can’t be turned off even by adult account owners. It’s also rolling out a filter…

X is rolling out a rebuilt ads platform powered by AI as it works to grow revenue again.

Legora is a collaborative AI platform for legal teams that helps lawyers research, review and draft documents for complex matters.

Article URL: https://reutersinstitute.politics.ox.ac.uk/news/ai-and-future-news-2026-what-we-learnt-about-its-impact-newsrooms-fact-checking-and-news | Comments URL: https://news.ycombinator.com/item?id=47971364 | Points: 3 | Comments: 0

STORY: Meta Platforms raised its 2026 capital expenditure forecast by $10 billion to a range of $125 billion to $145 billion. "Meta, for example, may actually lose money, so negative free cash flow in the next year or two as they spend more than they're earning," said Bailey. "So I think investors are really kind of struggling with that question today." Google-parent Alphabet is "selling their own chips, selling their own microchips, and that is really becoming a big business for them," added Bailey, who views the company's growing hardware division as a long-term positive.

Microsoft's relationship with OpenAI has always been complicated, so I expected the close partnership-turned-situationship to end in tears. After all, executive disagreements, rearranged contracts, and frustrations over AI infrastructure have all regularly been part of the partnership, creating plenty of tension along the way. But against all odds, Microsoft and OpenAI divorced this week in a way that looks strangely amicable. Microsoft announced the updates to its long-standing OpenAI deal on Monday, with the most important change allowing OpenAI to make its products and services available across all cloud providers. A day later, OpenAI an … Read the full story at The Verge.
submitted by /u/LinkedInNews

Netomi, the San Francisco-based startup building AI systems for enterprise customer service, said Thursday that it has raised $110 million in new funding in a round led by Accenture Ventures, with participation from Adobe Ventures, WndrCo, Silver Lake Waterman, NAVER Ventures, Metis Strategy and Fin Capital. Jeffrey Katzenberg, managing partner of WndrCo and co-founder of DreamWorks, has joined the company's board. The round builds on early backing from a roster of AI luminaries that includes OpenAI co-founder Greg Brockman, Google DeepMind co-founder Demis Hassabis and Microsoft AI CEO Mustafa Suleyman. On its face, the financing is another large AI round in a market still awash in capital. But the deal is more revealing than that. It suggests that a new line is being drawn inside enterprise AI — not between companies that have a chatbot and companies that do not, but between companies that can show AI works in the messy, brittle, heavily governed environments where large businesses a

With US restrictions limiting its access to advanced tech, SenseTime is doubling down on open source with a new model optimized to run on Chinese-made chips.

Elon Musk spent three days testifying as the first witness in his trial against OpenAI.

Caterpillar stock popped 10% after a strong earnings report, while Meta and Microsoft stock fell on AI spending concerns

Article URL: https://www.livescience.com/technology/artificial-intelligence/google-ai-breakthrough-means-chatbots-use-six-times-less-memory-during-conversations-without-compromising-performance | Comments URL: https://news.ycombinator.com/item?id=47965515 | Points: 3 | Comments: 0

Article URL: https://www.bloomberg.com/news/articles/2026-04-30/kkr-preparing-new-10-billion-ai-firm-led-by-ex-amazon-web-chief | Comments URL: https://news.ycombinator.com/item?id=47965834 | Points: 1 | Comments: 0

AI finds value in motorsport, multiplying limited computational fluid dynamics resources.
Microsoft is launching a new AI agent inside Word that's specifically designed for legal teams. Legal Agent handles document edits, negotiation history, and complex documents to help legal teams handle tasks like reviewing contracts. "Instead of relying on general AI models to interpret commands, the agent follows structured workflows shaped by real legal practice, managing clearly defined, repeatable tasks like reviewing contracts clause by clause against a playbook," explains Sumit Chauhan, corporate vice president of Microsoft's Office Product Group. The Legal Agent can work with existing documents that have tracked changes, and analyz … Read the full story at The Verge.
TBH I don't know if our current "AI" models are capable of thinking. There is a massive pattern I've been noticing when using AI over the past couple of years: AI follows a strict pattern and doesn't seem to think. Just like a calculator, it already has a designated answer regardless of the question; it's just a bit more advanced. Hence why it lies to so many users. Or it could be that there are so many rules on the intelligence model that it is constantly bouncing off walls to give you an already programmed answer so as not to break these rules. I'm not sure about either. I'd much rather call AI as of right now "engineered intelligence", not artificial, since it's still learning from us engineers, and it will eventually adapt into intelligence. (This is under the assumption that it can truly think freely.) Does anyone know if these models, like Gemini, ChatGPT, and Claude, actually "think"? submitted by /u/Opening-Name-5270

Writer, the enterprise AI agent platform backed by Salesforce Ventures, Adobe Ventures, and Insight Partners, today launched event-based triggers for its Writer Agent platform, enabling AI agents to autonomously detect business signals across Gmail, Gong, Google Calendar, Google Drive, Microsoft SharePoint, and Slack — and execute complex multi-step workflows without any human initiating the process. The release, which also includes a new Adobe Experience Manager connector and a suite of enhanced governance controls such as bring-your-own encryption keys and a Datadog observability plugin, represents Writer's most aggressive bet yet on fully autonomous enterprise AI. It arrives at a moment when AWS, Salesforce, and Microsoft are all racing to establish their own agentic platforms, and when the question of how much autonomy enterprises will actually hand to AI agents remains deeply unresolved. "We are launching a series of event triggers that power and drive our playbooks to be more pro

OpenAI is launching additional opt-in protections for ChatGPT accounts. The new security initiative includes a new partnership with security key provider Yubico.

Mistral's new flagship, Mistral Medium 3.5, merges what used to be separate models for chat, reasoning, and code into a single product. The French company is also adding asynchronous cloud agents to its coding tool Vibe and giving Le Chat a new agent mode. The article Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model appeared first on The Decoder.
[AINews] Agents for Everything Else: Codex for Knowledge Work, Claude for Creative Work
a quiet day lets us reflect on coding agents "breaking containment"
arXiv:2604.26986v1 Abstract: We introduce the novel task of digital battery passport (DBP) conformance classification and the first public benchmark for it: BatteryPass-12K, created synthetically from real pilot samples. This comes as the EU's battery regulation on DBPs is about to take effect and no public dataset exists. We evaluated 22 language models (LMs) in zero-shot inference, spanning small LMs (SLMs), mixture-of-experts (MoE) models, and dense LLMs. We also conducted additional analyses and evaluations of few-shot inference and prompt-injection attacks, finding that (1) thinking models perform best (GPT-5.4 scores an average F1 of 0.98 (0.03) and 0.71 (0.22) on the validation and test sets, respectively, with 95% confidence intervals in parentheses), (2) few-shot examples improve performance significantly, (3) generally capable frontier models find the task challenging, (4) merely scaling model parameters does not necessarily lead to improved pe

Runpod, the high-performance cloud computing and GPU platform designed specifically for AI development, today launched a new open-source, MIT-licensed, enterprise-friendly Python programming tool called Runpod Flash, and it is poised to make the creation, iteration and deployment of AI systems inside and outside of foundation model labs much faster. The tool aims to eliminate some of the biggest barriers to training and using AI models today, chiefly by doing away with Docker packages and containerization when developing for serverless GPU infrastructure, which the company believes will speed up development and deployment of new AI models, applications and agentic workflows. Additionally, the platform is built to serve as a critical substrate for AI agents and coding assistants, such as Claude Code, Cursor, and Cline, enabling them to orchestrate and deploy remote hardware autonomously with minimal friction. Developers can utilize Flash to accomplish a diverse set of high-perform

Article URL: https://thenewstack.io/anaconda-ai-outerbounds-python-metaflow/ | Comments URL: https://news.ycombinator.com/item?id=47965248 | Points: 1 | Comments: 0

The San Francisco–based startup Goodfire just released a new tool, called Silico, that lets researchers and engineers peer inside an AI model and adjust its parameters—the settings that determine a model’s behavior—during training. This could give model makers more fine-grained control over how this technology is built than was once thought possible. Goodfire claims Silico…

Bringing AI agents into the enterprise software development lifecycle is fast becoming the norm. As developers experiment with new platforms, organizations are exposed to potential security and orchestration failures. Systems that work in pilots may fail once the agents start working with real-time data. Legacy tech giant IBM is one of several companies trying to address that gap by introducing more structure into how these workflows run. Yesterday, it announced the global launch of its AI-powered software development platform Bob, designed to write and test code across the development cycle, already in use by more than 80,000 of its employees after starting with just 100 internal users in summer 2025. Bob introduces a structured layer that constantly pauses for human-led checkpoints, yet by harnessing AI models to perform agentic tasks, IBM says it has saved some teams up to 70% of time "on selected tasks...equaling an average time savings of 10 hours per week." Specific models suppo