Welcome back
Curated from 200+ sources across AI & machine learning

AI-powered marketing platform Nectar Social announced Thursday that it raised a $30 million Series A round led by Menlo Ventures and its Anthology Fund, which was created alongside Anthropic.



For AI systems to keep improving in knowledge work, they need either a reliable mechanism for autonomous self-improvement or human evaluators capable of catching errors and generating high-quality feedback. The industry has invested enormously in the first. It's giving almost no thought to what's happening to the second. I’d argue that we need to treat the human evaluation problem with just as much rigor and investment as we put into building the model capabilities themselves. New grad hiring at major tech companies has dropped by half since 2019. Document review, first-pass research, data cleaning, code review: Models handle these now. The economists tracking this call it displacement. The companies doing it call it efficiency. Neither are focusing on the future problem. Why self-improvement has limits in knowledge work The obvious pushback is reinforcement learning (RL). AlphaZero learned Go, chess, and Shogi at superhuman levels without human data and generated novel strategies in t

The vibes around the current AI boom aren't great, even in the tech industry.

Bill Ackman's Pershing Square fund has taken a major new position in Microsoft (NasdaqGS:MSFT), going public with the stake as other large investors such as TCI and the Gates Foundation have been reducing their holdings. The move comes as Microsoft ramps up its AI efforts beyond OpenAI, including interest in larger AI startup acquisitions, an expanded partnership with OneStream in enterprise finance, and visible traction for new AI products. Ackman cites Azure, Microsoft 365, evolving AI...

Article URL: https://thenewstack.io/streaming-ai-energy-efficiency/ Comments URL: https://news.ycombinator.com/item?id=48161187 Points: 1 # Comments: 0

Workday (WDAY) is back in focus after rolling out its Sana Self-Service Agent inside Microsoft 365 Copilot and launching an AI-focused solopreneur accelerator program, moves that sharpen attention on how its AI tools support everyday business work. See our latest analysis for Workday. Despite the fresh attention on its AI tools and partnerships, Workday’s share price has moved unevenly, with a 1-day share price return of 5.27%, a year-to-date share price return that is down 39.25%, and a...

The variety of terrible is impressive. After Sony drew some unwanted attention for a post demonstrating its AI Camera Assistant on the Xperia 1 XIII, it's trying to clarify how the feature works. The company says it doesn't edit photos, but makes suggestions based on lighting, depth, and subject. Point the camera at something, and it will give you four options for changing exposure, color, and background blur. In its product video, Sony says that the AI Camera Assistant will also suggest "the most photogenic angle." Though the clip only shows it suggesting that someone zoom in, which is not the same as suggesting a camera angle. The examples that Sony posted on X, while bette … Read the full story at The Verge.

Lake Tahoe, Silicon Valley's favorite ski spot, is about to get hit with higher energy prices as AI drives demand for electricity.

Article URL: https://logicalintelligence.com/blog/energy-based-model-sudoku-demo Comments URL: https://news.ycombinator.com/item?id=48160890 Points: 1 # Comments: 0

Article URL: https://thinkpol.ca/2026/05/15/ai-code-looks-clean-thats-the-trap/ Comments URL: https://news.ycombinator.com/item?id=48161224 Points: 1 # Comments: 1

Article URL: https://fortune.com/2026/04/23/goldman-sachs-ai-world-model-missing-link/ Comments URL: https://news.ycombinator.com/item?id=48160562 Points: 1 # Comments: 2

You can seek explicit guidance from AI, or you can use it as a tool for critical engagement.

We watched Snap lose to Meta's platform dominance and see the same pattern repeating in AI—only this time, the stakes aren't market share.

“I’ve got one hand on the keyboard, one hand down below,” an artist who role-plays with their chatbot tells WIRED. But some asexual advocates aren’t thrilled about the association.

In the final week of the Musk v. Altman trial, lawyers traded blows over Elon Musk’s and OpenAI CEO Sam Altman’s credibility. Altman was grilled on his alleged history of lying and self-dealing involving companies that do business with OpenAI. But he fired back, painting Musk as a power-seeker who wanted to control the development…

Anthropic is raising another $30 billion just three months after a round of the same size. The AI lab's valuation jumps to $900 billion, surpassing rival OpenAI for the first time. Fueling the surge: annualized revenue approaching $45 billion, a fivefold increase since the end of 2024. The article Anthropic's $900 billion valuation would make it more valuable than OpenAI for the first time appeared first on The Decoder.

The Trump administration is being pressed to quickly deliver a plan to address AI-enabled attacks.
AI news from 200+ sources
Get Started Free
ArXiv is doing more to crack down on the careless use of large language models in scientific papers.

Researchers at Carnegie Mellon University built a new benchmark that measures how far AI agents can go when exploiting real vulnerabilities in Google's V8 engine. Mythos leads GPT-5.5 by a wide margin but costs twelve times as much. The article New benchmark shows Claude Mythos and GPT-5.5 can develop real browser exploits autonomously appeared first on The Decoder.

A new benchmark called WorldReasonBench tests video generators not on image quality, but on physical and logical plausibility. ByteDance's Seedance 2.0 leads the field ahead of Veo 3.1 and Sora 2, with commercial models scoring roughly twice as high as open-source alternatives. Logical reasoning remains the hardest category for every model by a wide margin. The jump from pixel generator to actual world model still hasn't happened. The article New benchmark confirms AI video generators look stunning but still can't reason about the world appeared first on The Decoder.

OpenAI's latest shakeup comes as the company reportedly plans to combine ChatGPT and its programming product Codex.

The company formerly known as Intercom just did something that no major customer service platform has attempted at scale: it built an AI agent whose sole job is to manage another AI agent. Fin Operator, announced Thursday at a live event in San Francisco, is a new AI-powered system designed specifically for the back-office teams that configure, monitor, and improve Fin, the company's customer-facing AI agent. Rather than replacing human support agents — which is what Fin itself does on the front lines — Operator targets the growing army of support operations professionals who spend their days updating knowledge bases, debugging conversation failures, and combing through performance dashboards. "Fin is an agent for your customers," Brian Donohue, the company's VP of Product, told VentureBeat in an exclusive interview ahead of the launch. "Operator is an agent for your support ops team. This is an agent for the back office team who manages Fin and then manages their human agents." The an

Once users connect their accounts, they will see a dashboard of their portfolio performance, spending, subscriptions, and upcoming payments.

Article URL: https://github.com/microsoft/AI-Engineering-Coach Comments URL: https://news.ycombinator.com/item?id=48161004 Points: 2 # Comments: 0

RLWRLD, a physical AI company developing robotics foundation models for dexterous manipulation, unveiled RLDX-1 at “Dexterity Night in SF”, introducing a model designed to help humanoid robots perform contact-rich tasks such as grasping, pouring and tool use. The company also reported benchmark results across humanoid tabletop, kitchen manipulation and real-world coffee-pouring evaluations, and said the […]

Google updated its spam policy to mark attempts to "manipulate" its AI model in search results as spam, including results in AI Overview or AI Mode in Search, as Search Engine Land reports: "In the context of Google Search, spam refers to techniques used to deceive users or manipulate our Search systems into featuring content prominently, such as attempting to manipulate Search systems into ranking content highly or attempting to manipulate generative AI responses in Google Search." Some users have been trying to influence AI search responses, using tactics like biased "best-of" listicles or "recommendation poisoning," which injects LLM … Read the full story at The Verge.

The MIPI Alliance, an international organization that develops specifications that standardize wired interfaces for mobile and other connected ecosystems, has announced the formation of a “Physical AI Birds of a Feather (BoF) group” dedicated to exploring upcoming technologies and trends in the physical AI market, with an initial focus on humanoids. The group will examine […]

All3, a European company developing a heavy-duty robotic platform for construction, has announced a $25 million seed funding round. The round is led by RTP Global with significant participation from SuperSeed, and additional investment from Begin Capital, s16vc and VNV Global. All3 has re-engineered every step of construction into one end-to-end process through three integrated […]