AIToday

Microsoft releases ASSERT, an open source framework that uses AI to turn natural-language descriptions of intended AI behavior into automated tests.

TechCrunch AI9h ago3 min read
Microsoft releases ASSERT, an open source framework that uses AI to turn natural-language descriptions of intended AI behavior into automated tests.

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    Microsoft on Tuesday unveiled ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), an open source framework designed to simplify testing of application-specific AI behavior by converting plain-language descriptions of goals, policies, or intended behaviors into scored test cases.

  2. 2

    The framework takes natural-language descriptions, structures them into acceptable and unacceptable behaviors, generates problem scenarios and test cases, runs them against the target system, and scores the results. Developers can also customize evaluations by providing system context, tools, and constraints; ASSERT records the AI system's decision paths so developers can inspect where failures occur.

  3. 3

    According to Microsoft's Sarah Bird, chief product officer of Responsible AI, ASSERT addresses a gap in broader evaluations by focusing on application-specific dimensions. The framework can be used when systems are being built, after deployment, and for continuous monitoring.

  4. 4

    The release reflects a broader shift in the AI industry toward repeatable testing and regression checks, with efforts like Stanford's HELM, MLCommons' AILuminate, and evaluation groups like METR developing benchmarks to measure model behavior under different conditions.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →