Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Disney Legend Peabo Bryson Dies at 75 After Stroke

    June 3, 2026

    El jefe de la UCO sugiere sin pruebas una “influencia política superior” en la creación de la plaza que ganó David Sánchez

    June 3, 2026

    aespa And Zedd Deliver An Unexpected Crossover On ‘Walk My Way’

    June 3, 2026
    Facebook X (Twitter) Instagram
    Select Language
    Facebook X (Twitter) Instagram
    NEWS ON CLICK
    Subscribe
    Wednesday, June 3
    • Home
      • United States
      • Canada
      • Spain
      • Mexico
    • Top Countries
      • Canada
      • Mexico
      • Spain
      • United States
    • Politics
    • Business
    • Entertainment
    • Fashion
    • Health
    • Science
    • Sports
    • Travel
    NEWS ON CLICK
    Home»Science & Technology»US Science & Tech»New Microsoft tool lets devs spin up AI behavior tests using text descriptions
    US Science & Tech

    New Microsoft tool lets devs spin up AI behavior tests using text descriptions

    News DeskBy News DeskJune 2, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    New Microsoft tool lets devs spin up AI behavior tests using text descriptions
    Share
    Facebook Twitter Pinterest Email Copy Link

    AI researchers and labs have advanced by leaps and bounds in evaluating AI models for everything from safety and compliance to sycophancy and alignment. But it appears companies and developers are faced with a new, specific need: making sure their AI system behaves as intended for their specific product or service.

    In a bid to make that testing process simpler, Microsoft on Tuesday took the wraps off ASSERT, short for Adaptive Spec-driven Scoring for Evaluation and Regression Testing.

    The open source framework, Microsoft says, makes evaluating application-specific AI behavior easy by using AI to turn high-level, natural-language descriptions of goals, policies, or intended behaviors into thorough, scored tests that can be investigated.

    ASSERT takes plain-language descriptions of an AI model’s expected behavior and policies, turns them into a structured set of acceptable and unacceptable behaviors, generates problem scenarios and test cases, runs them against the target system, and scores the results. It can also record the paths the AI system takes, including intermediate actions and tool calls, so developers can inspect where failures happen.

    Devs can provide system context, tools, and constraints, too, if they want to further customize what the evaluations cover.

    For example, a developer could specify that a document research AI agent shouldn’t send emails to people outside the company, and it should limit confidential information to C-level executives and provide concise summaries with prior context in mind. ASSERT will use those rules to generate test cases that check whether the system follows those rules on an ongoing basis.

    Image Credits:Microsoft

    The framework, according to Microsoft, fills a gap that broader, more general evaluations cannot when AI models are intended to behave in a manner that is shaped by an application or product’s context, policies, and tools.

    “One of the things we’ve learned is that evaluations are absolutely critical to making good decisions,” said Sarah Bird, chief product officer of Responsible AI at Microsoft. “Because if you don’t understand the behavior of the AI system, it’s really hard to know if it’s meeting your organization’s bar … What we found is that if you really want to have a trustworthy system, you should evaluate many more dimensions that are application-specific.”

    Bird said ASSERT can be used to evaluate systems when they’re being built, after deployment, and even for continuous monitoring.

    The release comes amidst a gradual but broader shift in the AI industry. As models grow more capable, researchers are focusing on repeatable testing and regression checks, with Stanford’s HELM, MLCommons’ AILuminate, and evaluation groups like METR rolling out benchmarks to measure how models behave under different conditions.

    When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

    ai evaluations AI regression testing Microsoft
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    News Desk
    • Website

    News Desk is the dedicated editorial force behind News On Click. Comprised of experienced journalists, writers, and editors, our team is united by a shared passion for delivering high-quality, credible news to a global audience.

    Related Posts

    US Science & Tech

    Google Shares Fitbit Air Blueprints So You Can 3D Print Your Own Accessories

    June 3, 2026
    US Science & Tech

    Poland Wants To Ban Phones And Smartwatches In Schools

    June 3, 2026
    US Science & Tech

    Meta Will Reportedly Let Employees Take 30-Minute Breaks From Its Tracking Program

    June 3, 2026
    US Science & Tech

    Squishmallows, dentures, and an ‘I Heart Hot Dads’ bag: Uber has found thousands of items left in robotaxis

    June 2, 2026
    US Science & Tech

    Until Dawn 2 Looks Like Cabin In The Woods, But In A Jungle

    June 2, 2026
    US Science & Tech

    Way Of The Sword Arrives September 25 But There’s A Demo Today

    June 2, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Don't Miss

    Disney Legend Peabo Bryson Dies at 75 After Stroke

    News DeskJune 3, 20260

    Peabo Bryson’s unmistakable voice soundtracked some of Disney’s most beloved love stories, but now fans…

    El jefe de la UCO sugiere sin pruebas una “influencia política superior” en la creación de la plaza que ganó David Sánchez

    June 3, 2026

    aespa And Zedd Deliver An Unexpected Crossover On ‘Walk My Way’

    June 3, 2026

    Zara Owner Inditex Defies Consumer Gloom With Strong Sales

    June 3, 2026
    Tech news by Newsonclick.com
    Top Posts

    Disney Legend Peabo Bryson Dies at 75 After Stroke

    June 3, 2026

    Cinco recetas con huevo perfectas para desayunar, comer o cenar

    May 4, 2026

    The future looks bright as Canadian business travel continues to prove its resilience

    May 4, 2026

    Christina Hendricks Marks Another Year with a Caption That Won’t Give Anything Away

    May 4, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Editors Picks

    Disney Legend Peabo Bryson Dies at 75 After Stroke

    June 3, 2026

    El jefe de la UCO sugiere sin pruebas una “influencia política superior” en la creación de la plaza que ganó David Sánchez

    June 3, 2026

    aespa And Zedd Deliver An Unexpected Crossover On ‘Walk My Way’

    June 3, 2026

    Zara Owner Inditex Defies Consumer Gloom With Strong Sales

    June 3, 2026
    About Us

    NewsOnClick.com is your reliable source for timely and accurate news. We are committed to delivering unbiased reporting across politics, sports, entertainment, technology, and more. Our mission is to keep you informed with credible, fact-checked content you can trust.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube
    Latest Posts

    Disney Legend Peabo Bryson Dies at 75 After Stroke

    June 3, 2026

    El jefe de la UCO sugiere sin pruebas una “influencia política superior” en la creación de la plaza que ganó David Sánchez

    June 3, 2026

    aespa And Zedd Deliver An Unexpected Crossover On ‘Walk My Way’

    June 3, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Editorial Policy
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Advertise
    • Contact Us
    © 2026 Newsonclick.com || Designed & Powered by ❤️ Trustmomentum.com.

    Type above and press Enter to search. Press Esc to cancel.