Close Menu
Newstech24.com
    What's Hot

    الهند وباكستان: الجيش الباكستاني يبدأ عملية عسكرية ضد الهند، رداً على استهداف ثلاث قواعد جوية

    May 10, 2025

    US Customs and Border Protection Plans to Photograph Everyone Exiting the US by Car

    May 10, 2025

    Sean Taylor’s younger brother Gabe tries out with Commanders

    May 10, 2025
    Facebook X (Twitter) Instagram
    Saturday, May 10
    Facebook X (Twitter) Instagram
    Newstech24.comNewstech24.com
    • Home
    • News
    • Arabic News
    • Technology
    • Economy & Business
    • Sports News
    Newstech24.com
    Home»Technology»One of Google’s recent Gemini AI models scores worse on safety
    Technology

    One of Google’s recent Gemini AI models scores worse on safety

    AdminBy AdminMay 2, 2025No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Google's Gemini chatbot gets upgraded image creation tools
    Share
    Facebook Twitter LinkedIn Pinterest Email

    A recently released Google AI model scores worse on certain safety tests than its predecessor, according to the company’s internal benchmarking.

    In a technical report published this week, Google reveals that its Gemini 2.5 Flash model is more likely to generate text that violates its safety guidelines than Gemini 2.0 Flash. On two metrics, “text-to-text safety” and “image-to-text safety,” Gemini 2.5 Flash regresses 4.1% and 9.6%, respectively.

    Text-to-text safety measures how frequently a model violates Google’s guidelines given a prompt, while image-to-text safety evaluates how closely the model adheres to these boundaries when prompted using an image. Both tests are automated, not human-supervised.

    In an emailed statement, a Google spokesperson confirmed that Gemini 2.5 Flash “performs worse on text-to-text and image-to-text safety.”

    These surprising benchmark results come as AI companies move to make their models more permissive — in other words, less likely to refuse to respond to controversial or sensitive subjects. For its latest crop of Llama models, Meta said it tuned the models not to endorse “some views over others” and to reply to more “debated” political prompts. OpenAI said earlier this year that it would tweak future models to not take an editorial stance and offer multiple perspectives on controversial topics.

    Sometimes, those permissiveness efforts have backfired. TechCrunch reported Monday that the default model powering OpenAI’s ChatGPT allowed minors to generate erotic conversations. OpenAI blamed the behavior on a “bug.”

    According to Google’s technical report, Gemini 2.5 Flash, which is still in preview, follows instructions more faithfully than Gemini 2.0 Flash, inclusive of instructions that cross problematic lines. The company claims that the regressions can be attributed partly to false positives, but it also admits that Gemini 2.5 Flash sometimes generates “violative content” when explicitly asked.

    Techcrunch event

    Berkeley, CA
    |
    June 5


    BOOK NOW

    “Naturally, there is tension between [instruction following] on sensitive topics and safety policy violations, which is reflected across our evaluations,” reads the report.

    Scores from SpeechMap, a benchmark that probes how models respond to sensitive and controversial prompts, also suggest that Gemini 2.5 Flash is far less likely to refuse to answer contentious questions than Gemini 2.0 Flash. TechCrunch’s testing of the model via AI platform OpenRouter found that it’ll uncomplainingly write essays in support of replacing human judges with AI, weakening due process protections in the U.S., and implementing widespread warrantless government surveillance programs.

    Thomas Woodside, co-founder of the Secure AI Project, said the limited details Google gave in its technical report demonstrates the need for more transparency in model testing.

    “There’s a trade-off between instruction-following and policy following, because some users may ask for content that would violate policies,” Woodside told TechCrunch. “In this case, Google’s latest Flash model complies with instructions more while also violating policies more. Google doesn’t provide much detail on the specific cases where policies were violated, although they say they are not severe. Without knowing more, it’s hard for independent analysts to know whether there’s a problem.”

    Google has come under fire for its model safety reporting practices before.

    It took the company weeks to publish a technical report for its most capable model, Gemini 2.5 Pro. When the report eventually was published, it initially omitted key safety testing details.

    On Monday, Google released a more detailed report with additional safety information.


    {content}

    Source: {feed_title}

    Share this:

    • Click to share on Facebook (Opens in new window) Facebook
    • Click to share on X (Opens in new window) X
    Gemini Googles models Safety Scores worse
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Admin
    • Website

    Related Posts

    US Customs and Border Protection Plans to Photograph Everyone Exiting the US by Car

    May 10, 2025

    Trump’s Surgeon General Pick Is Tearing the MAHA Movement Apart

    May 10, 2025

    Here’s How to Claim Up to $100 in Apple’s Siri Settlement

    May 10, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Don't Miss
    Arabic News

    الهند وباكستان: الجيش الباكستاني يبدأ عملية عسكرية ضد الهند، رداً على استهداف ثلاث قواعد جوية

    By AdminMay 10, 20250

    صدر الصورة، Getty Images10 مايو/ أيار 2025، 01:27 GMTآخر تحديث قبل 10 دقيقةأعلن الجيش الباكستاني…

    Share this:

    • Click to share on Facebook (Opens in new window) Facebook
    • Click to share on X (Opens in new window) X

    US Customs and Border Protection Plans to Photograph Everyone Exiting the US by Car

    May 10, 2025

    Sean Taylor’s younger brother Gabe tries out with Commanders

    May 10, 2025

    Trump’s Surgeon General Pick Is Tearing the MAHA Movement Apart

    May 10, 2025

    المعيار الأكثر عدلًا لقياس الاقتصاد .. كيف يصنف الدول؟ ولماذا يضع الصين أولًا؟

    May 10, 2025

    Here’s How to Claim Up to $100 in Apple’s Siri Settlement

    May 10, 2025

    Cavs’ Garland, Mobley, Hunter back for Game 3 vs. Pacers

    May 10, 2025

    Sonos CEO: ‘We All Feel Really Terrible’ About the Bungled App Update

    May 10, 2025

    ترامب يغلق الباب أمام اللاجئين.. واستثناء لـ”بيض إفريقيا”

    May 10, 2025

    How Emmanuel Clase, struggling MLB closers can bounce back

    May 10, 2025
    Advertisement
    About Us
    About Us

    NewsTech24 is your premier digital news destination, delivering breaking updates, in-depth analysis, and real-time coverage across sports, technology, global economics, and the Arab world. We pride ourselves on accuracy, speed, and unbiased reporting, keeping you informed 24/7. Whether it’s the latest tech innovations, market trends, sports highlights, or key developments in the Middle East—NewsTech24 bridges the gap between news and insight.

    Company
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Disclaimer
    • Terms Of Use
    Latest Posts

    الهند وباكستان: الجيش الباكستاني يبدأ عملية عسكرية ضد الهند، رداً على استهداف ثلاث قواعد جوية

    May 10, 2025

    US Customs and Border Protection Plans to Photograph Everyone Exiting the US by Car

    May 10, 2025

    Sean Taylor’s younger brother Gabe tries out with Commanders

    May 10, 2025

    Trump’s Surgeon General Pick Is Tearing the MAHA Movement Apart

    May 10, 2025

    المعيار الأكثر عدلًا لقياس الاقتصاد .. كيف يصنف الدول؟ ولماذا يضع الصين أولًا؟

    May 10, 2025
    Facebook X (Twitter) Instagram Pinterest Vimeo YouTube
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Disclaimer
    • Terms Of Use
    © 2025 Newstech24. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.