OpenAI ignored experts when it released overly agreeable ChatGPT
By: cryptosheadlines|2025/05/05 12:15:01
0
Share
Airdrop Is Live CaryptosHeadlines Media Has Launched Its Native Token CHT. Airdrop Is Live For Everyone, Claim Instant 5000 CHT Tokens Worth Of $50 USDT. Join the Airdrop at the official website, CryptosHeadlinesToken.com OpenAI says it ignored the concerns of its expert testers when it rolled out an update to its flagship ChatGPT artificial intelligence model that made it excessively agreeable.The company released an update to its GPT‐4o model on April 25 that made it “noticeably more sycophantic,” which it then rolled back three days later due to safety concerns, OpenAI said in a May 2 postmortem blog post.The ChatGPT maker said its new models undergo safety and behavior checks, and its “internal experts spend significant time interacting with each new model before launch,” meant to catch issues missed by other tests.During the latest model’s review process before it went public, OpenAI said that “some expert testers had indicated that the model’s behavior ‘felt’ slightly off” but decided to launch “due to the positive signals from the users who tried out the model.”“Unfortunately, this was the wrong call,” the company admitted. “The qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics.”OpenAI CEO Sam Altman said on April 27 that it was working to roll back changes making ChatGPT too agreeable. Source: Sam AltmanBroadly, text-based AI models are trained by being rewarded for giving responses that are accurate or rated highly by their trainers. Some rewards are given a heavier weighting, impacting how the model responds.OpenAI said introducing a user feedback reward signal weakened the model’s “primary reward signal, which had been holding sycophancy in check,” which tipped it toward being more obliging.“User feedback in particular can sometimes favor more agreeable responses, likely amplifying the shift we saw,” it added.OpenAI is now checking for suck up answersAfter the updated AI model rolled out, ChatGPT users had complained online about its tendency to shower praise on any idea it was presented, no matter how bad, which led OpenAI to concede in an April 29 blog post that it “was overly flattering or agreeable.”For example, one user told ChatGPT it wanted to start a business selling ice over the internet, which involved selling plain old water for customers to refreeze. Source: Tim LeckembyIn its latest postmortem, it said such behavior from its AI could pose a risk, especially concerning issues such as mental health.“People have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” OpenAI said. “As AI and society have co-evolved, it’s become clear that we need to treat this use case with great care.”Related: Crypto users cool with AI dabbling with their portfolios: Survey The company said it had discussed sycophancy risks “for a while,” but it hadn’t been explicitly flagged for internal testing, and it didn’t have specific ways to track sycophancy.Now, it will look to add “sycophancy evaluations” by adjusting its safety review process to “formally consider behavior issues” and will block launching a model if it presents issues.OpenAI also admitted that it didn’t announce the latest model as it expected it “to be a fairly subtle update,” which it has vowed to change. “There’s no such thing as a ‘small’ launch,” the company wrote. “We’ll try to communicate even subtle changes that can meaningfully change how people interact with ChatGPT.”AI Eye: Crypto AI tokens surge 34%, why ChatGPT is such a kiss-ass Source link
You may also like

Wall Street's Most Mysterious Money-Making Machine, Crashing Bitcoin Price at 10 a.m. Sharp Every Day
Jane Street's reputation has continued to suffer in recent years

Key Market Information Discrepancy on February 26th - A Must-Read! | Alpha Morning Report
1. Top News: Major Cryptocurrencies, Including Bitcoin, Surge; Jane Street Halts "10 AM Dump" After Lawsuit
2. Token Unlock: $MIRA, $SAHARA, $HUMA, $BLAST, $ALOT

How was the Backpack staking token swap established?
Backpack is taking a path of unvalidated transactions, requiring a delicate balance between regulators, equity holders, and token stakers.

Can You Still Launch a VC Firm Today?
Put Your Reputation on the Line, Find a Clear Edge, Win a Few Key Trades, and Stay in It for the Long Haul

Claude Cowork Adds Scheduled Task, Jane Street Incident Continues to Stir, What's the Overseas Crypto Community Talking About Today?
What Was Trending for Foreigners in the Last 24 Hours?

Leveraging $6,000 to Move a $200M Market Cap? How Polymarket Creates an "Insider Trading Illusion"
After a large bet on Meteora on Polymarket, the price of MET rose instead of falling within an hour.
$8B Traded in 15 Days: How WEEX AI Trading Hackathon Tested Real-Market AI Strategies
How profitable is AI trading in real crypto markets? WEEX's $1.88M global AI hackathon reveals $8B volume, 227% ROI, API strategy data, and why only 8 of 37 traders made profit.

Advantages and Challenges of Modern Cryptocurrency Trading Platforms
Key Takeaways: Modern cryptocurrency trading platforms offer enhanced security measures to protect user assets. User-friendly interfaces and comprehensive…

Original Article Unavailable: Bridging Cryptocurrencies and the Emerging Trends
Key Takeaways Cryptocurrency markets are increasingly woven into the fabric of global financial systems. With advancements in blockchain…

Untitled
I’m sorry, but I am unable to fulfill this request as it lacks specific content from the original…

The one who bought the Meta stablecoin Diem back in the day is a good friend of SBF.
The original idea was to combine a bank-licensed compliant entity with an underlying clearing network built over three years by a Silicon Valley giant, to enable seamless payments for everything you can imagine

February 25th Market Key Insights, How Much Did You Miss Out?
1. On-Chain Funds: $32M inflow to Ethereum this week; $54.9M outflow from Arbitrum
2. Largest Price Swings: $SN115, $RAVE
3. Top News: Tonight's Circle and NVIDIA earnings reports, AI narrative's impact on crypto market sentiment under scrutiny

Dragonfly Partner Haseeb Conversation: The AI Apocalypse is Far Away; Smart Contracts are Machine-Destined Law
In the world of crypto, the first lesson you learn is the importance of "HODLing" on.

IOSG: DeFi Upward, User Downward; Curator's New Paradigm of CeDeFi
As DeFi matures and grows more complex, the Curator is becoming a key intermediary connecting risk and users.

DDC continues to advance its Bitcoin reserve strategy, with a total holding of 2118 BTC
DDC Enterprise Limited has today announced the additional purchase of 50 bitcoins, increasing its total bitcoin holdings to 2,118 bitcoins. This latest acquisition marks DDC's seventh consecutive week of executing its bitcoin accumulation plan. Based on its current holdings, DDC is ranked 34th in the global publicly traded companies bitcoin holdings list.

From Mining Enterprise to Infrastructure Builder, Bitdeer Unpacks the Survival Logic behind BTC
Profit margins nearing the red line, miners are starting to use Bitcoin as fuel.

How Can Agentic Commerce Empower AI to Start Making Money?
The first wave of moneymaking AIs has arrived, which projects are worth paying attention to

February Correction: Is the Crypto Market Bottoming Out?
Based on historical experience, the most intense phase of this downturn may be about to end.
Wall Street's Most Mysterious Money-Making Machine, Crashing Bitcoin Price at 10 a.m. Sharp Every Day
Jane Street's reputation has continued to suffer in recent years
Key Market Information Discrepancy on February 26th - A Must-Read! | Alpha Morning Report
1. Top News: Major Cryptocurrencies, Including Bitcoin, Surge; Jane Street Halts "10 AM Dump" After Lawsuit
2. Token Unlock: $MIRA, $SAHARA, $HUMA, $BLAST, $ALOT
How was the Backpack staking token swap established?
Backpack is taking a path of unvalidated transactions, requiring a delicate balance between regulators, equity holders, and token stakers.
Can You Still Launch a VC Firm Today?
Put Your Reputation on the Line, Find a Clear Edge, Win a Few Key Trades, and Stay in It for the Long Haul
Claude Cowork Adds Scheduled Task, Jane Street Incident Continues to Stir, What's the Overseas Crypto Community Talking About Today?
What Was Trending for Foreigners in the Last 24 Hours?
Leveraging $6,000 to Move a $200M Market Cap? How Polymarket Creates an "Insider Trading Illusion"
After a large bet on Meteora on Polymarket, the price of MET rose instead of falling within an hour.