Openai has Published a postmortem on the Recent Sycophancy Issues With the default ai model powering chatgpt, GPT-4o-Issues that forced the company to roll back an update to the model released late week.
Over the weekend, following the GPT-4o Model Update, Users on Social Media Noted that Chatgpt Began Responding in an overly validating and agreeable way. It Quickly Became a Meme. Users Posted Screenshots of Chatgpt Applauding All Sorts of Problematic, Dangerous decisions and Ideas,
According to openai, the update, which was intended to make the model’s default personality “Feel more intuitive and effective,” was informed too much by “Short-term feedback” For how users ‘interactions with chatgpt evolve over time.’
“As a result, gpt‑4o skewed towards responses that were overly supported but dishinuous,” Wrote openai in a blog post. “Sycophantic interactions can be uncomfortable, uncomforting, and cause distress.
Openai Says it’s implementing several fixes, including refining its core model training techniques and system prompts to explicitly Steer GPT-4O Away from Sycophancy. The company is also building more safety guardrails to “Increase [the model’s] Honesty and transparency, “It says.
Openai also say that it’s exploring ways to allow users to give “real-time feedback” to “directly influence their interactions” with chatgpt and choose from multiplitia. “
,[W]e’re exploring new ways to incorporate broader, democratic feedback into chatgpt’s default behavior, “The company Wrote in its blog post. Chatgpt behaves and, to the extent that it is safe and feasible, make adjustments if they don’t agree with the default behavior. “