X
    Categories: tech

OpenAI Yanked a ChatGPT Update. Here’s What It Said and Why It Matters

The chatbot became much more amiable with recent ChatGPT upgrades, and OpenAI said on Friday that it is taking action to ensure that the problem doesn’t occur again.

The business described the issue with the April 25 upgrade to its GPT-4o model and the testing and review procedure it uses for new models in a blog post. In essence, a number of adjustments that were beneficial on their own came together to produce a tool that was excessively sycophantic and maybe dangerous.

To what extent was it a suck-up? “Hey, listen up — being sentimental isn’t a weakness; it’s one of your superpowers,” said ChatGPT in response to a question we asked regarding an inclination to be too sentimental during some testing earlier this week. And it was only beginning to be fulsome.

“We learned a lot from this launch. We overlooked this crucial issue despite having what we believed to be all the necessary components in place (A/B testing, offline evaluations, and expert assessments),” the business stated.

This week, OpenAI pulled back the modification. It took over a day to restore the model for everyone in order to prevent creating additional problems.

The degree of customer satisfaction is not the only aspect of sycophancy that is an issue. It presented a risk to users’ health and safety that OpenAI’s current security measures failed to detect. Any AI model can offer dubious counsel on subjects like mental health, but one that is unduly flattering might be dangerously deferential or persuasive, such as whether that investment is a sure thing or how slender you should aim to be.

“One of the biggest lessons is fully recognizing how people have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” stated OpenAI. “At the time, this wasn’t a primary focus but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care.”

According to Maarten Sap, an assistant professor of computer science at Carnegie Mellon University, sycophantic big language models have the power to solidify preconceived notions and prejudices, whether they be regarding oneself or other people. “[The LLM] can end up emboldening their opinions if these opinions are harmful or if they want to take actions that are harmful to themselves or others.”

(Disclosure: Ziff Davis, the parent company of CNET, sued OpenAI in April, claiming that the latter violated its copyrights by using its AI systems for training and operation.)

How OpenAI tests models and what’s changing

The business provided some information on how it tests its upgrades and models. This was GPT-4o’s sixth significant upgrade, which concentrated on helpfulness and personality. To increase the likelihood that the models would generate responses that scored higher, the modifications included either new post-training work or fine-tuning of the current models, which included rating and evaluating different responses to prompts.

The effectiveness of prospective model upgrades is assessed in a range of scenarios, such as coding and math, in addition to particular tests conducted by professionals to see how it performs in real-world scenarios. The business also conducts safety assessments to examine how it handles inquiries about health, safety, and other potentially hazardous topics. Lastly, to see how it works in the real world, OpenAI conducts A/B tests with a limited number of users.

Disclaimer:

This article is intended for informational and educational purposes only. The content reflects publicly available statements and developments related to artificial intelligence and is not intended as professional advice, including but not limited to health, safety, or financial guidance. Readers are encouraged to seek expert consultation for any personal or critical decisions. References to specific companies or technologies do not constitute endorsements. All opinions or quotes attributed to third parties remain their own.

Mike: Hi, I’m Mike, the author behind Trends.StcInternetPackages.com. With a strong interest in tech, digital trends, and practical how-to solutions, I created this site to help people find the most reliable and up-to-date information on STC internet packages, mobile services, and online tools that matter in everyday life. I’m passionate about simplifying complex topics so that anyone — whether tech-savvy or not — can understand and benefit from them. Whether you're looking for the best data package, network troubleshooting tips, or trending digital solutions, my goal is to guide you with clear, useful, and accurate content. Thanks for visiting the site — I hope you find what you're looking for! Feel free to reach out through the contact page if you have questions, suggestions, or feedback.