OpenAI has developed a new model, CriticGPT, to identify errors in code generated by ChatGPT, with the goal of making the output of large language models (LLMs) more accurate.
To improve model output, reinforcement learning from human feedback (RLHF) is usually used, in which humans evaluate the model’s output and the model is refined based on their judgments. This process is time-consuming and error-prone, especially for large models, and can let a high number of incorrect or unnecessary responses slip through.
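For readers unfamiliar with how those human judgments are collected, the Python sketch below shows the pairwise-comparison record at the core of RLHF. The names are hypothetical; OpenAI has not published its labeling code.

```python
from dataclasses import dataclass

@dataclass
class PreferenceLabel:
    prompt: str
    answer_a: str
    answer_b: str
    preferred: str  # "a" or "b", chosen by a human annotator

def record_preference(prompt: str, answer_a: str, answer_b: str,
                      choice: str) -> PreferenceLabel:
    """Store one human judgment comparing two model answers.

    In RLHF, many such pairwise preferences are collected and used to fit
    a reward model, which then steers reinforcement-learning fine-tuning.
    """
    assert choice in ("a", "b")
    return PreferenceLabel(prompt, answer_a, answer_b, choice)

label = record_preference(
    prompt="Write a function that reverses a string.",
    answer_a="def rev(s): return s[::-1]",
    answer_b="def rev(s): return reversed(s)",  # returns an iterator, not a str
    choice="a",
)
```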
OpenAI hopes to change that by building CriticGPT on GPT-4. “Reviewing ChatGPT’s code with the help of CriticGPT performs 60% better than reviewing without help,” say the creators of the new tool. CriticGPT is also said to detect hallucinations that people would not spot on their own.
The new model is trained on a dataset of code samples containing intentionally inserted bugs, paired with example feedback, which allows CriticGPT to detect not only common bugs but also rare ones.
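As an illustration of what such training data might look like, here is a minimal Python sketch. The names and the toy tampering function are assumptions, since OpenAI built its dataset by having human trainers insert bugs by hand.

```python
from dataclasses import dataclass

@dataclass
class CritiqueExample:
    original_code: str  # code believed to be correct
    tampered_code: str  # the same code with a bug inserted on purpose
    critique: str       # reference feedback pointing out the bug

def insert_off_by_one(code: str) -> str:
    """Toy 'tampering' step: turn a correct loop bound into an off-by-one.

    This string replacement only illustrates the shape of the data; the
    real bugs were written by human contractors, not generated like this.
    """
    return code.replace("range(n)", "range(n - 1)")

correct = (
    "def total(xs):\n"
    "    n = len(xs)\n"
    "    return sum(xs[i] for i in range(n))"
)
example = CritiqueExample(
    original_code=correct,
    tampered_code=insert_off_by_one(correct),
    critique="The loop stops at n - 1, so the last element is never summed.",
)
```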
Performance
To demonstrate CriticGPT’s performance, OpenAI compared the model with humans and found that it outperformed the average human code reviewer: critiques that spotted and explained errors were preferred over human-written critiques in 63 percent of cases. According to OpenAI, this is because the model was less nit-picky about the code and produced fewer false positives than the human reviewers.
OpenAI plans to integrate models like CriticGPT into the RLHF labeling pipeline to assist model trainers. Many of the results OpenAI is currently showing are still largely at the research stage.
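To make that idea concrete, the sketch below shows how a critic pass could pre-annotate code before a human trainer sees it. CriticGPT itself is not publicly available, so this example substitutes GPT-4 via OpenAI’s standard Python client, and the review prompt is an assumption rather than OpenAI’s actual setup.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def critique_code(code: str) -> str:
    """Ask a general-purpose model to act as a code critic (GPT-4 as a
    stand-in for CriticGPT, which is not exposed through the API)."""
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": "You are a code reviewer. Point out concrete bugs "
                        "in the code, quoting the relevant lines."},
            {"role": "user", "content": code},
        ],
    )
    return response.choices[0].message.content

# A human trainer would then review this critique alongside the code,
# rather than judging the raw model output unaided.
```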