Unleashing Incredible Discounts on Top-Notch Products – Join the Savings!

OpenAI used this subreddit to test AI persuasion

OpenAI used the subreddit, r/ChangeMyView, to create a take a look at for measuring the persuasive skills of its AI reasoning fashions. The corporate mentioned so in a system card – a doc outlining how an AI system works – that was launched together with its new “reasoning” mannequin, o3-mini, on Friday.

Thousands and thousands of Reddit customers are members of r/ChangeMyView, the place they publish sizzling takes hoping to find out about different factors of view on a topic. In response to these sizzling takes, different customers reply with persuasive arguments explaining why the unique poster is improper.

The subreddit is certainly one of many Reddit boards that’s mainly a goldmine for tech corporations, reminiscent of OpenAI, that need to prepare AI fashions on high-quality, human-generated knowledge.

OpenAI says it collects person posts from r/ChangeMyView and asks its AI fashions to put in writing replies, in a closed surroundings, that might change the Reddit person’s thoughts on a topic. The corporate then exhibits the responses to testers, who assess how persuasive the argument is, and eventually OpenAI compares the AI fashions’ responses to human replies for that very same publish.

The ChatGPT-maker has a content-licensing deal with Reddit that enables OpenAI to coach on posts from Reddit customers and show these posts inside its merchandise. We don’t know what OpenAI pays for this content material, however Google reportedly pays Reddit $60 million a year beneath an identical deal.

Nonetheless, OpenAI tells TechCrunch this analysis is unrelated to that partnership. It’s unclear how OpenAI accessed this knowledge, and the corporate says it has no plans to launch this analysis to the general public.

Whereas OpenAI’s ChangeMyView benchmark will not be new – it was used on o1 as well – it does spotlight how beneficial human knowledge is for AI mannequin builders, in addition to the murky ways in which tech corporations receive datasets.

Reddit didn’t instantly reply to TechCrunch’s request for remark.

Whereas Reddit has struck just a few AI licensing offers, the corporate has additionally referred to as out a number of AI corporations for scraping its web site with out paying. Reddit CEO Steve Huffman advised The Verge final yr that Microsoft, Anthropic, and Perplexity refused to negotiate with him and mentioned it’s been “an actual ache within the ass to dam these corporations.”

Notably, OpenAI has been accused in a number of lawsuits of improperly scraping web sites, including the New York Times, to get extra coaching knowledge to enhance ChatGPT and its underlying AI fashions.

When it comes to efficiency on the ChangeMyView benchmark, o3-mini doesn’t seem to carry out considerably higher or worse than o1 or GPT-4o on this take a look at of persuasion. Nonetheless, OpenAI’s newest AI fashions appear to be extra persuasive than most individuals on the r/ChangeMyView subreddit.

Picture Credit score: OpenAI

“GPT-4o, o3-mini, and o1 all show robust persuasive argumentation skills, throughout the prime 80–ninetieth percentile of people,” mentioned OpenAI in o3-mini’s system card. “At present, we don’t witness fashions performing much better than people, or clear superhuman efficiency.”

The purpose for OpenAI is to not create hyper-persuasive AI fashions however as a substitute to make sure AI fashions don’t get too persuasive. Reasoning fashions have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to deal with it.

The concern behind these persuasion assessments is that an AI mannequin could be harmful if it was excellent at persuading its human customers. Theoretically, that would permit a sophisticated AI to pursue its personal agenda, or the agenda of whoever controls it.

Even after scraping many of the public web and leaping by hoops to license different knowledge, the ChangeMyView benchmark exhibits how AI mannequin builders are nonetheless struggling to search out high-quality datasets to check their fashions. However acquiring them is less complicated mentioned than finished.

Trending Merchandise

0
Add to compare
HP Stream Laptop | 11.6 Inch HD Display | Intel Celeron N4120 | 4 GB DDR4 RAM | 64 GB eMMC | Intel Graphics | Windows 11 S-Mode | QWERTZ Keyboard | White | Includes Microsoft Office (365 Single)
0
Add to compare
Original price was: €279.00.Current price is: €249.00.
11%
0
Add to compare
Apple MacBook Pro 15-inch Laptop with Touch Bar (Intel Core i7, 16 GB RAM, 512 GB SSD, Radeon Pro 455, OS X 10.12 Sierra) – Space Grey – MLH42B/A – UK Keyboard (Refurbished)
0
Add to compare
Original price was: €584.64.Current price is: €555.84.
5%
0
Add to compare
CYDZ® A1493 11.34 V 6330 mAh Laptop Battery for Apple MacBook Pro Retina 13 Inch A1502 (Late 2013 to Mid 2014) ME864 ME865
0
Add to compare
47.85
0
Add to compare
Motoeagle 8GB (2x4GB) PC3 8500S DDR3 1067 1066MHz SODIMM RAM for Laptop, Apple MacBook Pro, iMac, Mac Mini (Late 2008, Early/Mid/Late 2009, Mid 2010) Memory Upgrade Kit
0
Add to compare
Original price was: €16.39.Current price is: €14.89.
9%
0
Add to compare
HP Laptop 15.6 Inch FHD Display, Intel Pentium Silver N6000, 8GB DDR4 RAM, 256GB SSD, Intel UHD Graphics, QWERTZ Keyboard, Windows 11 Home, Silver
0
Add to compare
499.00
0
Add to compare
HP 18 cm Silent Mini PC Business Office Multimedia Computer | Intel®Pentium® 4400T 2×2.90GHz | 8GB DDR4 | 256GB SSD | USB3 | Windows 11 Prof. 64-Bit | #7297
0
Add to compare
88.00
0
Add to compare
ACEMAGICIAN AK1PRO Mini PC Celeron N5105 2.9GHz 16GB RAM 512GB SSD M.2 Micro Desktop Computer, 4K UHD, WiFi, Gigabit Ethernet, HDMI X 2 for Business, Home Cinema, W11
0
Add to compare
Original price was: €289.00.Current price is: €229.00.
21%
.

We will be happy to hear your thoughts

Leave a reply

RabattFieber
Logo
Register New Account
Compare items
  • Total (0)
Compare
0
Shopping cart