News

GPT-4 OpenAI’s Latest Model Claims to Outperform Humans in Exams

Mar 15, 2023

OpenAI, creator of ChatGPT and some other popular artificial intelligence, has announced the release of its latest AI model, the GPT-4. The company claims that this new AI model is a significant milestone in its deep learning research and that can outperform humans in many area. According to OpenAI, GPT-4 exhibits “human-level performance on various professional and academic benchmarks.” Here are the details…

OpenAI Unveils GPT-4, A Deep Learning Model That Outperforms Humans In Exams

According to OpenAI, GPT-4 is capable of passing a simulated bar exam in the top 10% of test takers, while its predecessor, GPT-3.5 (the basis of ChatGPT), only scored around the bottom 10%. GPT-4 also performed well on various other exams, like the SAT Math test (scoring 700 out of 800). However, it scored only 2 on the AP English Language and Composition exam (14th to 44th percentile).

It’s important to note that GPT-4 is a regurgitation engine that draws upon material it was trained on to address a query. Although it may recall details for exams, its accuracy can vary. OpenAI CEO Sam Altman acknowledged that GPT-4 is still flawed and limited, and may seem more impressive on first use than after spending more time with it.

GPT-4 is a large multimodal model designed to accept queries via text and image inputs, with answers returned in text. It is being made available initially via the wait-listed GPT-4 API and to ChatGPT Plus subscribers in a text-only capacity, as image-based input is still being refined. Despite the addition of a visual input mechanism, OpenAI is not being open about or providing visibility into the making of its model. The company has chosen not to release details about its size, how it was trained, nor what data went into the process.

In a live stream on YouTube, Greg Brockman, president and co-founder of OpenAI, demonstrated the difference between GPT-4 and GPT-3.5 by asking the models to summarize the OpenAI GPT-4 blog post in a single sentence where every word begins with the letter “G.” GPT-3.5 did not try, while GPT-4 returned “GPT-4 generates ground-breaking, grandiose gains, greatly galvanizing generalized AI goals.”

Brockman also had GPT-4 generate the Python code for a Discord bot, as well as HTML and JavaScript code for a hand-drawn mockup of a jokes website sent to Discord. Finally, Brockman set up GPT-4 to analyze 16 pages of US tax code to return the standard deduction for a couple, Alice and Bob, with specific financial circumstances. OpenAI’s model responded with the correct answer and an explanation of the calculations involved.

While GPT-4 is an impressive achievement, its development raises concerns about the lack of transparency and potential consequences of large-scale models like GPT-4.

RELATED:

(via)

GPT-4 OpenAI’s Latest Model Claims to Outperform Humans in Exams

OpenAI Unveils GPT-4, A Deep Learning Model That Outperforms Humans In Exams

Apple canceling car project is benefiting Xiaomi: Over 50% of Xiaomi SU7 owners are Apple...

Oppo Find X7 goes shiny White in China

Here’s when Samsung Unpacked 2024 event is happening, according to a leak

OpenAI Unveils GPT-4, A Deep Learning Model That Outperforms Humans In Exams

RELATED ARTICLESMORE FROM AUTHOR

GPT-4 Outperformed Junior & Trainee Eye Doctors on a Mock Exam

ChatGPT Gets Smarter for Premium Users with GPT-4 Turbo Upgrade

GPT-5 will Likely Come out in June, Red Team Testing Invites are being Sent Out

Apple canceling car project is benefiting Xiaomi: Over 50% of Xiaomi SU7 owners are Apple...

Oppo Find X7 goes shiny White in China

Here’s when Samsung Unpacked 2024 event is happening, according to a leak

RELATED ARTICLES MORE FROM AUTHOR