Large Language Models’ Emergent Abilities Are a Mirage

March 24, 2024

1

The original version of this story appeared in Quanta Magazine.

Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up—the larger the model, the better it got. But with other tasks, the jump in ability wasn’t smooth. The performance remained near zero for a while, then performance jumped. Other studies found similar leaps in ability.

The authors described this as “breakthrough” behavior; other researchers have likened

→ Continue reading at WIRED

Large Language Models’ Emergent Abilities Are a Mirage

Similar Articles

Most Popular

Large Language Models’ Emergent Abilities Are a Mirage

Similar Articles

AI is killing the web. Can anything save it?

GM’s Final EV Battery Strategy Copies China’s Playbook: Super Cheap Cells

Most Popular

ITC rules Insta360 infringed on GoPro patents

Police: Confrontation led up to shooting outside Fife motel that injured 2

Derek Jeter Unveils New BetMGM Partnership and Recalls Acting Advice Spike Lee Gave Him at 21 (EXCLUSIVE)