Elon Musk claims AI has exhausted real-world training data

Elon Musk: AI has consumed humanity's knowledge; synthetic data is the future (Image source: Dall-E 3)

Elon Musk claims AI exhausted the available real-world training data in 2024 and advocates synthetic data generation as the future of AI development. Major tech companies have already embraced this approach, though researchers warn of risks such as model collapse and bias amplification.

Nathan Ali, Published 01/13/2025

In a recent interview at CES, Elon Musk said that artificial intelligence has essentially used up all the real-world training data available, and he pointed to synthetic data generation as the primary way forward. His view echoes former OpenAI chief scientist Ilya Sutskever's remark that AI development has hit "peak data."

Musk believes the supply of human-produced data ran out in 2024. The Tesla CEO and xAI owner stressed that having AI generate its own training data is the most practical way to keep advancing the technology. In this approach, AI systems evaluate their own output and learn from it as they go.

Many large tech companies have already adopted synthetic data. Microsoft’s newly open-sourced Phi-4 model, for instance, was trained on a mix of synthetic and real-world data, and Google uses a similar strategy for its Gemma models. Anthropic’s Claude 3.5 Sonnet and Meta’s latest Llama series also rely on AI-generated data.

Meanwhile, analysts at Gartner estimated that around 60 percent of the data used in AI and analytics projects in 2024 would be synthetic. One big reason for the shift is cost: AI startup Writer says it spent about $700,000 developing its Palmyra X 004 model, far less than the estimated $4.6 million needed to build a comparable OpenAI model.

But synthetic data has its own problems. Researchers warn of “model collapse,” in which a model trained on AI-generated data becomes less inventive and more biased. The problem can arise when biases in the original dataset are amplified as the AI generates new training data from its own output.
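To make the idea of model collapse more concrete, here is a minimal toy sketch in Python. It is not how any of the companies mentioned above train their models. It assumes the simplest possible "model" (a bell curve described by a mean and a spread), and the function name `simulate_generations` and all parameter values are hypothetical choices for illustration. Each generation is fitted only to a small batch of data produced by the previous generation, and the spread of what the model can produce tends to shrink over time.

```python
import random
import statistics


def simulate_generations(generations=500, sample_size=10, seed=0):
    """Repeatedly fit a Gaussian to samples drawn from the previous fit.

    Returns the fitted standard deviation after each generation,
    starting with the original value at index 0.
    """
    rng = random.Random(seed)
    # Generation 0 stands in for the "real" data distribution (assumed values).
    mean, std = 0.0, 1.0
    history = [std]
    for _ in range(generations):
        # The current model produces a small synthetic dataset...
        samples = [rng.gauss(mean, std) for _ in range(sample_size)]
        # ...and the next model is fitted to that synthetic data only.
        mean = statistics.fmean(samples)
        std = statistics.stdev(samples)
        history.append(std)
    return history


if __name__ == "__main__":
    history = simulate_generations()
    for generation in (0, 10, 100, 500):
        print(f"generation {generation:3d}: spread = {history[generation]:.3g}")
```

The shrinking spread is a rough stand-in for a model becoming "less inventive": rare values from the original data are the first to disappear, because a small synthetic sample seldom contains them and every later generation only sees what the previous one produced.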

Source(s)

Fast Technology (in Chinese)

Nathan Ali - Tech Writer - 353 articles published on Notebookcheck since 2024
