Artificial intelligence is becoming increasingly versatile – it generates images, writes poetry and builds apps. Yet one key limitation remains: today’s systems struggle to evolve beyond their initial training. That’s exactly where a new concept from the Massachusetts Institute of Technology (MIT) comes in. Called SEAL, short for Self-Adapting Language Models, the framework enables large language models to behave more like learning beings: it allows them to process new information, generate their own insights and update their knowledge in real time – without relying on external datasets or extensive developer intervention. The research paper was published on June 12 on arXiv.
Continuous learning without developer intervention
“Especially in companies, it is not enough to simply retrieve data – systems must be able to adapt continuously,” says MIT PhD student Jyothish Pari. SEAL is designed to do exactly that, using a continuous two-step process. First, the model summarizes new information, generates training examples from it and proposes how its internal settings should be updated. These proposed changes are referred to as “self-edits.”
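To make the idea concrete, here is a minimal sketch of what step one might look like in code. This is not the researchers’ implementation: the function names `llm_generate` and `propose_self_edit` are hypothetical, and the prompts are purely illustrative.

```python
# Hypothetical sketch of step one: turning a new passage into a "self-edit",
# i.e. a summary plus synthetic training examples written by the model itself.

def llm_generate(prompt: str) -> str:
    """Placeholder for a call to the underlying language model."""
    raise NotImplementedError("connect this to an actual model")

def propose_self_edit(passage: str) -> dict:
    """Ask the model to restate a passage and write practice examples for it."""
    summary = llm_generate(
        "Summarize the key facts in the following passage:\n" + passage
    )
    examples = llm_generate(
        "Write question-answer pairs that test these facts:\n" + summary
    )
    # The self-edit is simply the text the model will later fine-tune
    # itself on (step two), bundled with the source summary.
    return {"summary": summary, "training_examples": examples}
```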
The system then immediately puts these self-edits to the test: it undergoes a brief retraining pass with the new adjustments and is evaluated to see whether its responses actually improve. SEAL retains a change only if the results show a clear performance gain. Comparative tests confirm the effectiveness of this method: in a question-and-answer quiz without supporting text, the accuracy of the Qwen 2.5-7B model rises from 33.5% to 47%. On the more challenging ARC puzzles – logic-based tasks from the Abstraction & Reasoning Corpus – performance even climbs to 72.5%, more than triple the model’s original score.
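Sketched in the same spirit, step two could be an accept-or-reject loop like the one below. Again, this is an illustration rather than the published code; `finetune_on`, `evaluate_accuracy` and `apply_self_edit_if_better` are stand-in names for the assumed training and scoring machinery.

```python
# Hypothetical sketch of step two: briefly fine-tune on the self-edit,
# measure whether answers improve, and keep the update only if they do.

def finetune_on(model, self_edit):
    """Return a copy of `model` after a short fine-tuning pass on the self-edit."""
    raise NotImplementedError("use your training framework of choice")

def evaluate_accuracy(model, eval_questions) -> float:
    """Fraction of held-out questions the model answers correctly."""
    raise NotImplementedError("score answers against reference solutions")

def apply_self_edit_if_better(model, self_edit, eval_questions):
    baseline = evaluate_accuracy(model, eval_questions)
    candidate = finetune_on(model, self_edit)
    improved = evaluate_accuracy(candidate, eval_questions)
    # Retain the change only when it yields a clear performance gain;
    # otherwise the original model is kept unchanged.
    return candidate if improved > baseline else model
```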
Thanks to this cycle, SEAL behaves almost like a thinking entity: whenever new facts or questions arise, the model “reflects” on what matters, generates its own examples and adjusts its settings to better apply what it has learned. Since this process runs continuously, the AI is always learning. It no longer relies on separate developer fine-tuning but instead uses incoming texts as training material – generating its own data on the fly.
SEAL unlocks several possibilities at once. In the future, chatbots could naturally adapt to users' personal preferences without needing to send sensitive data to external servers. Development and research tools could also evolve more independently – adjusting to shifting project requirements without having to be retrained each time. And even if publicly available text data becomes scarce, SEAL can generate its own training material through self-created examples, offering a smart way to sidestep potential data shortages.
High potential, but not without hurdles
Although SEAL holds significant promise for advancing AI development, the researchers point to three main challenges:
- First, there's the issue of catastrophic forgetting: as the model continuously integrates new self-edits, its ability to perform earlier tasks gradually declines. The study already shows early signs of this effect.
- Second, the computational cost is substantial, as each self-edit requires a brief fine-tuning step. According to the study, a full cycle takes between 30 and 45 seconds, significantly increasing the operational cost of running large models.
- Third, verifying the accuracy of self-edits remains a challenge. The performance tests primarily assess how convincing an answer sounds, rather than whether it is actually correct. Users on Reddit have already raised concerns that the system might accept plausible-sounding but incorrect self-edits as improvements – and then internalize these errors permanently.