Skip to content(if available)orjump to list(if available)

Autonomous AI Agents Should Not Be Developed

geor9e

Too late. I added a 5 minute cron job for cursor AI's compose tab in agent mode that keeps replying "keep going, think of more fixes and features, random ideas as fine, do it all for me". I won't pull the plug.

upghost

This is a purely procedural question, not supporting or critiquing in any way-- other than this reads kind of like an editorial with the format of a scientific paper. The question is... are there rules about what constitutes a paper or can you just put whatever you want in there as long as you follow "scientific paper format"?

dhruvbatra

This looks like ICML formatting (and the submission deadline just passed).

ICML25 has an explicit call for position papers: https://icml.cc/Conferences/2025/CallForPositionPapers

upghost

Wow, great observation. Thank you. Makes sense. I'd never heard of a "position paper" before.

mark_l_watson

I really enjoy Margaret Mitchell‘s podcast (she is the first author on the paper), and perhaps I missed something important in the paper, but:

Shouldn’t we treat separately autonomous agent we write ourselves, or purchase to run on our own computers, on our own data and that use public APIs for data?

If Margaret is reading this thread, I am curious what her opinion is.

For autonomous agents controlled by corporations and governments, I mostly agree with the paper.

in3d

I'd recommend looking for other sources of information if you're relying on someone who co-authored the paper that introduced the most misleading and uninformed term of the LLM era: "stochastic parrot".

currymj

it was a pretty defensible term at the time the paper came out, in the context of how LLMs were being trained and used.

in this paper, it's clear that the authors don't think modern LLM-based systems are just stochastic parrots.

asdasdsddd

This has to be the least interesting paper I've ever read with the most surface level thinking.

> • Simple→Tool Call: Inaccuracy propagated to inappropriate tool selection.

> • Multi-step: Cascading errors compound risk of inaccurate or irrelevant outcomes.

> • Fully Autonomous: Unbounded inaccuracies may create outcomes wholly unaligned with human goals.

Just... lol

null

[deleted]

rtcode_io

Yet, we all know we will!

bdangubic

the best way to get people to stop doing X is to tell them not to do X. works so well with my kid :)

bamboozled

People are going to be developing these no matter what. Whether it wipes us out or not is just up to fate really.

esafak

We can constrain their use, as with nuclear materials.

roenxi

Despite doing a pretty decent job of containing the risk we're still on the clock until something terrible happens with nuclear war. Humanity appears to be well on track to killing millions to billions of people; rolling the dice relatively regularly waiting for a 1% chance to materialize.

If we only handle AI that well doom is probable. It has economic uses, unlike nuclear weapons, so there will be a thriving black market dodging the safety concerns.

fizx

Nuclear materials have the advantages of being rare, dangerous to handle, and hard to copy over the internet.

johanneskanybal

No not really. There's no power in the world that can restrain this in it's current form even mildly much less absolutly. Why do you think that would be even slightly possible?

esafak

For the same reason we can regulate other things? Encryption is regulated, for example. There "just" needs to be international co-operation, in the case of AI.

redeux

At some point in the probably near future it will be much simpler to create an autonomous AI agent than a nuclear bomb.

esafak

True, so we need to make sure we don't find ourselves in a mess before it happens. Right now I don't see nearly enough concern given to risk management in industry. The safeguards companies put on their models are trivially subverted by hackers. We don't even know how to cope with an AI that would attempt to subvert its own constitution.

ASalazarMX

In the incredible case that we develop fully autonomous agents capable of crippling the world, that would mean we developed fully autonomous agents capable of keeping it safe.

Unless the first one is so advanced no other can challenge it, that is.

grayfaced

How did you jump to that conclusion? The agent will be limited by the capabilities under its control. We have the technological ability to cripple world now and we don't have the technological means to prevent it. Give one AI control of the whole US arsenal and the objective of ending the world. Give another AI the capabilities of the rest of the world and the objective of protecting it. Would you feel safe?

wendyshu

Fallacious

redeux

The paper described a level 5 fully autonomous agent as one that can:

create code(user request);

execute();

Is this not possible with tool use alone, so long as the agent has access to a tool that can execute arbitrary code?

jcarrano

I feel that these kinds of statements are more effective at promoting AI than limiting it. It reinforces the assumption that such powerful AI is behind the corner. It hypes up AI and the result is likely more money and resources being put into it.

Imagine if the A-bomb was being openly developed. What title would have contributed more to funding and research, "The A-bomb (is terribly powerful and) should not be developed" or "The A-bomb will never work"? Except the A-bomb did work and in a surprisingly short time, while autonomous AGI is still a conjecture.

lysace

Our analysis reveals that risks to people increase with the autonomy of a system: The more control a user cedes to an AI agent, the more risks to people arise. Particularly concerning are safety risks, which affect human life and impact further values.

artninja1988

This is quite concerning seeing that the authors are all affiliated with huggingface. Hopefully they won't start censoring what models you can or can't upload because they seem certain things shouldn't be developed.