Anthropic's CEO says in 3-6 months, AI will write 90% of the code (March 2025)

86 comments

·August 16, 2025

Visit

danparsonson

Man selling shovels reports that every job requires a new shovel.

Or in other words, of course he says that; it's not really a useful analysis is it?

rasz

Ray Kassar, CEO of Atari, says every Atari 2600 owner will buy at least 3 copies of E.T. the Extra-Terrestrial!

Fizzadar

100%. How is this even remotely considered newsworthy.

crinkly

Technically man selling shit and shovels. Or is it shovels that make shit.

0xflarion

I remember 2018-2020, when everyone was saying that in 1-2 years, all cars would be autonomous vehicles and we won't need drivers anymore. Guess what.

flohofwoe

Same for UML in the 90s: "in a few years programmers will be obsolete because a handful system architects will draw UML diagrams which will be synthesized into code", in the end the opposite happened, and the dedicated system architect role pretty much disappeared.

And UML even made a lot more sense to precisely describe a problem compared to human language prompts.

voxleone

And using UML selectively and intentionally can build the habit of thinking before coding. It's a mental exercise, much like test-driven development or writing pseudocode, that strengthens your design intuition.

flohofwoe

The question is if (too much) upfront planning is actually better.

IME it's more useful to treat a source file (or new project) like a sketchbook, you start with an empty sheet of paper but already have a rough idea what you want, then you quickly sketch out the outlines, try out different ideas, explore different solutions (all without going into too much detail yet), you step through the code, explore some different paths to get an idea what "feels right", delete things, shuffle them around, rewrite them and slowly filling out the details and that way incrementally get towards the first working version.

Which such an approach of incremental "micro-feedback-loops" you already eliminated a lot of dead ends that will appear anyway despite the best plans - but identifying such dead ends early is much better than late.

This incremental approach also forces you to keep the code small, tidy and malleable. The planning stage basically already happens in source code, and it's not one long planning stage, but many micro-planning-stages.

...this is also my problem with the current state of LLMs, they're pretty good at creating a first initial sketch from a very general problem statement (but only for things that have been done thousands of times by other people - but that's a different issue...) - but the more you'll need to go into the details and discover and fix problems in your initial 'mental design' (e.g. discovering what you actually wanted in the first place), the more the differences to traditional programming disappear, and at some tipping point it gets even more complex because you need to steer the LLM with a language (the human language) that simply isn't useful enough for detailed problem descriptions - that's why lawyers, mathematicians, engineers and scientists all invented their own precise 'DSLs'.

After a few iterations your prompts need to be just as detailed as writing source code in the first place - so what's the point of writing prompts in human language again?

I feel like productivity could be improved much more by improving programming tools (yeah - boring old-school incremental maintenance work) instead of betting on some weird AI future which will just move the focus from writing source code to writing human-language prompts (which IMHO is a definitive step backwards because human language lacks precision - and adding that precision is how you end up with programming languages).

Why are editing, compilation and debugging/testing still separate steps, why do debuggers still have those 60s style bare variable panels instead of realtime visualizations of the internal program state? Why do I still need to wait for a build to finish? Why is version control still such a PITA to work with? Instead everybody is jumping on the AI hype train while everything around them is crumbling into a post-apocalyptic wasteland.

glimshe

XML was going to end all other data formats.

pera

I still remember this one from 2016 very well:

A Driverless Tesla Will Travel From L.A. to NYC by 2017, Says Musk

https://www.nbcnews.com/business/autos/driverless-tesla-will...

bboygravity

Really not that far off in the grand scheme of things IMO.

Ray Kurzweil was even more on the money in his 2006 book "The singularity is near". I remember reading some of the stuff in there that is now happening. From the top of my head he predicted 2030 for human-level AI hardware and software that could be bought for 1000 USD by anyone. I feel he's going to be very close.

card_zero

2020. "Based on the above analyses, it is reasonable to expect the hardware that can emulate human-brain functionality to be available for approximately one thousand dollars by around 2020."

crinkly

https://en.wikipedia.org/wiki/List_of_Tesla_Autopilot_crashe...

jimsimmons

And how solving the trolley problem is the key to make it all work

bufferoverflow

[dead]

onename

Meanwhile: Nearly half of all code generated by AI found to contain security flaws - even big LLMs affected

https://www.techradar.com/pro/nearly-half-of-all-code-genera...

VBprogrammer

Now do Junior Devs...

JCM9

Are most junior devs honestly any better? In my experience no.

fzeroracer

In my experience, junior devs are actually better about this because they haven't been ground down by years of corporate bullshit and shellacking whenever they bring up security holes or issues. Some of them will be a bit over eager and miss obvious stuff but unlike LLMs you can actually train them to be better about it.

VBprogrammer

Yeah, to be honest I agree. I've worked with senior people who couldn't give a crap, "it works" is the only barrier to shipping it.

I've also worked with juniors who are technically much better than most of the senior people around them.

UmGuys

If there are no devs, who finds the junior bugs?

tdhz77

All devs

righthand

All senior devs are now junior devs, that’s why we stopped hiring juniors.

Give me them downvotes all you want but I’m not the “senior” turning in slop Llm code.

akaike

Couldn’t it easily also take over the CEO job? Pretty sure it’s easier than producing code that works and is maintainable.

sameermanek

Given how much these CEOs are hallucinating these days while hopelessly losing money on every venture they pursue, I think AI is leaps and bounds ahead of these idiots for decision making.

ozgung

Good point. Maybe not for the CEO yet, but a manager without people to manage is a useless thing. So I think corporates will invent new Bullshit Jobs[1] for humans, to keep them in their organization chart.

[1] https://en.wikipedia.org/wiki/Bullshit_Jobs

UmGuys

I feel like management roles would be much more easy to automate than dev. It's hilarious how they're trying to sell these products. The only thing they couldn't do is go golf and drink half the time. They would be superior in that regard.

ben_w

I once asked a CEO, what a CEO's job involves.

Apparently it's lots of fiduciary duties.

As with driving cars, even if the AI is strictly better at doing these tasks than they are at writing code, mistakes aren't so easy to recover from and can destroy something unrecoverably in a 5-second attention lapse from a human overseer.

gf000

So can software, if not more. Like, your healthcare data leaking, bank account losing your money, some legal document getting lost/wrongly issued, .. and then we didn't even talk about actual safety critical applications (which are hopefully not vibe coded) like airplanes/medical device, etc.

ben_w

AI is software, so in a sense everything that can go wrong with AI must be a subset of things that can go wrong with software.

Lots of software has a test environment. Even in live, e.g. bank account losing your money the transactions can be un-wound.

And that's the difference when it comes to replacing software devs with LLMs vs replacing CEOs with LLMs: it's possible to write the tests and then lock them. And to perform code review before changes are merged into the main branch. And to test against something other than production.

I know the Board can in principle remove a CEO, but is there even a standardised way for a CEO to have a vice-CEO that always checks everything they do, that always tests their commands against a simulation of the company and only lets them proceed if they agree with the outcome?

The point is that "AI as CEO" would be in the category of "business-critical" software, and also that current approaches to AI also lack sufficient guarantees of obligation compliance or sufficient defence against failures, which in the banking example would be things like the AI deciding to save money by deleting the system capable of unwinding incorrect banking transactions.

To the extent this kind of failure mode happens with vibe coding (in the original coining of the term: always accept without reading), it's like letting the LLM modify the unit tests to always succeed regardless of the code.

akaike

Well, the same goes for wrong code. One wrong line can cost millions or destroy everything completely, depending on the context. It is also not very easy to recover from.

ben_w

The two contexts where that applies is "interacts with the outside world and you deployed without tests", and "even though it only affects your own data, you don't have backups and you deployed without tests".

hakanderyal

For CRUD/UI heavy web apps, it can. I have probably wrote ~5% of the code I produced in the last 2 months, but I've read and verified and corrected every line of code. Output increase is substantial.

But I'm not expecting that to hold true for linux kernel or postgres codebase or equivalent anytime soon.

CafeRacer

Im heavily using claude code right now... because i need something now. But it's a huge pile of technical debt.

With every piece of code created i write a doc of what id need to focus on when rewriting chunks.

AI has its usage, it makes some things faster... but spitting out more code does not necessarily make it more productive.

I ride AI train, but i ride my cb650r same way... sometimes wondering why i am still alive.

rich_sasha

> But it's a huge pile of technical debt.

That's an interesting observation: how will LLM-generated code age?

Maybe we don't care, we just test it and throw more prompts at it.

But already 90% of some jobs is understanding the legacy code someone else wrote. I suppose LLMs can be good at this. Unless their maintenance of legacy code is just throwing more complexity at it until it becomes unmanageably complex.

hakanderyal

With Claude Code, you need to create the docs beforehand, not after.

For my latest project, I have 20k+ lines of markdown docs to guide it, with great success. Some of them are generic rules, some of them describes how I code, some of them describing the codebase & features. Then I have another 17k+ that are used while coding. Plans, phases, todos, reviews etc.

All of them are written by Claude Code also. I'm calling it "spec-driven development".

Cursor has a much different flow, where you usually pair program with it, which I call "ai-assisted development".

zarzavat

I've tried this approach but at some point you're just coding in a less precise language (English), and with more duplication.

My current approach is to stub out the architecture I want, write the type definitions I want in full, and ask Claude to fill in the blanks. It's good at implementing an interface but bad at devising the interface on its own.

CafeRacer

Yes, but claude forgets things

brushyolaf

I recently got to freelance for a project where a junior dev took over an existing codebase and AI now writes 90% of the code. The codebase is now completely unmaintainable as the AI ignored the underlying abstraction patterns and the Dev is not experienced enough to correct the mistakes. 90% AI, 100% spaghetti code.

fleebee

As a note to future historians, this sounded just as crazy back in March.

anonzzzies

The thing is that this is probably currently indeed happening. The bizarre output of LoC by LLMs probably now eclipses whatever humans write by hand. I don't believe 90% passes the reviews though; a lot more gets written and a lot more gets discarded. My fear is more that this will change, and code will pass because less and less people know what is good or not. Not yet, but soon-ish, EVEN if the output quality does not improve.

bboygravity

90 percent definitely passes reviews.

Source: my commits pass reviews and it's not because the reviewers don't have like 50 years of experience between them.

It's funny how the comments on HN went from complete denial about coding with LLM's ever being possible at all like 1 year ago to "oh, it might be happening" now. I find it hard to understand how people can simultaneously be skilled (older) engineers while at the same time being completely oblivious to the concept of exponential improvement of tech in general. I guess people don't WANT to see the reality that is change and prefer to just look away and cope?

gf000

Because we already are in the diminishing returns phase with relation to LLM reasoning capability - exponentials can almost always only exist for short durations.

anonzzzies

I am unsure but as a formal verification (pupil of dijkstra and his students in eindhoven and Amsterdam) person, I see this happening in a crazy waterfall and indeed people on HN are the last deniers who keep saying 'it does not work for them'. It works for the rest of the world and things are going completely ape shit now. I am happy to be old as I probably won't live through the fallout; I would say I hope we change our ways and redistribute the gained wealth here but it won't happen. And then we also do not learn from having terrible software ruining lives...

anal_reactor

> I guess people don't WANT to see the reality that is change and prefer to just look away and cope?

Yes. But frankly, it's hard to blame them. It's just a shitty social contract where you spend lots of time gaining skills and by the time you're done your field has been completely obsoleted.

I'm a huge fan of making comparisons to how things used to work before industrial revolution, because that's what our brains evolved for. If your father was a blacksmith, you'd be a blacksmith too, and so would be your son. It was difficult to imagine a world that simply would stop needing blacksmiths. So it's completely normal that our brains spend first 20 years learning, and once we're done learning, we just apply that knowledge to everyday life, save for minor tweaks here and there.

Then it became a requirement to keep learning your entire life because the world kept changing. This sucks, but it's not completely infeasible, so people rolled with it, especially that new positions were cozy and well-paid, and there was some gentle exit path for those who were too old to learn new things.

But now we're in a situation where the society tells you that there's huge demand for given skill, and then your entire profession gets completely obliberated before you're even halfway done with your studies. In such environment it's simply impossible to create a realistic career plan, let alone have energy for luxuries such as raising a family or maintaining social connections. This isn't good.

AI has objectively made our life worse, and everyone is completely clueless how the situation develops. Denying everything and hoping to wait things out might actually be just as good of an attitude like any other. Because honestly tell me - even if we assume the most optimistic scenario "AGI within five years" - how the fuck do I plan my career?

poniko

Did some bigger admin tool to do search and matching datapoints etc and decided to only use claude code. First version was wow awesome this saved me so much time .. now 2 weeks later putting back code that was deleted several times, removing 3 copy's of the same code, way to complicated sqls, verbose code and looking how its mixing htmx with some wird own JavaScript. It really looks as a junior developer solutions, so I'm done using it for more then boiler plate things..

I will come back in 3-6 months and hope its better to understand its own limitations.

I'm guessing it would be less then a week for me to write it on my own.

The biggest issue is that I've now seen the shit it creates so I have zero trust in the code I now have from a security and stability standpoint. I know many have better experiences then mine.

verdverm

Have you tried agent instructions? They have been helping with these issues

urquhartfe

Well, he still has a month I guess

edg5000

For me this appears to be true actually, these days. It struggles more with large and terrible codebases though, because it has to ingest so much garbage code in order not to break anything, wheras with nicely strucured code it can do great work on a module level, as long as you direct it well.

zarzavat

"AI" has been writing 90% of my code, on a per-character basis, since IntelliSense came out.

The question is not how many characters can a computer spit out, but whether you need a human in the loop or not. That's not going to change in 6 months. In a few years, perhaps.