The Anthropic Economic Index

197 comments

February 10, 2025

azinman2

I'd love to see this analysis done for ChatGPT, which has a much bigger 'consumer' market share.

I'm also very wary of their analysis method, given classifiers-gonna-classify. We already see it in their example of someone asking why their game is crashing, which gets bucketed into the Computer & Mathematical occupation. I'm guessing the original question came not from a game developer but from a game player, so can you really call this an occupational task? Sure, it's in that domain, I guess, but in a completely different context. If I'm asking a question about how to clean my dishwasher, that's hardly in the repair or industrial occupations.

Still, it's cool they're doing this.

raylad

Very wary. Not weary.

skeeter2020

why not both?

fragmede

Yeah. Reminds me of the ancient OkCupid data analysis blogs (and not the creepy one by sleep8). The group I'm surprised not to see represented in their analysis is "personal", where people I know use ChatGPT as a therapist/life coach/SMS analyzer and editor. And of course they crucially but understandably left off the denominator: 35% of a million requests is different from 35% of a billion. And also how many of the conversations had 1 message, indicating "just testing", vs. 10 or 100 messages.

bufferoverflow

> 35% of a million requests is different than 35% of a billion.

Not statistically.

alwa

A mentor I respect memorably explained to young me that “it doesn’t matter how big the pot of soup, you can use the same size spoon to taste it.”

layman51

What do you mean by “statistically”? The end results would be like three orders of magnitude apart. Wouldn’t the desired sample size depend on the size of the population itself?

olddustytrail

Yes statistically. What do you think "statistically" means?
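
For what it's worth, the statistics here are standard: the precision of a sample proportion depends on the sample size, not on the population size, so long as the population dwarfs the sample. A minimal sketch in Python, assuming simple random sampling:

```python
import math

def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    # 95% CI half-width for a sample proportion. Ignores the
    # finite-population correction, which is ~1 whenever N >> n.
    return z * math.sqrt(p * (1 - p) / n)

# A sample of 1,000 conversations pins down "35%" to about +/- 3 points,
# whether those conversations came from a million requests or a billion:
print(round(margin_of_error(0.35, 1_000), 4))  # 0.0296
```

fragmede's denominator point still stands in the economic sense, though: 35% of a billion requests is far more absolute activity than 35% of a million, even if both estimates are equally precise.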

rafram

> not the creepy one by sleep8

What are you referring to?

frankfrank13

This more or less confirms what I imagine most of us thought: AI is mostly used by engineers, or for engineering tasks. Makes sense. I wonder how much traffic comes from automated tasks (Copilot, etc.). Every time I read a report like this I do wonder if we'll ever see an ROI on LLMs. HUNDREDS of billions of dollars of spend, and 3 years in it's still primarily the same crowd using it, and it has yet to create a "killer" app beyond ChatGPT and Copilot-style IDEs. And it's not like people aren't trying! Look at the recent YC batches: it's all AI-for-industry. Idk man, I fear the economic reality on the backside of this kind of spend.

steveBK123

I think the problem is the data.

Software engineering is a weird niche that is both a high paying job and something you can almost self-teach from widely available free online content. If not self-teach, you can rely on free online content for troubleshooting, examples, etc.

A lot of other industries/jobs are more of an apprenticeship model, with little data and even less freely available on open internet.

esperent

> something you can almost self-teach from widely available free online content.

I think you massively underestimate just how much data is online for everything, especially once you include books which are freely available on every possible subject (illegally, perhaps, but if Meta can download them for free then so can everyone else).

There's less noise for many other subjects than for software engineering; there are often just a couple of competing ways to do everything rather than hundreds. There might be just one coursebook rather than thousands of tutorials. But the data for self-teaching is absolutely there.

BeetleB

The difference is in the cost of the equipment.

Sure, you can learn all about power electronics by yourself. But have some ideas you want to implement? Hundreds to tens of thousands of dollars.

kanbankaren

> Software engineering is a weird niche that is both a high paying job and something you can almost self-teach

If you meant programming, I agree it could be self-taught, but not SE. SE is the set of techniques, practices, and tools that we have collected over decades for producing multi-versioned software that meets a certain reliability rating. Not all of these are freely available online.

BeetleB

I'll bite. Can you list specific things not freely available online?

elicksaur

Unless you are talking about people who are actual licensed engineers, this is a distinction without a difference.

Thing is, I’ve never met someone in software with a professional license.

steveBK123

Sure. And that's why SWEs will be fine in the world of AI, as the rote work is more easily automated.

The contrast is that for a lot of other jobs, the rote tasks are not routinely solvable with free online content in text form.

optimalsolver

There are plenty of self-taught people in the open source space making highly reliable software.

taurknaut

The vast majority of people learn this on the job. This is certainly not taught in schools (or is only barely scratched as a topic).

bakuninsbart

I would agree that the products coming out so far lack imagination, but hard disagree on the impact. LLMs have completely transformed multiple industries already. In SWE, I would estimate that junior positions shrank by 70-80%, but even that is less extreme than what is going on in other industries.

In marketing, the entire low-end to mid-tier market is gone. Instead of having teams working on projects for small to mid-sized companies, there's now a single Senior managing projects with the help of LLMs. I know multiple agencies who cut staff by 80-90% without dropping revenue.

Translation (of books, articles, subtitles) was never well paid, even for very complex and demanding work. My partner did it a bit on the side, mostly justifying the low pay with some moral blah about spreading knowledge across cultures... With LLMs you can completely cut out the grunt part. You define the hard parts (terms that don't translate well), round out the edges and edge out the fluff, and every good translator becomes two to ten times more productive. Since work is usually paid by the page, people in the industry got a very decent (at least temporary) pay jump, I would imagine around 100%.

Support is probably the biggest one though. It is important to remember that outsourcing to India only works for English-speaking countries. And even that isn't super cheap. Here in Germany, if you don't have back-up wealth, it is your constitutional right to get some support from the state (~1400 euro), but you are obligated to find a job as soon as possible, and they will try to help you find a role. Support was always one of the biggest industries to funnel people towards. I talked to a friend working there, and according to them the industry has basically stopped advertising new positions; the only ones left are in financial services. The rest went all in on LLMs and just employ a fraction of the support staff to deal with things that escalate far enough.

And that's not even touching on all the small things. How much energy is spent on creating pitch decks, communicating proposals, writing documentation, etc.? It probably goes up as far as 50% of work in large orgs, and even if you can save just 5% of your time by using LLMs to phrase or organize, there is a decent ROI for companies to pay for them.
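
A back-of-envelope version of that ROI claim, with illustrative figures (the salary and subscription costs below are assumptions, not numbers from the comment):

```python
# Illustrative assumptions: a fully loaded employee cost and a per-seat
# LLM subscription price; only the 5% time saving comes from the comment.
annual_cost_eur = 80_000      # assumed fully loaded cost of one knowledge worker
time_saved_fraction = 0.05    # the 5% time saving cited above
subscription_eur = 25 * 12    # assumed ~25 EUR/month per seat

value_recovered = annual_cost_eur * time_saved_fraction  # 4,000 EUR/year
roi_multiple = value_recovered / subscription_eur        # ~13x
print(f"{value_recovered:.0f} EUR/year recovered, {roi_multiple:.1f}x the seat cost")
```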

advael

I think a lot of this is because the economic pressure is weak right now both on the side of labor and consumers, due to decades of severe upward wealth transfer. A lot of these companies are not improving or even maintaining their productivity or quality of service, and while there are probably some productivity gains for engineers, I suspect based on what I'm seeing that this is going to burn a lot of people out, as there is significant social pressure both from peers and employers to exaggerate this somewhat. People can have too heavy a workload for a decent amount of time before breaking.

There's just no countervailing force to make these decisions that immediately painful for them. Sectors are monopolized, people are tired and desperate, tech workers are in a basically unprecedented bout of instability.

The situation is super dark from a lot of angles, but I don't think it's really "the overwhelming usefulness of AI" that's to blame here. As far as I can tell, the biggest thing these technologies are doing is providing a cover story for private-equity-style guttings of various knowledge work verticals for short-term profit, which was kind of inevitable given that's been happening across the board in the larger economy, it's just another pretense that works for different verticals.

There are cases where LLMs seem genuinely useful (mostly ones that are for and by SWEs, like generating documentation or smoothing some ramp processes in learning new libraries or languages), and those don't seem to be "transformative" at scale yet, unless we count "transforming" many products into buggier products that are more brittle and frustrating to interact with.

dorgo

>I know multiple agencies who cut staff by 80-90% without dropping revenue.

I'm finding it hard to reconcile this with my own experiences. My whole team (5 people) left last year (for better pay, I guess) and the marketing agency in Germany I'm working for had to replace them with freelancers. To offset the cost they fired the one guy who was hired to push the whole LLM/AI topic. We managed to fill one junior position by offering 10k+ more than in their last job. The firm would love to hire people to replace the freelancers. We did have to cut costs lately, but mostly they closed the kitchen, which wasn't used due to the work-from-home policy. I definitely don't see any staff reduction due to automation/LLM use. They still pay (external) people 60€ per written text/article, because clients don't like LLM-written stuff.

torginus

Actually, I have interacted with multiple translators in multiple industries and I haven't seen any disruption (although I agree with your statement that it was never well paid).

- Synchronous translation at political/economic events still needs a person, as it ever did
- LLMs are nowhere near the level needed to translate fine literature at a high enough quality to be publishable
- Translating software is still very hard, as the translator usually needs a ton of context/reference for commonly used terminology - we partnered with a machine translation company, and what they produced sucked balls

I have friends who work as translators, and we make use of translation services as a company, and I haven't seen the work going away.

achierius

> I would estimate that junior positions shrank by 70-80%

This just isn't true, it's nowhere close.

physicsguy

We're trying to use it for industrial apps. It's been over a year of R&D, with some good but often mixed results. Adherence to prompts is a big issue for us. It's most useful not as a chatbot but for giving explained descriptions of what the user is seeing, so they don't need to dig down into 20 graphs and past history. That necessitates being able to refer to things with URIs, which works 95% of the time, but the 5% is a killer since it's difficult to detect issues and it leads to dead links.
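
One common mitigation for that dead-link failure mode is to validate every URI the model emits against the set of resources the app can actually resolve before rendering. A sketch under assumed conventions (the `app://` scheme and the `known_uris` registry are hypothetical, not physicsguy's actual system):

```python
import re

URI_PATTERN = re.compile(r"app://[\w/.-]+")  # hypothetical in-app URI scheme

def validate_links(model_output: str, known_uris: set[str]) -> tuple[str, list[str]]:
    """Replace unresolvable URIs in model output and report them."""
    dead = [u for u in URI_PATTERN.findall(model_output) if u not in known_uris]
    for uri in dead:
        # Render a placeholder instead of a broken link, and surface the miss.
        model_output = model_output.replace(uri, "[unavailable]")
    return model_output, dead

text, dead = validate_links(
    "See app://graphs/42 and app://graphs/999 for the anomaly",
    known_uris={"app://graphs/42"},
)
print(text)  # See app://graphs/42 and [unavailable] for the anomaly
print(dead)  # ['app://graphs/999']
```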

frankfrank13

I tried to build a BIG E2E automation pipeline, along the lines of "replace a team of 5 with this one simple task". And as I was doing it, all I could think was: just use ChatGPT. Sure, it can't actually automate what you're doing, but it'll get you there 80% as fast as fully automated, with 90% less risk of error/nonsense at the end. Ironically, this company blocks all LLM websites; they even block GH on their employees' computers.

flessner

But "AI" tools have more or less seeped into every mainstream product... this is a strong "defensive move" for companies in anticipation of more to come.

We aren't leaving MS Office or Adobe because they already pushed out some minimal innovation. But what about the products you don't even know about? For lawyers, doctors, logistics, sales, marketing, woodworkers, handymen? In Europe or Asia?

A new product bringing true innovation could easily push out a legacy business on the strength of the "shiny new thing" (AI) and better UX alone. A lot of software in these areas simply hasn't improved in 10 years - with a great idea and a dedicated team, it's a landslide waiting to happen.

Ancalagon

Maybe spend and better models will help this (I've not used the deep research models, so maybe we are there already). But even in day-to-day coding, the LLMs are great helpers, yet give them anything more than a slightly complicated prompt and they seem to become completely helpless. You just constantly need a human in the loop because these models are too dumb or lack the context to understand the big picture.

Maybe these models will get better as they’re given more context and can understand the full stack but for now they cannot.

And this is just with code, where it already has billions of examples. Never mind any less data-rich fields. The models still need to get smarter.

salynchnew

Daniel Rock has done some interesting work on the ROI of AI in general (also, I believe two of his papers are referenced in this study). Note that this doesn't explicitly restrict itself to covering LLMs, but... still a very interesting body of work.

https://www.danielianrock.com/research

lifeisstillgood

My term for this is "Whitey's goin' to the data center". We are looking at an arms race where there really is genuine new technology, and it will make a difference - but at the 1-2%-per-annum-of-an-economy level. Compounded over fifty years that is geopolitical dominance, yes, but it's not "machines of loving grace"-level growth.

We already have thousands of geniuses working across our economies and teaching our youth. The best of our minds have, every year or so, been given a global stage in Nobel speeches. We still ignore their arses, and we will ignore AI when it tells us to stop fighting or whatever.

The real issue here is that wafer-scale chips give 900,000 cores, and nothing but embarrassingly parallel code can use them - and frankly no coder I know writes code like that. We have to rethink our whole approach now that Moore's law is over. Only AI has anything like the ability to use the processing power being built today - the rest of us could stick to cores from 2016 and nothing would change.

Throwing hundreds of billions at a bad way to program 1 million cores, because we have not rethought software and businesses to cope, seems wrong - both because "Whitey" could spend it on better things, and because it is an opportunity. Imagine being 900,000 times faster than your competitors - what does that even mean?

Edit: Trying to put it another way - there are two ways AI can help us - it can improve cancer treatments at every stage of medical care, through careful design and creation of medical AI models that can slowly ratchet up diagnosis, treatment and even research and analysis. This is human organisations harnessing and adapting around a new technology

Or AI can become so smart it just invents a cure for cancer.

I absolutely think the first is going to happen and will benefit the denizens of the first world first. The second one requires two paradigm-shifting leaps in the same sentence. Ten years ago I would have laughed in Anthropic's face. Today I just give it a low probability multiplied by another low probability - and that is an incredible shift.

tzury

Claude is indeed far better known among software engineers.

Google Gemini's integration into Docs/Sheets/Slides and Gmail will perhaps show different demographics in a few months, and that is before we have even heard from OpenAI.

frankfrank13

You may be right, but I doubt it. I suspect similar usage metrics for Gemini and OpenAI

brap

Seems like Anthropic has too much money on their hands and are looking for ways to spend it. It’s surprising to see lean AI startups accumulate fat so quickly. Usually this sort of wheel spinning is reserved for large corporations.

And it’s not just them. To me this trend screams “valuations are too high”, and maybe hints at “progress might start to stagnate soon”.

raldi

Anthropic is a Public Benefit Corporation whose governance is very different from a typical company in that it doesn’t put shareholder ROI above all else. A majority of its board seats are reserved for people who hold no equity whatsoever and whose explicit mandate is to look out for humanity.

https://www.anthropic.com/news/the-long-term-benefit-trust

https://time.com/6983420/anthropic-structure-openai-incentiv...

Eextra953

This is why I cancelled my ChatGPT subscription and moved to Claude. It's kinda silly, but I feel like the products are about equivalent for my use case, so I'd rather do business with a company that is acting in good (better?) faith.

mupuff1234

Don't think that's silly at all.

rvnx

In case they don't get high salaries from this activity, there is also another path: the next step, in ~10 years, could be to offer their services to governments as "automated court decisions".

Then the people who funded / trained this "justice" out of the goodness of their hearts would actually have leverage, in terms of concrete power.

It's a much more subtle way to capture power, if you can replace the judges with your software.

mostlysimilar

That isn't silly, that's one of the only ways to exercise agency under hypercapitalism. I recently cancelled my Amazon Prime membership and got a Costco membership for the same reason. I don't get every product I want, but I'm also okay with that.

mppm

I'm not sure if you are being sarcastic or not, but the practical upshot of this new "Public Benefit Corporation" thing, with or without a trust or non-profit attached, is that you can tell both the public and your investors to fuck off. The reason why all the big AI startups suddenly want to use it is because they can. Normally no sane investor would actually invest in such a structure, but right now the fear that you might be left out of the race for humanity's "last invention" is so acute that they do it anyway. But if Dario Amodei actually cared about humanity any more than Sam Altman, that would be the surprise of the year to me.

raldi

Can you imagine a hypothetical AI company that did care about humanity, and if so, how would it look different from Anthropic?

erikerikson

You seem to misunderstand benefit corporations. They remain committed to profit and are just as subject to their board and officers as any other corporation.

There are sane investors that prefer investing in companies that adopt these corporate structures. Based on data, those investors see public benefit corporations as more profitable and resilient. They are able to attract employees and customers that would otherwise not be interested or might be less interested.

idiotsecant

Put this and "don't be evil" and 5 dollars in my hand and I'll give you a cup of coffee.

nonchalantsui

Coffee for $5? That's a steal in this economy!

beepbopboopp

The exact opposite. Relative to ChatGPT, Anthropic has an enormous "brand problem." What they should be doing is exclusive deals like this, but with large publishers on a recurring basis, while figuring out how to teach consumers who they are and how to use their products best. For like 99% of use cases all these products are at parity, and the real business gains are in finding a way into consumers' lives.

Semi-relevant side note: ChatGPT spent $8M on a Super Bowl commercial yesterday just to show cool visualizations, rather than any emotional product use case, to an audience the overwhelming majority of which has never had direct experience with the product.

These companies would be best served by building a marketing arm away from the main campus, in a place like LA or NY, to separate the gen-pop story from that of the technology.

peterlk

I disagree. I think Anthropic, like the other big players, is trying to get some of that government money. Releasing policy-adjacent papers seems like a way to alert government officials that Anthropic ought to be in the room when the money starts changing hands.

noah_buddy

I am inclined to agree. If you're at the precipice of automating or transforming knowledge work, and the value of being first is nearly infinite (due to "flywheel effects"), why would you dedicate any energy to studying the impact of AI on jobs directly? The thesis is that everything changes.

I think AI in its current iteration is going to settle into being a slightly worse version of Wikipedia morphed with a slightly better version of Stack Overflow.

lblume

I think that strongly underestimates the impact LLMs, especially reasoning models, have on how code is written today.

noah_buddy

Educate me. I find them useful but they are less so when you try to do something novel. To me, it seems like fancy regurgitation with some novel pattern matching but not quite intuition/reasoning per se.

At the base of LLM reasoning and knowledge is a whole corpus of reasoning and knowledge. I am not quite convinced that LLMs will breach the confines of that corpus and the logical implications of the data there. No “eureka” discovery, just applying what we already have laying around.

erikerikson

Because that research helps you understand your market and where the value generation is. This can expose where to better invest.

tinyhouse

Understanding who is using your product is wheel spinning?

amazingamazing

Idk, the models themselves are quickly becoming a commodity. It makes sense to spend money figuring out go-to-market rather than just improving the models themselves.

SpicyLemonZest

I don't see it. This is just an analysis of how Anthropic customers are using the product and what investment areas seem most promising in the future - why wouldn't they want that?

nerdponx

It's clearly more than an interesting tech blog post written by one of the data guys in their spare time. It's an "initiative".

That said, this doesn't seem like completely superfluous "fat" like what Mozilla does. It seems very much targeted at generating interesting bits of content marketing and headlines, which should contribute to increasing Anthropic's household brand-name recognition vs. other players like OpenAI, as well as making them seem like a serious, trustworthy institution rather than a rapacious startup that has no interest in playing nice with the rest of society. That is: it's a good marketing tool.

My guess is that they developed it internally for market research, and realized that the results would make them look good if published. Expect it to be "sunset" if another AI winter approaches.

sofixa

On the contrary, this is very important information to have in order to understand your customer base: how sticky you are with them, what features you need to focus on, etc.

laidoffamazon

I would argue this is within their overall objective. It’s not like Stripe creating a publisher (??)

AnEro

Everyone in the comments seems to dislike the article or see it as a waste of time. I just don't think we are the audience they wanted for this: I think they want to show the average business owner the realistic potential, and to show the public (via the journalists who will distill this later) that they are aware of the impacts and what to expect.

I don't read it as fear AI, I read change is happening because of AI.

ajmurmann

They also call out how it's used more for augmentation than for full automation, which will address some of the concerns the public has.

s_dev

Tools empower those with knowledge further than those without knowledge. The notion that laymen were simply going to be able to take on experienced programmers at their day jobs was farcical.

neom

I'm not a dev but I thought the issue from a senior dev perspective was AI on AI, not layman on AI?

SilasX

I honestly don’t know who the audience could be, other than “people who like to tell others they’re in the know because they read AI companies’ press releases.”

At no point do I see an actual elevator pitch/tl;dr/summary of what the frak this index actually is, except that it’s part of some effort to track AI adoption. It just rains down figures about which industries are using how much AI without first grounding the new concept they’re introducing.

When you say you have a new economic index, you need to give me a number, how I should interpret that number, and where it comes from. I don’t see that.

GDP: a measure of a country's total economic output, computed by adding up end-product purchases.

CPI: the general price level, taken as a weighted average of prices throughout the economy.

Big Mac index: how expensive goods are in a country relative to the US, by reference to the local cost of a Big Mac converted through the exchange rate.

Here I expect something like “the economic output-weighted fraction of production taken over by AI”, but instead it’s just a list of AI adoption by industry.

Why introduce an index and not headline with a definition of an index? Which audience prefers that?

ajmurmann

Awesome that they are releasing this paper and the associated data. I hope they'll do this regularly, so that changes can be tracked.

One thing I hope they'll correct going forward is inclusion of API usage. Anecdotally, I only use Anthropic models via Cursor. So none of that usage shows up in here. I'd expect that specialized tools/interfaces like Cursor will grow and thus more usage will shift to API. It would be a shame to miss out on that in the data set.

bwhiting2356

I expect more 'automation' to happen through the API than 'augmentation'.

cruffle_duffle

Tools like Cursor have to be a huge chunk of their traffic. I mean, Cursor and Sonnet are like two peas in a pod.

Even if they don't train on the data, they could break it down by user agent / API client ID and infer something about Cursor traffic.
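
A sketch of that kind of breakdown, assuming a hypothetical request log with a `user_agent` column (the schema and file name are made up for illustration):

```python
import pandas as pd

# Hypothetical schema: one row per API request, with the client's
# self-reported user agent (e.g. "Cursor/0.45", "anthropic-python/0.21").
logs = pd.read_parquet("api_requests.parquet")

share_by_client = (
    logs["user_agent"]
    .value_counts(normalize=True)  # fraction of total traffic per client
    .head(10)
)
print(share_by_client)  # roughly how much of the traffic looks like Cursor
```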

rsanek

haven't they committed to not training on / using data submitted through API?

chr15m

Claude is absolutely killing it on the OpenRouter LLM rankings board.

https://openrouter.ai/rankings

It seems like the most popular choice for API access.

Havoc

That tracks well with my subjective experience.

At day job - finance/office stuff - essentially zero traction despite everyone having enterprise AI subs & brainstorming sessions about use cases etc.

Then go home & do some hobby coding and suddenly it's next level useful.

It's not that one is harder than the other, but rather that many jobs don't have an equivalent of a code base. The AI could, I think, grok parts of the job, but typing up the relevant content and what is required would take much longer than doing the task. There is nothing there to copy & paste for a quick win in the same way as with code.

rafaelmn

Feels like a page of graphs where the Anthropic team discovers that Claude is the best coding model and is used mostly by devs, and that they have no penetration in the general population compared to OpenAI.

CL_ergo

I laughed out loud at their chart showing that Sonnet had a higher share of coding questions, whereas Opus had more writing.

If they would just look at their own product, they'd see that the model description literally says Opus is better for writing. If you advertise one of your models as geared toward task X, the finding that people use it more for task X isn't really an insight.

mlinhares

I read it that way as well. I think this conflates "there is no penetration in that market" with "we have not been able to get these people to use our tools".

hall0ween

Does OpenAI have similar data available to compare to?

slama

> Overall, we saw a slight lean towards augmentation, with 57% of tasks being augmented and 43% of tasks being automated.

I'd like to see a comparison to the data from 6 months ago, before Sonnet 3.5. I suspect the automation rate will trend up over time, but that may mostly be captured by API usage, which isn't in the dataset.

xnx

I wouldn't expect Anthropic to have any special knowledge/insight in this area aside from the data they have on usage of their own tools. As such, I'd be much more interested to see some "Google Trends"-like data about Anthropic usage. Unfortunately, since AI is a dynamic and competitive industry, I don't think anyone will be sharing information like that for a long time.

827a

Honestly, simply releasing a graph showing the trendline of their own metrics on e.g. inference would speak volumes. They have this data already, 100%; it's on some Grafana dashboard the on-calls are watching every day. My suspicion is that some AI providers (OAI/Google) are watching these metrics go up and to the right quite consistently, but Anthropic's aren't doing that.

rsanek

Depends on whether you count API usage. OpenRouter shows that it's pretty clearly up and to the right thanks to tools like Cursor - they're at 2x the volume vs. just 3 months ago https://openrouter.ai/anthropic/claude-3.5-sonnet:beta/activ...

Cheer2171

Words I typed into a paid service are now public on a huggingface repo. I don't care if it is anonymized. Fuck that. I am deleting my account.

cosmojg

Uh, this isn't true at all. Did you actually look at the repo[1]? Only the metadata (i.e., LLM-generated task names and interaction classifications) have been made available.

[1] https://huggingface.co/datasets/Anthropic/EconomicIndex

ramon156

> While Claude.ai data contains some non-work conversations, we used a language model to filter this data to only contain conversations relevant to an occupational task, which helps to mitigate this concern.

That's nice. My main prompt has a hint suggesting when I refer to work; I do this so Claude can assume my tech stack. Of course I exclude or mask confidential data, but it's still nice this stuff gets filtered.
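
The paper's work-relevance filter presumably boils down to a short classification call per conversation. A minimal sketch using the Anthropic Python SDK (the prompt wording and model choice are assumptions, not the paper's actual pipeline):

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def is_work_related(conversation_text: str) -> bool:
    """Label a conversation as occupational or not with a single model call."""
    response = client.messages.create(
        model="claude-3-5-haiku-latest",  # assumed: any small, cheap model works
        max_tokens=3,
        messages=[{
            "role": "user",
            "content": "Is the following conversation relevant to an "
                       "occupational task? Answer only yes or no.\n\n"
                       + conversation_text,
        }],
    )
    return response.content[0].text.strip().lower().startswith("yes")
```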