Harvard study: open-source has an economic value of 8.8T dollars
28 comments
·March 21, 2025zonkerdonker
blitzar
> Around 95 percent of this figure is attributable to 3,000 developers worldwide
Linus is probably 60% of that?
WithinReason
$8.8T*0.95/3000 = $2.8B
7e
Yes, $2.8B per developer is a clear indication that this study is delusional. The entire global wealth of the world is 450 trillion. Open source is 2% of that? I don't think so. Wealth includes labor, land, materials, homes and buildings, tools, weapons, renewable and fossil sources of energy, industrial and rural capacity, other capital, mines, etc.
Or maybe I'm wrong, and OpenAI and Anthropic (which produce AIs which can code decently) are massively undervalued at only a hundred or so OSS developers each.
glimshe
Does this support the idea behind the 10x and 100x developer concept? In my experience, they do exist.
washadjeffmad
Depending on your standards, those are just flattery rates. People probably won't want to work for you (or in the field), though, if they're implied to be 0.1x or 0.01x developers.
c0balt
That value and cost-factory is likely a higher multiple when talking about infrastructure. If everyone had to pay for Windows (or a similar OS) instead of Linux and numerous other systems (PostgreSQL, OpenLDAP, Nginx, ...) our costs for hosting services would move up at least an order of magnitude.
flanked-evergl
And you also would have to factor in the negative externalities of giving the profesional managerial class like Bill Gates more money and power.
fragmede
How does open source avoid that? Red Hat, which got bought by IBM; both of them have executives in spades, as does Google and Meta, both of them are huge supporters of open source. (Not including Llama. Fsck Mark Zuckerberg for co-opting the term Open Source for llama when it isn't)
flanked-evergl
Of course, the PMC, being the parasites they are, will try to siffon value from everything they can. My point is merely that it's more difficult for them with Open source because quite often no money is changing hands. I don't pay for my operating system, I use Fedora. If I were to pay for it the rent seeking PMC will take their cut of what I paid, and most likely the largest cut.
reaperman
RedHat is a pretty small % of total linux installations. I think you're reaching a bit with this general line of argumentation.
rjbwork
Sorry, Bill Gates is not PMC - if we're going with a neo-Marxist analysit of class, Gates is firmly Bourgeoisie, while all of us operating the business machines of the day and using our intellect to help organize and manage the resources of society are the PMC.
anonzzzies
We make future open source software; it becomes open source when we recouped our investments. It's the only way I found that works. If we never recoup them, then ok, we open source them too but that we don't tell anyone. And yes, we donate what we can to projects we use. But that's not a plan; unfortunately it doesn't work outside a few coffees for them if the project is substantial.
rvense
Sounds low. None of the companies I've worked for would have existed without open source.
7e
That's like saying companies you worked for couldn't exist without indoor plumbing (which is what OSS effectively is). Yet the earliest company was founded in the year 578 AD, and other massive organizations (like empires) existed much earlier than that. OSS is plumbing, cheaper plumbing makes for more tiny fruit-fly companies, yes, but there are always alternatives.
shoo
The study appears to be this one from Jan 2024:
Hoffmann, M., Nagle, F. and Zhou, Y., 2024. The value of open source software. Harvard Business School Strategy Unit Working Paper, (24-038).
link to HBR summary (published March 2024): https://www.library.hbs.edu/working-knowledge/open-source-so...
link to the 2024 working paper [pdf]: https://www.hbs.edu/ris/Publication%20Files/24-038_51f8444f-...
shoo
brief critique of the methodology from a skim-read of the paper:
> We first estimate the supply-side value by calculating the cost to recreate the most widely used OSS once. We then calculate the demand-side value based on a replacement value for each firm that uses the software and would need to build it internally if OSS did not exist. We estimate the supply-side value of widely-used OSS is $4.15 billion, but that the demand-side value is much larger at $8.8 trillion
[...]
> We first measure the value of OSS by considering the supply and the demand side of OSS using a labor market approach. The thought experiment is that we live in a world where OSS does not exist and has to be recreated at each firm that uses a given piece of OSS. Using the labor market approach, we calculate the labor replacement cost of each OSS package. To estimate the value for each package, we use COCOMO II (Boehm, 1984; Boehm et al., 2009) at the individual package level and then sum across all package values to obtain a supply-side labor market replacement value. Then, we scale the supply-side value by the number of times firms are using each package while removing multi-usage within each firm to obtain a demand-side value.
The demand-side estimate (used to produce the headline $8.8 trillion US dollar number) is simple, but has a few flaws:
- it assumes a world where commercial software vendors cannot replace open source projects to become suppliers and maintainers of equivalent packages / libraries. Instead, each firm must internally reconstruct each package it depends upon without external communication or trade. This seems to be a poor approximation of how the world works -- it assumes specialisation of firms and trade between firms is impossible.
- it assumes that each firm that uses a package uses _all_ the code in the package. but often in practice, an open source package caters to a variety of use cases, but its use by a particular firm may only be for a single or sparse number of use cases involving a subset of the code
- it assumes that demand for a good (an open-source software package) does not reduce as the price of that good increases from zero to a potentially extremely high capex cost. in practice we might find that the _benefit_ firms obtain from depending upon an open source package is marginal, and an increase in the cost of the package from "free" to even 1% or 10% of the capex cost to develop it in house (let alone if it cost 100% of the capex cost) might result in it being irrational for the firm to depend on the package, and the firm would eliminate the dependency by switching to a cheaper alternative (use a manual process instead of a software process, switch to a commercial package sold by a vendor for a modest licensing fee, simply stop doing the marginal-value job that the software was doing)
From a quick skim-read, the authors do not explicitly seem to address any of these potential weaknesses in the demand-side estimation approach in their paper, but to their credit they do include "Appendix A) Goods-Market Valuation Approach" where they explore an alternative.
> The top 5 programming languages contain C (including C# and C++), Java, JavaScript, Python, and Typescript. We also add Go to this list of top languages since it is an increasingly widely used language in OSS.
They group C# and C++ together with C as "C"!
> For both the supply and demand models, we obtain the dollar-value for each package via the Constructive Cost Model II, also abbreviated as COCOMO II (Boehm 1984, Boehm et al. 2009). [ ... ] It is a highly flexible model that allows us to create non-linear transformations of lines of code to dollar-values. It uses the following modelling equation: E_i = alpha eta L_i^beta where L_i represents the lines of codes in thousands and E_i the effort in person-month units for each project i. Consistent with Blind et al. (2021), we use the default parameter values for alpha, beta and eta. The parameters alpha, beta are non-linear adjustment factors set to 2.94, and 1.0997 respectively. The parameter eta is an effort-adjustment factor which can be modified to incorporate subjective assessments of product, hardware, personnel, and project attributes. Since we do not have a prior for each project, we set eta to the default value of one
So, if I follow that, the estimate of person-months of effort for package i is E_i := alpha eta L_i^beta = 2.94 * L_i^1.0997 where L_i is the number of lines of code in thousands in package i.
Interestingly, this approximation for effort is independent of programming language used to develop the package!
This suggests that if we rewrote a given package in a less expressive programming language, where 10x as many lines of code were required to produce a package with equivalent functionality, then all things equal, the estimated "value" of our package would increase by 10x, as a result of labour productivity decreasing by a factor of 10!
This doesn't make a lot of sense, but is a consequence of the assumption to approximate the firm's value of using a package with the replacement cost. I imagine that estimating the actual economic value of the package to each firm, independently of the cost, would require a lot of very hard work and information that is difficult or impossible to obtain.
It is unclear if the approach of "Blind et al (2021)" to set all the COCOMO II model parameters alpha and beta to defaults, and further the author's decision to set the parameter eta to one, produces accurate estimates of effort as a function of lines of source code. It is certainly an easy thing to do!
edit: i mixed up supply-side & demand-side in parts of my critique. fixed.
robertlagrant
> This suggests that if we rewrote a given package in a less expressive programming language, where 10x as many lines of code were required to produce a package with equivalent functionality, then all things equal, the estimated "value" of our package would increase by 10x, as a result of labour productivity decreasing by a factor of 10!
This is part of the "labour theory of value".
feverzsj
And no company pays them enough to even cover the living expenses.
blitzar
Companies have shareholders and responsibilities.
They have to choose between contributing a salary for a year to open source projects that are the foundation of their business or fuelling one of the corporate jets for an hour. Making these sorts of tough decisions are what the 10mil+ in bonuses are for at the end of the year.
ForTheKidz
I suspect many of these open source developers are employed in this capacity on some level by for-profit corporations (which sadly does bring a high degree of dysfunction in terms of resource allocation). But I can't imagine being this productive as a side-gig.
almosthere
well, most successful oss projects are paid for through that one guy at the corner cubicle that has very few company responsibilities, but he maintains lib-fart-chart, which is also used at said company.
guappa
You have a source or at least a reasoning for this optimistic assumption?
high_na_euv
A lot of oss contributors work for big companies
Wow, killer of a first paragraph.
>Around 95 percent of this figure is attributable to 3,000 developers worldwide. In addition, open-source software is included in 96 percent of all codebases.
Not shocking to see the big guns coming hard at FOSS.