Skip to content(if available)orjump to list(if available)

Hard drives on backorder for two years as AI data centers trigger HDD shortage

rsync

It's rough out there and has become increasingly difficult to maintain our pace of storage deployment.

Further - and most concerning - is the pollution of the supply chain with refurbished/recertified stock being sold and marketed as "new".

One example:

https://kozubik.com/items/MaestroTechnology/

I strongly advise buyers to stick with trusted suppliers, avoid Amazon/ebay channels, and carefully vet your incoming stock with SMART tools to ensure you receive what you think you are ... especially for SSD parts.

estimator7292

DO NOT assume SMART is reliable. You can wipe SMART stats or write any values you want.

You have to actually examine the real bits on the drive. Resellers don't want to take the time to actually zero a drive, they usually just nuke the partition table.

You also need to physically examine the drive. Corroded fingerprints on the PCB, wear on the port contacts, scratches from mounting rails, etc.

That's how it found out that the last "new" drive I bought on Amazon was actually a used Backblaze drive. It contained terabytes of customer data, and a shit ton of cleartext files. SMART, of course, reported it was a brand new drive with zero hours. Cleartext logs on the drive showed many thousands of hours of runtime.

Physical examination is the only reliable method.

neilv

> That's how it found out that the last "new" drive I bought on Amazon was actually a used Backblaze drive. It contained terabytes of customer data, and a shit ton of cleartext files. SMART, of course, reported it was a brand new drive with zero hours. Cleartext logs on the drive showed many thousands of hours of runtime.

This sounds like it could be a big problem for Backblaze customers, and consequently for Backblaze.

Can you alert the Backblaze CEO about their insufficiently-decommissioned drives leaking out like this?

Backblaze customers also need to know, but I would give Backblaze the first shot at figuring out how to notify, whom, of what.

SoftTalker

> drive I bought on Amazon was actually a used Backblaze drive

Assuming this is true, I find it weird/surprising that Backblaze doesn't at least zero their drives before disposing of them? I have to do that at my work, and at least by policy I could lose my job if I skipped doing it.

kkylin

Question for all of you more knowledgeble than I: can SMART data be tampered with? When I get, say, a refurbished Mac from Apple, I'm trusting Apple won't stoop to that. But a SSD vendor I've never heard of?

fny

Yes, it can be tampered with. Drives can even lie about the amount of storage they support. I once bought a 1TB pen drive that was only 32MB for $10. (Yes, I knew it was a scam beforehand.)

walterbell

"OMEC" (Organization of Memory Exporting Countries) NAND production quotas?

https://x.com/jukanlosreve/status/1988505115339436423

  Samsung Electronics has lowered its target for NAND wafer output this year to around 4.72 million sheets, about 7% down from the previous year's 5.07 million. Kioxia also adjusted its output from 4.80 million last year to 4.69 million this year.. SK hynix and Micron are likewise keeping output conservatively constrained in a bid to benefit from higher prices. SK hynix's NAND output fell about 10%, from 2.01 million sheets last year to around 1.80 million this year. Micron's situation is similar: it is maintaining production at Fab 7 in Singapore—its largest NAND production base—in the low 300,000-sheet range, keeping a conservative supply posture.
Micron has a new US fab coming online in 2027, which should improve supply.

DRAM price fixing scandal: 1998-2002, 2016-2018, https://en.wikipedia.org/wiki/DRAM_price_fixing_scandal

acd10j

It’s a cartel just like the OPEC oil cartel.

Eddy_Viscosity2

Is there anything that isn't functionally a cartel at this point?

Pet_Ant

Yes, anywhere there is a race to the bottom. That is why you see cartels rise up, like unions are cartels. Things that are not cartels generally have ever diminishing margins.

null

[deleted]

criemen

So far, due to the AI boom, we're lacking:

* HDDs

* SSDs

* DRAM

* GPUs, obviously

* Power to hook up to our datacenters

Anything I'm missing? What a crazy world we're living in.

magicmicah85

Water? Clean Air? ROI?

captainkrtek

Nearly nothing left unexploited in the pursuit of profits.

SlavikCA

And there is no profit, too.

maerF0x0

Makes me wonder about the things that go into datacenters and power plants:

- Air conditioners

- generators

- Racks/Cages

- fire supression

- Concrete, Steel, power lines

etc

derefr

I have a hypothesis that it was AI, not COVID/sanctions/etc, that was mainly responsible for the 2020-to-ongoing "chip shortage." Ignoring companies with their own fabs (Intel) and companies with pre-existing reserved-capacity contracts with fabs (Apple), everyone else is stuck waiting in line behind batch after batch of fab orders from Nvidia.

Downstream of that, AI is effectively also responsible for the current generation of game consoles never declining in price.

Because game consoles are fixed platforms that continue to be manufactured over 5+ years, normally the most expensive parts in the system (the CPU and GPU) would gradually get cheaper to manufacture [and in turn, cheaper to buy] over the course of the console's lifetime—which was often passed onto the consumer in the form of the console's MSRP gradually decreasing. Either the process node for the console's silicon design would stay fixed, and demand for this process node would gradually decrease as larger fab customers move on to newer nodes, decreasing the (effectively auction-based) pricing for fab time on the older node; or the console manufacturer['s silicon vendor] would put their silicon design through a process-shrink, and so, while still paying top dollar for use of the fab's newest node, would be getting more chips per wafer out of that, and again could charge less.

But instead, what we've seen since the start of the AI boom is that there's no longer any price-reduced timeslots to be sold to manufacture the low-BOM parts for these price-sensitive console-maker customers. Instead, both Nvidia and AMD are now getting such high value out of even the older nodes, that 1. the fabs know they can squeeze them, charging full price for those slots as well; and 2. both Nvidia and AMD, in their roles as silicon vendors to the console makers, haven't been able to justify using (very much of) the time they're paying so much for to fab low-BOM-cost parts to fulfill their pre-existing outstanding purchase orders, when they could instead be fulfilling much-higher-margin new POs from the hyperscalers.

Thus every console of the ninth generation (PS5, Xbox Series, Switch) still selling for their launch price (with no help from process shrinks); thus none of these consoles having been able to be produced in excess of demand to the point that supply-and-demand ever drove retail prices below MSRP; and thus the only tenth-gen console so far, the Switch 2, taking ~4 years longer than anticipated to release†.

---

† Nintendo were very likely waiting for Nvidia to run off enough of the Tegra T239 with a sufficiently low passed-on fab cost, for Nintendo to both 1. be able to build a backstock and non-run-dry pipeline of Switch 2s, and 2. be able to be positive-margin on charging the same price as the Switch 1 for them. They waited four years, and neither thing ever happened; so they eventually just gave up and priced the Switch 2 higher, and also built out an entirely novel D2C + online-marketplace-partner distribution pipeline so they could ration the tiny initial supply of units they had been able to build with the chips Nvidia had supplied them so far.

Though, that being said, Nintendo actually got a double whammy here. They were also waiting for fast NAND to come down in price, so that they could have physical game cards manufactured for a trivial BOM price while still enabling the "direct GPU disk-streamed assets" pipeline that games of the last generation had begun relying on. Obviously, as today's article points out, that hasn't been happening either! Thus game key cards; thus SD Express cards only beginning to trickle out, with no sizes above 128GiB available at the Switch 2's launch time; and thus those SD Express cards being ridiculously priced for their capacity compared to equivalent transfer-speed + die-size NAND (as seen in e.g. low-profile/flush-mount flash drives.)

stuff4ben

jobs...

mchannon

During 1998-2000, AOL was ordering so many free trial CDs that it locked up world production, and music CDs faced 8-12 week delays. It was rumored that certain weeks there were no albums getting fabricated at all, worldwide.

I wonder if history isn’t repeating itself. AOL CDs had pretty much jumped the shark by 2000.

krackers

>Picking QLC over TLC allows them to maintain costs while achieving sufficient endurance for cold storage.

How does that work, doesn't QLC have less write endurance?

ycombinete

Yes, but QLC has much higher density.

I think it's the higher density that makes it better for cold storage, which generally has infrequent access, and more reads than writes.

Hence the QLC's endurance being "sufficient for cold storage".

bayindirh

In short: Aggressive overprovisioning.

Enterprise SSDs are not expensive only because they have better flash chips, but they have much more of them.

A top of the line write oriented SSD comes with 4-7x more capacity than what it says on the tin, but that extra capacity is used for cell replacement rather than capacity itself.

Mixed use comes with 2-4x overprovisioning, and read oriented is around 2x IIRC.

Havoc

Ssd for cold storage seems like an odd choice in itself. If that’s genuinely done due to availability then we really are in for a wild ride

bayindirh

I believe "cold storage" in this parlance is more like "read-oriented" rather than being accessed once in three years.

esseph

Cold storage normally doesn't have frequent writes or frequent reads.

ls612

Thank god I built my NAS in early October it seems like I got the proverbial last train out of town.

karlkloss

And when the AI bubble bursts, "refurbished" HDDs and GPUs will flood the market. Save your money now and be prepared.

Nux

GPUs yes, but there'll be no HDDs making it alive, they'll get destroyed to protect whatever rubbish they had on.

bigbuppo

Nah, the liquidators aren't going to care about that. Those hard drives are going to be shipped out with all your wildest porn chat bot fantasies.

maerF0x0

"Shredded onsite" means by the next user when they format the drive and write contents to it /s

pmontra

HDD can be written multiple times with random data if data centers really have to protect what their former customers wrote on them. I never looked at those details in standard contracts.

There is also encryption at rest.

jimwalsh

All the large datacenter/cloud companies do not let hard drives leave the building.

tencentshill

They have enough investor money they don't need to recoup it selling used drives. Straight to the shredder.

teeray

> HDD can be written multiple times with random data

Which costs more in compute than simply throwing the drive in a shredder

archagon

I believe many enterprise drives have instant-erase functionality (presumably deleting an encryption key).

HugoTea

Depends how it goes down, if a company goes into insolvency all security policies are off the table and random hardware can get shifted into lot bidding.

microtherion

My understanding is that refurbished GPUs from e.g. crypto mining are something people are warned away from, because they've often been run into the ground. Are AI usage practices different?

trenchpilgrim

> My understanding is that refurbished GPUs from e.g. crypto mining are something people are warned away from, because they've often been run into the ground.

No, this wasn't the case. While there were never comprehensive studies various tech media purchased these cards to run testing and found that, other than scammers, they all performed to expectation.

lm28469

You can buy used car tires with 1% of thread left and they'll perform amazingly during your one time test too

esseph

If this bubble pops you might need that money for food when bananas go from $1.50 to $150.00

mrandish

I really hope the AI bubble bursts sooner rather than later. Sooner will impact the broad economy less severely (although it'll still be pretty bad) and curtail these supply chain shortages. If the bubble keeps inflating, the storage makers will have already mostly built out excess capacity and the crash will lead to even longer-term supply distortion.

margalabargala

This is a long term good thing.

It sucks right now and will probably suck through 2027.

By 2028 or so we'll have a 50% drop in price-per-storage for these components.

palmotea

> This is a long term good thing. ... By 2028 or so we'll have a 50% drop in price-per-storage for these components.

Per the op:

> and the ongoing DRAM shortage is proof of this, with memory kits costing more than double what they did just a few months ago.

> While enterprise-grade QLC SSDs would entirely power this pivot, Sandisk has already raised NAND prices by 50%, according to another DigiTimes report, after initially warning of a 10% increase two months ago.

So you're basically saying prices may return to normal in two years, and that's somehow a good thing compared to them not being inflated in the first place?

chrismorgan

> By 2028 or so we'll have a 50% drop in price-per-storage for these components.

Do you mean relative to six months ago, or now? Because a lot of the prices have already more than doubled.

(I’m upset because the computer I’ve been planning to build, which three months ago would have come to around ₹90,000, is now up to ₹1,20,000 and climbing week by week, half due to price increases on the same part, half due to forced substitutions on RAM since the cheaper 32GB 6400MT/s DDR5 sticks are completely unavailable. And looking into laptops, for the first time ever I’m seeing manufacturet SODIMM or SSD upgrades being cheaper than aftermarket.)

walterbell

> manufacturer SODIMM or SSD upgrades being cheaper than aftermarket.

Temporarily thanks to old stock.

andy_ppp

Will the AI bubble last until 2028? I’m still unclear how AI will return even 10% of this investment in profit.

citrin_ru

Depending on the future you predict 10% may be a good ROI - if AI will replace humans and traditional economy will collapse all other investments will loos value even more. In such scenario you cannot save the money you only can loose less if you will make a right investment.

adrianN

I'm not sure I understand why the economy would collapse if AI replaced humans. Wouldn't the companies just make more profit because they save labor costs and stonks go up?

andy_ppp

No I'm predicting a 90%+ loss.

mock-possum

> delivery times for enterprise-grade HDDs delayed by two years.

I sleep

> so hyperscalers are now switching to QLC NAND-based SSDs to avoid these backorders … This could lead to SSD prices rising worldwide

Real shit

BoredPositron

The funniest thing about this is that, with high GPU prices, rising RAM costs, and now increasing SSD prices, Apple will end up producing the most affordable PCs.

jrvarela56

If every other PC is more expensive, they will just increase prices.

ipsum2

Apple uses the same RAM, SSD, etc as everyone else does. They don't have a magic supply chain that is unaffected by the broader world.

siva7

They have a magic supply chain that is unaffected by the broader world which is one of the reasons why Tim Apple was chosen by Steve Jobs as his successor.

BoredPositron

They don't use the same SSDs? They don't use the same RAM? They have their own supply chain in place? Whatcha talking about bud?

HackerNewt-doms

No, Apple has effectively promoted iCloud as the alternative to local storage as part of its product differentiation strategy in the lower price segment.

Apple will almost certainly introduce the same approach for the budget MacBook as well.

evanjrowley

The future of AI is everything we liked getting ruined by AI.