Skip to content(if available)orjump to list(if available)

DeepSeek gives Europe's tech firms a chance to catch up

harry8

I've been trying both deepseek-r1:8b and deepseek-r1:32b using ollama on a local desktop machine.

Trying to get it to generate some pretty simple verilog code with extensive prompting.

It seems really bad?

Like specify what the module interface should be in the prompt and it ignores it and makes something up bad. Utterly rubbish code beyond that. Specify a calculation to be performed yet it calculates something very different.

What am I missing? Why is everyone so excited? Seems significantly worse to me than llama. Both o1-mini and claude haiku imperfect, sure, but way ahead. Both follow the same prompt and get the interface and calculation as specified. Am I doing it all wrong somehow (more than likely)?

After fixing up my open-webui install I tried "testing 1 2 3, testing. Respond with ok if you see this." Deepseek-r1:8b started trying to prove a number theory result.

Is there a chance this thing is heavily optimised for benchmarking not actual use?

turblety

Just to confirm, Ollama's naming is very confusing on this. Only the `deepseek-r1:671b` model on Ollama is actually deepseek-r1. The other smaller quants are a distilled version based on llama.

https://ollama.com/library/deepseek-r1

diggan

Which, according to the Ollama team, seems to be on purpose, to avoid people accidentally downloading the proper version. Verbatim quote from Ollama:

> Probably better for them to misunderstand and run 7b than run 671b. [...] if you don't like how things are done on Ollama, you can run your own object registry, like HF does.

kgwgk

It’s definitely on purpose - but if the purpose was to help the users making good choices they could actually give information - and explain what is what - instead of hiding it.

TeMPOraL

It's also not helping the confusion that the distills themselves were made and released by DeepSeek.

If you want the actual "lighter version" of the model the usual way, i.e. third-party quants, there's a bunch of "dynamic quants" of the bona fide (non-distilled) R1 here: https://unsloth.ai/blog/deepseekr1-dynamic. The smallest of them is just able to barely run on a beefy desktop, at less than 1 token per second.

rightbyte

It is very interesting how salty many in the LLM community are over Deep Seek.

DS has more or less been ignored for a very long time before this.

avereveard

Also Ollama is traditionally very sloppy with the chat templates they use, which does impact model performances.

diggan

The press and news are talking about R1 while what you've been testing is the "distilled" version.

Sadly, Ollama has a bit of a confusing messaging about this, and it isn't super obvious you're not actually testing the model that "comes close to GPT-4o" or whatever the tagline is, but instead testing basically completely different models. I think this can explain the mismatch in expectation vs reality here.

rightbyte

You are hard to impress. Running a 1/20x sized version locally that would be sci fi level 10 years ago.

For such small models I would recommend specialized models only. Like Deep Seek Coder. But I think that one is lagging behind the state of art now.

harry8

Like an 8 year old factoring large numbers it's not amazing how well it is done it's that it is done at all that amazes. Sure. Amazing but not at all useful and not something one would expect the kind of fuss we've seen.

Seems the explanation is the deepseek-r1 models I was using are not, in fact, deepseek-r1. Thanks all for the heads up.

rightbyte

Ye, to your defence it seems like the Ollama project did it confusing on purpose to mess with Deep Seek.

npn

>What am I missing?

Like everything. You do not even mention your machine spec, so I'm assume you just pick the ones that fit, which probably the quant versions.

Quant versions of the "small" models do not perform that well. Not the way you expected them to be.

rcarmo

I tried that distill, plus the "original" at chat.deepseek.com and the Azure-hosted replica on a simple coding problem (https://taoofmac.com/space/blog/2025/01/29/0900), and all three were bad, but not that bad. I suspect the distill will freak out with very little context.

thom

The distilled versions are terrible and the full version seems very slow, but the latter is certainly in o1’s league.

numpad0

Do other models do well for the same use cases? I thought LLMs are only good for low-value adtech codes and resources accessible on public Internet, like tons of getters/setters and onEvent triggers without much CS elements or time or multi domain implications.

They're also hyper sensitive to what I'd describe as geometric congruency between input and output: your input has to be able to be decompressed into final form with basically zero IQ spent on it, as if the input were zipped version of as yet existing output that the LLM simply macro-expanded.

R1 is just an improved LLM, nothing groundbreaking in those specific areas. Common limitations of LLMs still apply.

IMO, the layman's model of LLM should be more of predictive text than AI. It's a super fast keyboard that types faster than, not better than, your fingers.

sega_sai

Given the current state of US government I wouldn't be surprised if soon it would introduce sanctions against anyone trying to use Deepseek.

alwayseasy

You're right: Republican Senator Hawley introduced that bill Friday. https://www.hawley.senate.gov/wp-content/uploads/2025/01/Haw...

vancroft

Wow, why is the title tag of the page apparently a Windows file path?

watwut

Because someone copy pasted something somewhere.

null

[deleted]

guerrilla

New reasons I never thought I'd have to have to use a VPN. Italy already banned DS by the way.

walthamstow

Italy also banned ChatGPT when it was released.

diggan

And unbanned as soon as OpenAI made ChatGPT compatible with the data/privacy laws in the country. I'm sure DeepSeek will be able to make the adjustments too.

lyu07282

I imagine if lets say Hetzner were to offer hosted DeepSeek they would be forced out of the US market, no longer be allowed to even buy NVIDIA. That is a high-risk, assuming EU companies would ever even chose to switch to Hetzner over Azure/etc., which they won't. The EU is acting like and ruled by people who believe the US isn't an hostile entity, that attitude is never going to change and the biggest reason why AI in the EU is dead.

bootsmann

Azure/AWS biggest competitor is going to be LIDL, not Hetzner. Hetzner competes with Digitalocean et al.

belter

> Azure/AWS biggest competitor is going to be LIDL

You can't be serious...I heard to open an account your are supposed to send them a fax... :-)

pinoy420

Side note: what are people’s opinions of the UK regarding AI now? Completely insignificant? I feel the UK has taken a nosedive over at least the past 30 years.

f1shy

In general, in all related to CS and software the UK is ages ahead of the mainland Europe. Also in AI. But still of course way behind US.

belter

> In general, in all related to CS and software the UK is ages ahead of the mainland Europe

You are you going to have to support this statement a little bit better :-)

Cumpiler69

How about that the Danish pharma giant Novo Nordisk, makers of Ozempic, is opening their AI/ML lab in London next to DeepMind and other tech giants, instead of in its back yard?

The only one close to UK in Europe is probably Switzerland and France but they too are mostly focused on research in universities rather than pushing out commercial products the way the US is exceeding at. Everyone else is not even in the game.

captainbland

It's not an AI specific thing. The UK's investment in harder science/tech seems to be seriously anemic.

Every job posting I see seems to be some variation on the theme of pumping some representation of money around or retail/ecomm/HR management.

HPsquared

DeepMind has made some really amazing stuff.

The UK will need to be an AI hotspot if we want to continue being a financial centre.

teamonkey

The UK is a financial (and legal) centre because of its regulations. They make the UK a trustworkthy broker for international companies to deal with. (Also its deregulations, making it easy to squirrel money away to tax havens.)

Not sure why AI would help there.

yuumei

The government wants to introduce a law to make it illegal to possess AI tools that are capable of CSAM output. As we know this is impossible, any company starting in the UK with AI will likely fail compared to other countries if this law passes.

https://www.bbc.co.uk/news/articles/c8d90qe4nylo

pinoy420

I scariest part of this which I do not see people worried about, is the one sentence about requiring suspects to open their phones at the border for inspection.

kypro

We have the talent, but lack the vision to ever build something significant.

Founders are more likely to want to tackle smaller problems with a more secure business model.

Investors are much less likely to back high-risk/high-reward speculative business propositions.

Government will insist that we need to go slow and steady and prioritise goals like inclusivity and safety over performance.

The public will be sceptical of any large AI company, especially if they're pushing regulatory boundaries, and demand government intervention.

krageon

They did brexit and started mattering even less than they already did: Not at all.

Xuban

If French, German and British nations start competing (like in the old days) for AI supremacy, it will unleash a level of creativity that we long forgot we are capable of in Europe.

DrScientist

DeepMind - the company owned by Google, but behind things like AlphaFold is based on London. And one of the fathers of modern AI - leading the resurgence of research into neural nets - Geoffrey Hinton is British-Canadian.

He moved from Britain originally due to the difficulty in getting his research funded.

So the issue isn't one of intellectual capital - and while it's obviously the case that well place monetary capital is an issue - it's not clear to me what the real underlying issue is.

Perhaps Europe needs a tech/industrial revolution again - where the power shifts from the old guard to the new. Perhaps too many people in charge in Europe are from a certain class that studied history at university.

rcarmo

Well, they were completely right in studying history, but maybe they should have studied better economics too.

Cumpiler69

We both know that's not gonna happen. Europe is way too entrenched in its ways by this point. The good ol' glory days that brought in Airbus and Concorde are gone and not transferrable to the modern, dynamic and very internationally competitive SW world, nor are its leaders strong and motivated enough to enact policy changes that favor disruption of the old money guard at the expense of the status quo. Case in point we have no SW giants, no Airbus equivalent of the SW world. All Europe's giants are decades to centuries old. 20 years ago EU's GDP was on par with the US's, now we're only half the US's GDP. We're cooked.

Plus, we first have to prioritize solving more urgent and important topics like affordable housing (WHEN?!), the collapsing pension and welfare systems which is a ticking timebomb, cheap energy, collapsing demographic (see affordable housing), illegal immigration, Putin's war next door, the rise of the right wing (see illegal immigration) before jumping into another pissing contest with the US and China on something that's not gonna help fix the pressing issues we have right fucking now. I don't see how we can recover from this downward spiral when I look at the inactions of our politicians who are just kicking the can down the road and blaming the EU and other countries of the union for their own systemic failures.

Winning the AI race might sound cool but it might also be similar to winning the race to the moon: a cool flex but not super useful to the general population if they can't afford a place to live or getting healthcare in a timely manner. Until ChatGPT can wipe your retired old ass in a care home I doubt many people will see AI investments as being a top priority.

sanxiyn

> OpenAI charges $2.5 for 1 million input tokens, or units of data processed by the AI model, while DeepSeek is currently charging $0.014 for the same number of tokens.

This is somewhat misleading, because OpenAI price is for uncached and DeepSeek price is for cached. DeepSeek uncached price is $0.14.

TeMPOraL

DeepSeek API has also been down for over a week now, so does this even mean anything?

world2vec

Besides Mistral which AI companies are out there in Europe really competing against the US and Chinese tech giants? Deepmind doesn't count, it's owned by Google for quite some time now.

diggan

Besides Mistral there are a bunch of smaller "research" models like https://huggingface.co/Almawave/Velvet-14B (Italian) and https://huggingface.co/projecte-aina/aguila-7b (Spanish/Catalan), but as far I'm aware, nothing that really competes with OpenAI/DeepSeek so far (which tbh, I don't think Google does at this time either? None of their models I've tried even came close to GPT4)

mrdevlar

No, I don't think so.

I think most of the work done in AI here is focused on current applications of AI not on the development of new AI.

Havoc

> There was an offer from DeepSeek which was five times lower than their actual prices,

Lower than the usual DS price? Or than OAI? Article is a little ambiguous.

Would be surprised if DS is offering 1/5 rates

karmasimida

DeepSeek gives everyone a chance to catch up, to be precise.

It is just the beginning.

hirokio123

It's interesting how ChatGPT is securing major deals with governments and large corporations, while DeepSeek is gaining support from smaller startups. The contrast is also intriguing from the perspective of the power dynamics between the U.S. and China.

dep_b

What data sets do you use when testing this or other open source models?

seydor

catch up to whom? china? US? russia?

baq

China and US obviously but first EU needs to be shaken out of complacency. It’s brewing slowly, but as all risk, it’s slow until it isn’t. Shutting down nuclear, outsourcing all tech and manufacturing, overregulation and allowing to be taken over by culture of lazy (and lazy cultures) are all fixable, but it takes time, which is running out.

iinnPP

The lazy culture thing is so overwhelming in Canada right now as well. I have started to note the impact in a lot of places.

rat9988

What is this lazy culture you are all speaking about?

imiric

> outsourcing all tech and manufacturing

Huh? Can you mention any examples of this, specifically in tech?

AFAIA outsourcing is done largely by western countries, including the US, _to_ eastern countries, including parts of Europe, Russia, China, India, etc. This is an obvious cost cutting measure, since developers there are cheaper and the talent pool is large. This has been going on for decades. Hell, the H-1B visa is made to bring those developers in, while still undercutting their salaries compared to US employees.

Outsourcing is much less prevalent in the EU, let alone "all tech"...

> overregulation

As opposed to no regulation? The US and China are not role models for how tech companies should exist and operate in society. Whatever "innovations" are being stifled by regulations in the EU is for good reasons. Big Tech has way too much power and influence to the detriment of society. At least the EU is making an effort to draw some boundaries, and if you ask me, it's not nearly enough.

> allowing to be taken over by culture of lazy (and lazy cultures)

Yeah, how dare those lazy europeans have a sensible work-life balance!

rightbyte

>allowing to be taken over by culture of lazy (and lazy cultures) are all fixable

Please wait a bit until I've leveled up to the capitalist class. I'll be there any day now. Thank you.

ein0p

Not just Europe. Many companies in the US will benefit, too. As will companies in Asia, Africa and the Middle East. These are the first truly frontier-grade models released under a friendly license. The most potent non-reasoning model before this point was Mistral Large, and it has serious restrictions on allowed uses (research only).