Python can run Mojo now
169 comments
· June 20, 2025
fnands
Mojo (and Modular's whole stack) is pretty much completely focused on people who are interested in inference, not so much on training or research at this moment.
So they're going after people who need to build low-latency, high-throughput inference systems.
Also, as someone else pointed out, they target all kinds of hardware, not just Nvidia.
orochimaaru
Why not use PyO3 instead? It has a much cleaner interface than Cython and C++ libraries.
The primary advantage of Mojo seems to be GIL-free execution with syntax that is as close to Python as possible.
fnands
GPU programming in Rust isn't great.
In Mojo it's pretty much the whole point of the language. If you're only using CPUs, then yeah, PyO3 is a good choice.
pjmlp
Currently it looks more like CPUs and eventually AMD, from what I have been following of their YouTube sessions and the whole blog post series about freedom from Nvidia and such.
They also lack CPU support on Windows, unless you use WSL.
hogepodge
There's pretty broad support for server-grade and consumer GPUs. It's a bit buried, but one of the most reliable lists of supported GPUs is in the Mojo info documentation: https://docs.modular.com/mojo/stdlib/gpu/host/info/
MohamedMabrouk
GPU code, kernels, and complete models can already run on datacenter AMD GPUs using the same code, the same programming model, and the same language constructs.
bsaul
Mojo is marketed as a way to get maximum performance on any hardware, not just Nvidia.
This may appeal to people wanting to run their code on different hardware brands for various reasons.
pjmlp
True, however that goal hasn't been reached today; it doesn't even run natively on Windows.
And for those who care, Julia is available today on different hardware brands, and there are other Python DSL JITs as well.
I agree they will get there; the question now is whether they will get there fast enough to matter, given what the mainstream market cares about.
melodyogonna
Mojo GPU kernels can run on both Nvidia and AMD GPUs today
fulafel
The limitations of DSLs and the pull of Python make it a practical sweet spot, I think, if they manage to get Python compatibility up to par.
ForHackernews
I love Julia and want to see it break through. It suffers from the lack of a big corporate patron.
pjmlp
Plenty of them already exist; that is why I pointed this out, which HNers keep overlooking.
ForHackernews
Sure, a bunch of companies use Julia but none of them are backing it the way Google backs Go, Oracle backs Java, or Mozilla (formerly) backed Rust.
btian
Mojo runs on Nvidia and AMD.
The competitor is Triton, not CUDA or Julia...
eigenspace
Julia runs on Nvidia, AMD, Intel and Apple GPUs
GeekyBear
Chris Lattner (the tech lead behind Mojo, LLVM, Clang, Swift and MLIR) appeared on a podcast a bit over a week ago and discussed the state of Mojo and where it is going.
He also discussed open sourcing Mojo and where the company expects to make its money.
ModernMech
He popped up on r/programminglanguages last week to say a bit about that as well: https://www.reddit.com/r/ProgrammingLanguages/comments/1lfz9...
Staross
Funny how the already weak case for not working on Julia, instead of creating a new language, is becoming even more flimsy:
FAQ:
> Why not make Julia better?
> We think Julia is a great language and it has a wonderful community, but Mojo is completely different. While Julia and Mojo might share some goals and look similar as an easy-to-use and high-performance alternative to Python, we’re taking a completely different approach to building Mojo. Notably, Mojo is Python-first and doesn't require existing Python developers to learn a new syntax.
https://docs.modular.com/mojo/faq/#why-not-make-julia-better
Now:
>We oversold Mojo as a Python superset too early and realized that we should focus on what Mojo can do for people TODAY, not what it will grow into. As such, we currently explain Mojo as a language that's great for making stuff go fast on CPUs and GPUs.
amval
Even funnier: https://github.com/tensorflow/swift/blob/main/docs/WhySwiftF...
> Julia: Julia is another great language with an open and active community. They are currently investing in machine learning techniques, and even have good interoperability with Python APIs.
dragonwriter
The share of projects that start out “we’re going to do X, and completely support using the syntax and semantics of existing language Y, but add more on top” that end up “we’re going to do X, in a language vaguely reminiscent of Y” is very hard to distinguish from 100%.
ddaud
I think the real reason is that Chris Lattner doesn’t want to work on Julia. He’s likely very (and justifiably so) opinionated about these topics, and probably wants a certain degree of creative authority over his projects.
pjmlp
Additionally, Julia works on Windows, lots of its issues have been ironed out throughout the last decade, and many folks are invested in it, rather than in a single company's product.
Yes, many of the mainstream languages started as single-company products, but let's put it this way: would anyone be writing one of those languages today had they not been gatekept as the way to access a specific platform?
So outside of accessing Max and its value proposition as a product enabler for XYZ, who would be rushing to write Mojo code instead of something else?
mattclarkdotnet
That video is restricted, do you have a public link?
abybaddi009
Doesn't seem to be restricted. Do you mean region locked?
slyall
Video isn't region locked according to the tool I normally use
GeekyBear
Sorry, but I'm not having any issue with that link.
Here is a link to the episode listing for that podcast, which might help.
loudmax
It's the Latent Space podcast, published June 13. It's a pretty decent podcast for keeping on top of AI stuff. Swyx, one of the co-hosts, is active here on HN.
duttish
Just worked fine for me.
I'm logged into YouTube; if you're not, perhaps it's something to do with that?
JonChesterfield
The factorial test giving zero on Mojo suggests they aren't doing arbitrary-precision integer arithmetic.
I liked mojo as a python superset. Wanted to be able to run arbitrary python through it and selectively change parts to use the new stuff.
A "pythonic language" sounds like that goal has been dropped, at which point the value prop is much less clear to me.
melodyogonna
From Chris Lattner on the Modular Discord a few days ago:
Yep, that's right. Int behaving like a machine integer is very important for systems performance. Leaving the "int" namespace untouched allows us to have an object-based bigint in the future for compatibility with python. As others have mentioned above, it is still a goal to be compatible with python in time, just not a short-term priority
eyegor
I guess they abandoned the Python superset idea? I followed them for a bit when they first publicly launched, and they said "don't worry, we'll be a real Python superset soon"; the biggest omission was no support for classes. A few years later, it looks to be missing the same set of Python features but has added a lot of their own custom language features.
hogepodge
I'd say less abandoned, and more deferred.
It was a highly aspirational goal, and practically speaking it's better right now to take inspiration from Python and have stronger integration hooks into the language (full disclosure, I work at Modular). We've specifically stopped using the "superset of Python" language to be more accurate about what the language is meant for right now.
serial_dev
Deferred means it’s coming, but we don’t know if it will ever come. I’d bet it won’t.
nickm12
Being a Python superset and being fast are fundamentally in tension. It would be possible, maybe, to have a Python superset where you get the highest performance as long as you avoided the dynamic features of Python. However, I suspect it would make the user base grumpy to have a hidden performance cliff that suddenly showed up when you used dynamic features (or depended on code that did the same).
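To make the cliff concrete, here's a toy sketch in plain Python (nothing Mojo-specific) of the kind of runtime rebinding a faithful superset would have to keep supporting:

```python
class Vec:
    def __init__(self, x: float) -> None:
        self.x = x

    def double(self) -> float:
        return 2 * self.x

v = Vec(3.0)
print(v.double())  # 6.0 -- looks statically compilable

# Any caller, including previously "compiled" code, must still observe
# a rebinding like this, so method dispatch has to stay dynamic:
Vec.double = lambda self: self.x ** 2
print(v.double())  # 9.0
```

A compiler that honors these semantics can't devirtualize that call, which is exactly where the hidden performance cliff comes from.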
pjmlp
The dynamic features of Python are no different from the dynamic features of Smalltalk, Self, or Common Lisp, but people have been educated to expect otherwise due to the failure of dynamic compilers to gain adoption in the Python community.
saagarjha
Taking unchanged Python and making it fast is basically impossible.
fwip
Sure - but you can support unchanged Python at regular Python speed, and also have extensions to the language that can be run at higher speed.
bishabosha
They explicitly cast it to an 'Int' on the Mojo side, but the Modular website claims that isn't a specific bit width, so I am surprised.
int_19h
It's not a specific bit width in the sense that it maps to whatever largest integer type is natively supported by the target architecture (i.e., basically 32-bit or 64-bit).
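Which also explains the factorial result upthread: 66! is divisible by 2^64, so a wrapping 64-bit Int keeps only zero bits. Easy to check in plain Python, where ints are arbitrary precision:

```python
import math

exact = math.factorial(66)             # exact, arbitrary-precision value
print(exact % (1 << 64))               # 0 -- the low 64 bits are all zero
print(math.factorial(65) % (1 << 64))  # nonzero: 65! only contains 2^63
```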
markkitti
The Python superset concept was always a gimmick. The goal was always to juxtapose Python with a language that superficially looks like Python in order for you to completely migrate from Python to Mojo. It is just providing a smooth ramp for you to do so, in the same way Apple migrated folks from Objective-C to Swift.
Embrace. Extend. Extinguish.
mhh__
[flagged]
0xpgm
Your comment is quite subjective, and Python's popularity both in teaching and in industry would suggest otherwise.
mhh__
Taste is subjective yes, very novel insight.
Python is a hacky language that was never really designed, other than laying its eggs against the grain of what we seem to mostly agree is good, e.g. functional programming and composition.
Big tech spends a lot of money to avoid python in critical infra.
js2
For anyone not up to speed on Mojo: Mojo is a pythonic language for blazing-fast CPU+GPU execution without CUDA.
eddythompson80
I'm glad the blazing-fast tech is finally making it to Python. Those guys were pretty busy with Rust for like the past 10 years. I'm glad they are finally starting to make blazing-fast Python stuff.
pjmlp
It could already be fast, even if not blazing, if there were more love for projects like PyPy.
mindwok
I am really rooting for Mojo. I love what the language is trying to do, and making it easier to run SOTA AI workloads on hardware that isn't Nvidia + CUDA will open up all kinds of possibilities.
I'm just nervous how much VC funding they've raised and what kind of impacts that could have on their business model as they mature.
mkaic
If they can manage to make good on their plans to open-source it, I'll breathe a tentative sigh of relief. I'm also rooting for them, but until they're open-source, I'm not willing to invest my own time into their ecosystem.
jillesvangurp
They already released their code under the Apache 2.0 license. Not everything in their stack is open source but the core things appear to be open source.
ayhanfuat
Their compiler is not open source. They only released stdlib code.
bgwalter
No, Python cannot run Mojo. Like Cython and hundreds of other projects, Mojo creates a C-extension, which is then loaded.
https://docs.modular.com/mojo/manual/get-started/
curl -fsSL https://pixi.sh/install.sh | sh
pixi init life \
-c https://conda.modular.com/max-nightly/ -c conda-forge \
&& cd life
No, thanks. C++ is easier.
hogepodge
One of the big differences here is that the Mojo/Python interop package is doing a lot of the heavy lifting of compiling and loading the code. The goal here is to have a language that's close to Python that gives you hardware acceleration as you need it, without having to configure and build C++ code for every platform.
We've run a few events this year (disclosing again that I work for Modular, to put my comments into context), and we've had some great feedback from people who have never done GPU programming about how easy it was to get started using Mojo.
C++ is a powerful and mature language, but it also has a steep learning curve. There's a lot of space for making GPU (and other high-performance computing) easier, and platforms/languages like Triton and Julia are also exploring the space alongside Mojo. There's a huge opportunity to make GPU programming easier, and a big part of that opportunity is in abstracting away as much of the device-specific coding as you can.
I was a HPC C++ programmer for a long time, and I always found recompiling for new devices to be one of the most painful things about working in C++ (for example, the often necessary nightmare of cmake). Mojo offers a lot of affordances that improve on the programming experience at the language, compiler, and runtime levels.
sanderjd
The example in the article seemed a lot easier to build and use than a C++ extension.
It's unclear to me from this comment, what exactly your pushback is.
justin66
> C++ is easier.
I expect that opinion will not be universally agreed upon.
anon-3988
I am not that intrigued by Python being able to call some pre-compiled functions; this is already possible with any language that produces a dynamic library.
The space I am interested in is execution-time compiled programs. A use case for this is generating a perfect-hash data structure. Say you have a config file that lists the keywords you want to find, and you dynamically generate the perfect-hash data structure, compiled as if those keywords were compile-time values (because they are).
Or, if the number of keywords is too small, fall back to a linear search method. All done at compile time, without the cost of dynamic dispatch.
Of course, I am talking about numba. But I think it is cursed by the fact that the host language is Python. Imagine if Python were more strongly typed; it would open up a whole new scale of optimization.
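A rough sketch of that pattern with numba (assuming numba is installed; this one just freezes the config as constants and linear-scans rather than building a true perfect hash):

```python
from numba import njit

def make_matcher(keys):
    frozen = tuple(keys)  # config values captured as compile-time constants

    @njit
    def contains(x):
        # numba specializes the machine code for the captured tuple
        # the first time this is called
        for k in frozen:
            if x == k:
                return True
        return False

    return contains

# Rebuilt on every run, so a changed config yields freshly compiled code:
contains = make_matcher([3, 17, 42])
print(contains(17), contains(5))  # True False
```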
pjmlp
I would rather imagine CPython adopting the compilation model of Common Lisp, Scheme, Racket, Smalltalk, or Self.
Sadly the contenders in that corner get largely ignored, so we have to make do with special-cased JIT DSLs or with writing native extensions, as in many cases CPython is the only implementation available.
dukebw
Doesn't Mojo already support this in its parameter system? https://docs.modular.com/mojo/manual/parameters/
JITting like you mentioned is supported by the MAX graph API: https://docs.modular.com/max/tutorials/build-custom-ops. It could have a nicer, more Numba-like syntax though; I think you have an interesting idea there.
devjab
> I am not that intrigued by Python being able to call some pre-compiled functions; this is already possible with any language that produces a dynamic library.
> The space I am interested in is execution-time compiled programs. A use case for this is generating a perfect-hash data structure. Say you have a config file that lists the keywords you want to find, and you dynamically generate the perfect-hash data structure, compiled as if those keywords were compile-time values (because they are).
I'm not sure I understand you correctly, but these two seem connected. If I were to do what you want here in Python, I'd build a shared library with zig build-lib and use it with ctypes.
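Roughly like this on the Python side (a sketch; the library and function names are made up for illustration, standing in for whatever zig build-lib produced):

```python
import ctypes

# Hypothetical shared library built ahead of time, e.g. `zig build-lib keywords.zig`
lib = ctypes.CDLL("./libkeywords.so")
lib.contains_keyword.argtypes = [ctypes.c_char_p]
lib.contains_keyword.restype = ctypes.c_bool

print(lib.contains_keyword(b"needle"))
```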
anon-3988
Can Zig recompile itself if I change a config in production? I am talking about this:
```
python program.py --config <change this>
```
It is basically a recompilation of the whole program at every execution, taking the config/machine combination into account.
So if the config contains no keyword for lookup, the program should compile down to a no-op. Or if the config contains keywords that permit a simple perfect-hash algorithm, it should recompile itself to use that mechanism.
I don't think any of the typical systems programming languages allow this.
Scene_Cast2
The thing I focus on when writing compiled extensions for Python isn't the speed of the extension, but rather the overhead of the call and the overhead of moving objects from Python -> compiled and compiled -> Python.
Is there a zero-copy interface for larger objects? How do object lifetimes work in that case? Especially if this is to be used for ML, you need to haul over huge matrices. And the GIL stuff is also a thing.
I wonder how Mojo handles all that.
MohamedMabrouk
AFAIK, there is a zero-copy interface for objects like numpy arrays and torch tensors, where you can do in-place manipulation in Mojo.
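Meaning the callee sees a view of the caller's buffer rather than a copy. Sketched in plain Python/numpy, with an ordinary function standing in for the Mojo side:

```python
import numpy as np

def scale_inplace(buf, factor):
    # Stand-in for a compiled kernel: works through a view of the
    # caller's memory, so nothing crosses the boundary by copy.
    view = np.asarray(buf)  # no copy for an existing ndarray
    view *= factor

x = np.arange(4, dtype=np.float32)
scale_inplace(x, 2.0)
print(x)  # [0. 2. 4. 6.] -- mutated in place
```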
benrutter
I've never been that sold on Mojo. I think I'm unfairly biased away from it because I find new languages interesting, and its big sell is changing as little as possible from an existing language.
That said, importing into Python this easily is a pretty big deal. I can see a lot of teams who just want to get unblocked by some performance thing, finding this insanely helpful!
atomicapple
> its big sell is changing as little as possible from an existing language.
This is not really true. Even though Mojo is adopting Python's syntax, it is a drastically different language under the hood. Mojo is innovating in many directions (eg: mlir integration, ownership model, comptime, etc). The creators didn't feel the need to innovate on syntax in addition to all that.
benrutter
You're right; I probably should have said something like "part of its sell" or "one of its selling points".
I didn't mean to undermine the ambitious goals the project has. I still wish it was a little bolder on syntax though, Python is a large and complex language as is, so a superset of Python is inherently going to be a very complicated language.
markkitti
The creators got burned on Swift for TensorFlow, their first MLIR project. One of the problems with that first venture under Google was that the language was not Python.
Tiberium
> as I'm definitely in the market for a simple compiled language that can offer Python some really fast functions

So, Nim? https://github.com/yglukhov/nimpy
mindwok
The real point of Mojo is not the language, it's the deep roots into MLIR which is an attempt to do what LLVM did for compilers, and do it on GPUs / ML hardware. Chris Lattner is leading the project and he created LLVM and MLIR.
Someone
FTA (emphasis added): “Chris Lattner mentioned that Python can actually CALL Mojo code now”
So, the message is that it is possible to create nice Python bindings from Mojo code, but only if your Mojo code makes the effort to create an interface that uses PythonObject.
Useful, but I don’t see how that’s different from C code doing the same, as bindings go.
Both make it easier to gradually move Python code over to a compiled language.
Mojo presumably will have the advantage that porting from Python to Mojo is much closer to a copy paste job than porting Python to C is.
amval
For a language that announced itself (and raised a lot of money on the premise of) claiming to be "a Python superset", this does not sound like a huge achievement.
In all fairness, their website now reads: "Mojo is a pythonic language for blazing-fast CPU+GPU execution without CUDA. Optionally use it with MAX for insanely fast AI inference."
So I suppose it's now just a compiled language with superficially similar syntax and completely different semantics from Python?
Certhas
I think it was pretty clear immediately that running python code was a far away goal. There was a lot more talk about lifetimes and ownership semantics than details about Python interop. Mojo is more like: Can we take the learnings of Swift and Rust and solve the usability and compile time issues, while building on MLIR to target arbitrary architectures efficiently (and call it a Python superset to raise VC money).
That said, the upside is huge. If they can get to a point where Python programmers who need to add speed learn Mojo rather than C/C++, because it feels more familiar and interops more easily, that would be huge. And it's a much lower bar than being a superset of Python.
amval
It marketed itself explicitly as a "Python superset", which could allow Python programmers to avoid learning a second language and write performant code.
I'd argue that I am not sure what kind of Python programmer is capable of learning things like comptime, borrow checking, generics but would struggle with different looking syntax. So to me this seemed like a deliberate misrepresentation of the actual challenges to generate hype and marketing.
Which fair enough, I suppose this is how things work. But it should be _fair_ to point out the obvious too.
Certhas
Absolutely. The public sales pitch did not match the reality. This is what I meant by the "claim to be Python to get VC money" point.
To first order, today every programmer starts out as a Python programmer. Python is _the_ teaching language now. The jump from Python to C/C++ is pretty drastic; I don't think it's absurd that learning Mojo concepts step by step coming from Python is simpler than learning C. Not syntactically, but conceptually.
fwip
I think one of the "Python superset" promises was that any particular dev wouldn't need to learn all of that at once. There could exist a ramp between Python and "fast python" that is more gradual than the old ways of dropping into C, and more seamless than importing and learning the various numpy/numba/polars libraries.
int_19h
FWIW generics are already a thing in pure Python as soon as you add type annotations, which is fast becoming the default (perhaps not the least because LLMs also seem to prefer it).
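For example, annotated pure Python already exercises the concept, with a type checker such as mypy or pyright enforcing it:

```python
from typing import Generic, TypeVar

T = TypeVar("T")

class Stack(Generic[T]):
    def __init__(self) -> None:
        self._items: list[T] = []

    def push(self, item: T) -> None:
        self._items.append(item)

    def pop(self) -> T:
        return self._items.pop()

s: Stack[int] = Stack()
s.push(1)
# s.push("x")  # rejected by a type checker, though fine at runtime
```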
fnands
They've backed off a little from the Python superset claims and leaned more into "Python family".
> I'd argue that I am not sure what kind of Python programmer is capable of learning things like comptime, borrow checking
One who previously wrote compiled languages ;-). It's not like you forget everything you know once you touch Python.
johnofthesea
> and call it a Python superset to raise VC money
What else was proclaimed just to raise VC money?
dragonwriter
The real unique selling point of Mojo is "CPU+GPU execution without CUDA", specifically, you write code that looks like code without worrying about distinctions like kernels and device functions and different ways of writing code that runs on GPU vs. code that runs on CPU, and mojo compiles it to those things.
eigenspace
Not unique though. Julia has had that since before Mojo was even started.
melodyogonna
What does the performance of Julia's GPU kernels look like in comparison to kernels written by Nvidia or AMD?
saghm
> For a language that announced itself (and raised a lot of money on the premise of) claiming to be "a Python superset", this does not sound like a huge achievement
I feel like that depends quite a lot on what exactly is in the non-subset part of the language. Being able to use a library from the superset in the subset requires being able to translate the features into something that can run in the subset, so if the superset is doing a lot of interesting things at runtime, that isn't necessarily going to be trivial.
(I have no idea exactly what features Mojo provides beyond what's already in Python, so maybe it's not much of an achievement in this case, but my point is that this has less to do with just being a superset and more to do with what exactly the extra stuff is, so I'm not sure I buy the argument that the marketing you mention is enough to conclude that this isn't much of an achievement.)
melodyogonna
I've written this somewhere else before: Modular did not raise $130m to build a programming language; nobody does that. They raised that much money to revolutionize AI infrastructure, of which a language is just one part. You should definitely check out some of the things they've put together; they're amazing.
Imustaskforhelp
Yes, they are revolutionizing AI infrastructure, but I guess a lot of the world is just babbling about AI, and not every developer needs to worry about AI.
And so the improvements in Mojo, and now calling Mojo code from Python, are a much bigger net positive for the community than being just some other AI infrastructure company.
So I do wish Mojo a lot of good luck. I have heard that Mojo isn't open source but that there are plans to open it. I'd like to try it if it's as fast as Rust (or even a little slower) and as understandable as Python.
ModernMech
Agreed, Modular is walking a very fine line, and they're doing so by trading on the reputation of Chris Lattner.
On the one hand, as the other poster noted, no one raises $100M+ for a programming language; programming languages have no ROI that would justify that kind of money. So to get it, they had to tell VCs a story about how they're going to revolutionize AI. It can't just be "Python superset with MLIR". That's not a $100M story.
On the other hand, they need to appeal to the dev community. Devs want open source, they want integration with their tools, and they don't want an IP-encumbered ecosystem that tries to lock them in.
That's where the tension is. To raise money you need to pretend you're the next Oracle, but to get dev buy-in you have to promise you're not the next Oracle.
So the line they've decided to walk is "We will be closed for now while we figure out the tech. Then later, once we have money coming in to make the VCs happy, we can try to make good on our promise to be open."
That last part is the thing people are having trouble believing. Because the story always goes: "While we had the best intentions to be open and free, that ultimately came secondary to our investors' goal of making money. Because our continued existence depends on more money, we have decided to abandon our goal of being open and free."
And that's what makes these VC-funded language plays so fraught for devs. Spend the time to learn this thing which may never even live up to its promises? Most people won't, and I think the Darklang group found that out pretty decisively.
melodyogonna
I don't think investors look at what makes a net positive for the community when making large investments like the one in Modular. I was calling out the part of the post that said Modular raised a lot of money to develop Mojo; that isn't entirely true, as just creating a language isn't enough reason to invest $130m into a company, no matter how much net positivity the language would bring.
dismalaf
It was never going to have Python semantics and be fast. Python isn't slow because of a lack of effort or money, it's slow because of all the things happening in the interpreter.
meander_water
Fwiw the website still claims this:
> Further, we decided that the right long-term goal for Mojo is to adopt the syntax of Python (that is, to make Mojo compatible with existing Python programs) and to embrace the CPython implementation for long-tail ecosystem support
Which I don't think has changed.
boxed
I believe they're still working towards making the syntax and semantics more python-like.
pjmlp
Given that Nvidia has now decided to get serious about Python JIT DSLs in CUDA, as announced at GTC 2025, I wonder how much mindshare Mojo will manage to win among researchers.
"1001 Ways to Write CUDA Kernels in Python"
https://www.youtube.com/watch?v=_XW6Yu6VBQE
"The CUDA Python Developer’s Toolbox"
https://www.nvidia.com/en-us/on-demand/session/gtc25-S72448/
"Accelerated Python: The Community and Ecosystem"
https://www.youtube.com/watch?v=6IcvKPfNXUw
"Tensor Core Programming in Python with CUTLASS 4.0"
https://www.linkedin.com/posts/nvidia-ai_python-cutlass-acti...
There is also Julia, the black swan that many outside the Python community have moved to, with much more mature tooling and first-tier Windows support, for those researchers who for whatever reason have Windows-issued work laptops.
https://info.juliahub.com/industries/case-studies
Mojo as a programming language seems interesting to a language nerd, but I think the jury is still out on whether this is going to be another Swift, or another Swift for TensorFlow, in terms of market adoption, given the existing contenders.