Skip to content(if available)orjump to list(if available)

WorldGrow: Generating Infinite 3D World

keyle

This is cool. And could be fun in games. Not sure I get the point otherwise... The thought that came to mind was "Architectural slop".

fjfaase

I wonder if they also have a strategy for deleting generate tiles, otherwise the infinite is limited to the size of available memory. I also wonder if with their method can exactly recreate tiles that have been deleted. Or in other words, that they have a method for generating unique seeds for all tiles. The paper does not give much technical details. If the seed has a limited size and there is a method for generating seeds for each 2D coordinate, I wonder if it is possible to make a non-repeating infinite world. I think it is not possible with a limited size seed.

Garlef

I don't think generating virtual space is the issue.

It's about generating interesting virtual space!

james-bcn

Yep. People have been doing this kind of stuff for computer games for decades. It's actually not that difficult. It's not clear what novel problem is being solved here.

jsheard

Yeah but those traditional procgen techniques don't use AI, and this one does use AI. They solved the problem of them not being AI enough for the AI era. AI!

agravier

Do you have some particular piece of software or tech demo or game in mind with interesting very large generated 3D worlds?

SiempreViernes

In Mario 64 there is a staircase you can run up forever, granted it looks the same no matter how long you have Mario run up the stairs, but that certainly fits "big but uninteresting 3d world."

antonvdi

Minecraft surely fits those criteria.

sirtaj

Valheim and No Man's Sky are ones I've played recently.

jpalomaki

” The generated scenes are walkable and suitable for navigation/planning evaluation.”

Maybe the idea is to create environments for AI robotics traini ng.

analog8374

Consider the levels generated in any roguelike.

Consider the patterns generated by cellular automata.

Both tend to stay interesting in the small scale but lose it to boring chaos in the large.

For this reason I think the better approach is to start with a simple level-scale form and then refine it into smaller parts, and then to refine those parts and so on.

(Vs plugging away at tunnel-building like a mole)

rootlocus

Or at least coherent.

gcr

This could be a great way to make backrooms horror environments!

I've dreamed of a NeRF-powered backrooms walking simulator for quite a while now. This approach is "worse" because the mesh seems explicit rather than just the world becoming what you look at, but that's arguably better for real-world use cases of course.

grumbelbart2

> backrooms horror environments

True, it sounds (and looks) a lot like https://scp-wiki.wikidot.com/scp-3008

embedding-shape

It is only a paper as of now:

> The code is being prepared for public release; pretrained weights and full training/inference pipelines are planned.

Any ideas of how it would different and better compared to "traditional" PCG? Seems like it'd give you more resource consumption, worse results and less control, neither of which seem like a benefit.

glenneroo

The description in the linked YouTube video for some reason has more info than the github repo:

> We tackle the challenge of generating the infinitely extendable 3D world — large, continuous environments with coherent geometry and realistic appearance. Existing methods face key challenges: 2D-lifting approaches suffer from geometric and appearance inconsistencies across views, 3D implicit representations are hard to scale up, and current 3D foundation models are mostly object-centric, limiting their applicability to scene-level generation. Our key insight is leveraging strong generation priors from pre-trained 3D models for structured scene block generation. To this end, we propose WorldGrow, a hierarchical framework for unbounded 3D scene synthesis. Our method features three core components: (1) a data curation pipeline that extracts high-quality scene blocks for training, making the 3D structured latent representations suitable for scene generation; (2) a 3D block inpainting mechanism that enables context-aware scene extension; and (3) a coarse-to-fine generation strategy that ensures both global layout plausibility and local geometric/textural fidelity. Evaluated on the large-scale 3D-FRONT dataset, WorldGrow achieves SOTA performance in geometry reconstruction, while uniquely supporting infinite scene generation with photorealistic and structurally consistent outputs. These results highlight its capability for constructing large-scale virtual environments and potential for building future world models.

jackdoe

cant wait for the new diablo :)

pjmlp

With a quarter the size of the development team, 'cause productivity!

speedgoose

It looks more like the Stanley parable.

null

[deleted]

rana762

[dead]