Asking LLMs to create my game Shepherd's Dog

64 comments

March 8, 2025

vnglst

Shepherd's Dog is a game I've wanted to create for a long time, but I never got the sheep flocking behaviour just right. The goal of the game is to herd all the sheep into the pen before nightfall. I've asked several models to create this game and I'm particularly impressed with what Claude 3.7 could do with a one-shot prompt.

- You can play the Claude game here (note: doesn't work on Safari for some reason): https://html-preview.github.io/?url=https://raw.githubuserco...

- o3-mini's version is here: https://html-preview.github.io/?url=https://raw.githubuserco...

Results of other models and a leaderboard are here: https://github.com/vnglst/when-ai-fails/blob/main/shepards-d...

Some videos: https://hachyderm.io/@vnglst/114125938185826311

Keyframe

> Shepherd's Dog is a game I've wanted to create for a long time

Not sure if you're aware, but there was a game like that for PlayStation and GBA, called Sheep! https://en.wikipedia.org/wiki/Sheep_(video_game) Here's some gameplay footage (the player here didn't choose a dog to play with for some reason): https://www.youtube.com/watch?v=SP058CHQj20 The premise of the game is the same: you run the sheep to the designated area over obstacles.

vnglst

Ah, thanks for this. The game above is lovely and it’s really similar to what I had in mind (I was also thinking of Lemmings!). I see in the other comments below that this idea of mine has already been made into a game many times. Seems like I’m not as original as I thought haha

Keyframe

> Seems like I’m not as original as I thought haha

In creative work that's absolutely irrelevant. Don't even think about that. Everything has been done before; it's your take that counts, your vision!

AustinDev

Just tried a 1-shot on Grok3 - Thinking and it couldn't get past the start button. Throws an error: "67:39 Uncaught ReferenceError: startGame is not defined"

Scope issue.

No barking or dog player model but pretty similar in style to Claude's output.

What's interesting to me about playing with AI codegen is that each model has specific and sometimes overlapping output errors. Claude 3.7 really likes to solve errors by returning dummy data as a 'fallback' when doing client or server calls. A little prompting can reduce this but not eliminate it. 'The tests always pass if you return dummy data'
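To make the 'fallback' complaint concrete, here is a minimal Python sketch of the pattern. All names (`UpstreamError`, `failing_backend`, `load_scores_with_fallback`, `load_scores_strict`, `naive_test`) are hypothetical and made up for illustration; this is not taken from any model's actual output.

```python
class UpstreamError(Exception):
    """Raised when the (hypothetical) backend call fails."""


def failing_backend():
    # Hypothetical stand-in for a client or server call that is broken.
    raise UpstreamError("backend unavailable")


def load_scores_with_fallback(fetch):
    # Anti-pattern: swallow the failure and return plausible dummy data.
    try:
        return fetch()
    except UpstreamError:
        return [{"player": "demo", "score": 0}]


def load_scores_strict(fetch):
    # Alternative: let the failure propagate so callers and tests can see it.
    return fetch()


def naive_test(load):
    # A weak test that only checks the shape of the result.
    result = load(failing_backend)
    return isinstance(result, list) and len(result) > 0
```

With the fallback version, `naive_test` reports success even though the backend is down; with the strict version the `UpstreamError` surfaces, which is usually what you want while debugging.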

https://jsfiddle.net/aL3ugtj1/

jchw

Here is an attempt using Google Gemini 2.0 Pro Experimental.

https://gist.github.com/jchv/e8869a7cbe2d854a0ec93e946030d90...

It seems like it has some issues, but the result is interesting nonetheless. Just a one-shot like the others; it needed a single "Keep going", but otherwise this is the vanilla output from the prompt.

Edit: Looks like you can share an HTML preview of a gist using html-preview.github.io, so here's that. https://html-preview.github.io/?url=https://gist.githubuserc... - It'll go to level 2 if you refresh the page and hit Restart, but I don't think it's possible to clear Level 2. The flock stays too far apart to fit enough sheep in the pen.

n4r9

I just played the Claude attempt and found that the "fence" in level 3 doesn't actually obstruct either the dog or the sheep. Otherwise pretty fun.
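For what it's worth, a fence that actually blocks movement can be quite small. The following is a rough Python sketch under my own assumptions (axis-aligned rectangular fences, point-sized dog and sheep); `Rect` and `move_with_obstacles` are hypothetical names and not from the Claude game's code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    x: float
    y: float
    width: float
    height: float

    def contains(self, px, py):
        # True if the point lies inside or on the edge of the rectangle.
        return (self.x <= px <= self.x + self.width
                and self.y <= py <= self.y + self.height)


def move_with_obstacles(pos, vel, obstacles):
    """Try the full move, then each axis alone, so entities slide along a fence.

    Note: this only tests the end point, so a step longer than the fence
    thickness could still tunnel through it.
    """
    x, y = pos
    dx, dy = vel
    for nx, ny in ((x + dx, y + dy), (x + dx, y), (x, y + dy)):
        if not any(r.contains(nx, ny) for r in obstacles):
            return (nx, ny)
    # Every candidate is blocked: stay put.
    return (x, y)
```

Running both the dog and the sheep through the same function each frame would be one way to make the fence obstruct both.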

swyx

great demos. one-shotting isn't really fair imo; i feel like that might be hard even for a human to do (working without feedback). i'd be curious what deepseek would do with a bit more feedback.

breckenedge

Since you’re releasing the code to GitHub, do you think you’ll eventually run into issues with the training data including prior versions of the game?

tdy_err

The implied scenario being that the memory of its own output would result in the model producing degraded future output? Why is that a given?

mythrwy

Probably the same reason that close relatives marrying each other for generations produces genetic problems.

Chaosvex

Read about model collapse. The TL;DR is garbage in, garbage out.

https://en.wikipedia.org/wiki/Model_collapse
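A toy illustration of the idea, not a claim about how any real LLM is trained: repeatedly fit a Gaussian to a small sample drawn from the previous fit. The function name `simulate_collapse` and all parameter values are assumptions chosen for the sketch.

```python
import random
import statistics


def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Refit a Gaussian to its own samples and record the fitted std each time."""
    rng = random.Random(seed)
    mean, std = 0.0, 1.0
    history = [std]
    for _ in range(generations):
        # Each "generation" only ever sees data produced by the previous one.
        samples = [rng.gauss(mean, std) for _ in range(sample_size)]
        mean = statistics.fmean(samples)
        std = statistics.pstdev(samples)
        history.append(std)
    return history
```

With small samples the fitted spread tends to drift toward zero over many generations, i.e. the "model" gradually forgets the tails of the original distribution.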

frotaur

Seems o3-mini implements the 'boids' algorithm for flocking (likely due to its prevalence online), but I find that here it doesn't really fit.

Indeed, in boids each element has a constant (or minimum) speed, so the sheep never stop 'running'. I find the Claude flocking behaviour looks more natural for sheep.
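A small Python sketch of the difference, assuming a typical boids-style speed clamp (I have not checked what either generated game actually does; `clamp_speed`, `step_velocity`, `speed_after` and the constants are hypothetical):

```python
import math


def clamp_speed(vx, vy, min_speed, max_speed):
    """Rescale a velocity so its magnitude lies in [min_speed, max_speed]."""
    speed = math.hypot(vx, vy)
    if speed == 0.0:
        # No direction to rescale along; a real boid would pick a heading here.
        return (0.0, 0.0)
    target = min(max(speed, min_speed), max_speed)
    scale = target / speed
    return (vx * scale, vy * scale)


def step_velocity(vx, vy, damping, min_speed, max_speed):
    """Apply friction-like damping, then clamp the speed."""
    return clamp_speed(vx * damping, vy * damping, min_speed, max_speed)


def speed_after(steps, min_speed):
    """Speed of an undisturbed agent after `steps` updates."""
    vx, vy = 2.0, 0.0
    for _ in range(steps):
        vx, vy = step_velocity(vx, vy, damping=0.9,
                               min_speed=min_speed, max_speed=3.0)
    return math.hypot(vx, vy)
```

With `min_speed=1.0` the agent keeps cruising at the minimum speed forever; with `min_speed=0.0` the same damping lets it settle to a standstill, which is closer to how an undisturbed sheep behaves.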

franze

ChatGPT o1 Pro

Demo: https://show.franzai.com/a/clean-parrot-brown (Page will self-destruct after 3 months, feel free to host it somewhere else)

One-shot prompt: https://chatgpt.com/share/67cff8e6-e218-8009-af5b-d91060eaed...

franze

After some rounds in Cursor using different models

https://show.franzai.com/a/leaf-bug-wasp (LGPT - feel free to fork - Page will vanish in 3 months)

patates

Best attempt so far IMHO. Very hard though!

stevage

Wow, really impressive.

franze

https://sheep.franzai.com/ now a bit more polished

vnglst

Wow this one is great!

HenryBemis

Tip: don't push them into a corner! I got up to lvl 7 without a problem, and then I got them stuck in a corner and that was it :( Poor sheep will spend the night in the cold outside the barn!

shever73

After nearly 40 years, Shep has finally been released!

See the Crash magazine "Unclear User" parody on page 125 of the August 1985 edition for context. [0]

[0] https://archive.org/details/Crash_No._19_1985-08_Newsfield_G...

srejk

That you remember this from one month before I was born is incredible.

the_arun

All the demo sites are flagged by Microsoft Edge as "This site has been reported as unsafe". The irony is that the demos are hosted on GitHub Pages.

NitpickLawyer

uBO lite as well:

> uBO Lite has prevented the following page from loading:
>
> https://html-preview.github.io/?url=https://raw.githubuserco...
>
> The page was blocked because of a matching filter in OpenPhish Domain Blocklist.

tigerlily

Are they doing this now? Oh brother! And here I was thinking WASM would be a good solution to the desktop exe signing problem for my community's roguelike. Instead browser vendors are likely just going to ban the site.

shakna

Probably related to this recent chaos. [0]

[0] https://www.theregister.com/2025/03/10/infosec_in_brief/

ido

Interesting, I'm also using Edge (with all security settings set to maximum) and it works fine for me. Maybe the difference is that I'm using it on Mac?

owenpalmer

The one that Claude created was a legitimately fun game! If it implemented boids similar to o3-mini, it would be even better. Slap some sprites on it and put it on Steam!

jofzar

I clicked it and went, oh this is actually fun. It feels very much like a mobile game from the early iPhone days.

matsemann

On desktop the map is huuuuge and it's not particularly fun waiting for them to slowly move all the way to the opposite corner. It's cool that one can prototype this quickly, but needs some tweaks from play-testing as with all games I guess.

boredhedgehog

Claude actually animated the nightfall, unprompted.

I don't think it's fair to say Mistral didn't implement flocking. The force is just very weak.

EDIT: I guess I confused flocking with herding, fair enough.

h4kor

Quite impressive!

I've build a very similar game for a 3 hour game jam once :D

https://h4kor.itch.io/herding-simulator

matsemann

I once made a boid-thingy, which this also reminds me of. https://matsemann.github.io/boids-workshop/ (and since the parent game is mostly boid behavior with a goal condition, I guess that's why the LLM is so successful in implementing it?)

The link is the final result with lots of controls, but the idea is that it's a tutorial/workshop where you build it step by step yourself, in Norwegian though https://github.com/Matsemann/boids-workshop

oneeyedpigeon

Weird coincidence, but you made the exact same typo the OP made in their prompt. It's "built", not "build" :)

4ndrewl

Ha, I've been creating this on-and-off for a while. Just last night I asked various LLMs to implement a boid-with-predator algorithm and all failed hard.

Instead I spent an hour reading through a description and implementing it manually, and it at least worked.

But yes, boids is a good start, but it requires some work to make it more natural for mammals, who can have a 0 min speed.
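A rough Python sketch of the predator part, with friction in place of a minimum speed so the sheep can come to rest. This is only the flee term (no cohesion, alignment or separation), and `flee_force`, `update_sheep` and all constants are hypothetical choices for illustration, not the algorithm from the description mentioned above.

```python
import math


def flee_force(sheep, dog, fear_radius, strength):
    """Push away from the dog; stronger when close, zero outside fear_radius."""
    dx, dy = sheep[0] - dog[0], sheep[1] - dog[1]
    dist = math.hypot(dx, dy)
    if dist == 0.0 or dist >= fear_radius:
        return (0.0, 0.0)
    # Unit vector away from the dog, scaled by how deep inside the radius we are.
    scale = strength * (1.0 - dist / fear_radius) / dist
    return (dx * scale, dy * scale)


def update_sheep(pos, vel, dog, dt=1.0, fear_radius=100.0,
                 strength=2.0, friction=0.8):
    """One step: add the flee force, apply friction, integrate. No minimum speed."""
    fx, fy = flee_force(pos, dog, fear_radius, strength)
    vx = (vel[0] + fx * dt) * friction
    vy = (vel[1] + fy * dt) * friction
    return (pos[0] + vx * dt, pos[1] + vy * dt), (vx, vy)
```

Because nothing enforces a minimum speed, a sheep that is far from the dog simply slows down and stops instead of circling endlessly.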

franze

Here is an updated version using ChatGPT o1 Pro, Claude, and Cursor: https://news.ycombinator.com/item?id=43360648

cainxinth

I finally got around to playing Red Dead Redemption 1 recently and was surprised at how much I enjoyed the cattle driving missions.

viccis

Brought me back to the cowherding missions in RDR. Not a fond memory, but still a memory nonetheless.
