Skip to content(if available)orjump to list(if available)

Updated rate limits for unauthenticated requests

TheNewsIsHere

I don’t think the publication date (May 8, as I type this) on the GitHub blog article is the same date this change became effective.

From a long-term, clean network I have been consistently seeing these “whoa there!” secondary rate limit errors for over a month when browsing more than 2-3 files in a repo.

My experience has been that once they’ve throttled your IP under this policy, you cannot even reach a login page to authenticate. The docs direct you to file a ticket (if you’re a paying customer, which I am) if you consistently get that error.

I was never able to file a ticket when this happened because their rate limiter also applies to one of the required backend services that the ticketing system calls from the browser. Clearly they don’t test that experience end to end.

gnabgib

60 req/hour for unauthenticated users

5000 req/hour for authenticated - personal

15000 req/hour for authenticated - enterprise org

According to https://docs.github.com/en/rest/using-the-rest-api/rate-limi...

I bump into this just browsing a repo's code (unauth).. seems like it's one of the side effects of the AI rush.

mijoharas

Why would the changelog update not include this? it's the most salient piece of information.

I thought I was just misreading it and failing to see where they stated what the new rate limits were, since that's what anyone would care about when reading it.

PaulDavisThe1st

Several people in the comments seem to be blaming Github for taking this step for no apparent reason.

Those of us who self-host git repos know that this is not true. Over at ardour.org, we've passed the 1M-unique-IP's banned due to AI trawlers sucking our repository 1 commit at a time. It was killing our server before we put fail2ban to work.

I'm not arguing that the specific steps Github have taken are the right ones. They might be, they might not, but they do help to address the problem. Our choice for now has been based on noticing that the trawlers are always fetching commits, so we tweaked things such that the overall http-facing git repo works, but you cannot access commit-based URLs. If you want that, you need to use our github mirror :)

jorams

> These changes will apply to operations like cloning repositories over HTTPS, anonymously interacting with our REST APIs, and downloading files from raw.githubusercontent.com.

Or randomly when clicking through a repository file tree. The first time I hit a rate limit was when I was skimming through a repository on my phone, and about the 5th file I clicked I was denied and locked out. Not for a few seconds either, it lasted long enough that I gave up on waiting then refreshing every ~10 seconds.

pogue

I assume they're trying to keep ai bots from strip mining the whole place.

Or maybe your IP/browser is questionable.

globie

What's being strip mined is the openness of the Internet, and AI isn't the one closing up shop. Github was created to collaborate on and share source code. The company in the best position to maximize access to free and open software is now just a dragon guarding other people's coins.

The future is a .txt file of John Carmack pointing out how efficient software used to be, locked behind a repeating WAF captcha, forever.

voidnap

I encountered this on github last week. Very agressive rate limiting. My browser and IP is very ordinary.

Since Microsoft is struggling to make ends meet, maybe they could throw a captcha or proof of work like Anubis by xe iaso.

They already disabled code search for unauthenticated users. Its totally plausible they will disable code browsing as well.

kstrauser

That hit me, too. I thought it was an accidental bug and didn’t realize it was actually malice.

confusing3478

> Or maybe your IP/browser is questionable.

I'm using Firefox and Brave on Linux from a residential internet provider in Europe and the 429 error triggers consistantly on both browsers. Not sure I would consider my setup questionable considering their target audience.

grodriguez100

I’m browsing from an iPhone in Europe right now and can browse source code just fine without being logged in.

tostr

*other ai bots, ms will obviously mine anything on there.

Personally, I like sourcehut (sr.ht)

immibis

Same way Reddit sells all its content to Google, then stops everyone else from getting it. Same way Stack Overflow sells all its content to Google, then stops everyone else from getting it.

(Joke's on Reddit, though, because Reddit content became pretty worthless since they did this, and everything before they did this was already publicly archived)

croes

Other bots or MS bots too?

Zdh4DYsGvdjJ

dang

(This was originally posted as a reply to https://news.ycombinator.com/item?id=43981344 but we're merging the threads)

croes

Doesn‘t make it any better.

Collateral damage of AI I guess

formerly_proven

It's even more hilarious because this time it's Microsoft/Github getting hit by it. (It's funny because MS themselves are such a bad actor when it comes to AIAIAI).

fragmede

This is the same Microsoft that owns LinkedIn which got sued by HiQ which is where the ruling came from that is making sites login required.

jarofgreen

Also https://github.com/orgs/community/discussions/157887 "Persistent HTTP 429 Rate Limiting on *.githubusercontent.com Triggered by Accept-Language: zh-CN Header" but the comments show examples with no language headers.

I encountered this too once, but thought it was a glitch. Worrying if they can't sort it.

Euphorbium

I remember getting this error a few months ago, this does not seem like a temporary glitch. They dont want llm makers to slurp all the data.

new_user_final

Isn't git clone faster than browsing web?

PaulDavisThe1st

Yep. But AI trawlers don't use it. Ask them why.

null

[deleted]

trallnag

Good that tools like Homebrew that heavily rely on GitHub usually support environment variables like GITHUB_TOKEN

stevekemp

Once again people post in the "community", but nobody official replies; these discussion-pages are just users shouting into the void.