HN

Client-side GPU load balancing with Redis and Lua

2 comments · December 2, 2025

lneiman

Author here. We were hitting high tail latency and low GPU utilization when serving SLMs via Triton.

I built a scrappy client-side router using Redis and Lua to track real-time GPU load. It boosted utilization by ~40% and improved latencies.

Happy to hear feedback on the implementation or thoughts on better ways to do this!
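For readers who want a concrete picture of the idea, here is a minimal sketch of least-loaded client-side routing. It is not the author's implementation: the class name, backend names, and in-flight counters are all hypothetical, and a plain in-memory dict stands in for the shared Redis state. The lock plays the role the post gives to Lua, which is to make "pick the least-loaded GPU and bump its counter" a single atomic step so that concurrent clients do not all choose the same backend.

```python
import threading


class LeastLoadedRouter:
    """In-memory stand-in for a shared load table (assumed to live in Redis)."""

    def __init__(self, backends):
        # Hypothetical load metric: number of in-flight requests per backend.
        self._inflight = {backend: 0 for backend in backends}
        # The lock mimics the atomicity a server-side Lua script would provide.
        self._lock = threading.Lock()

    def acquire(self):
        """Atomically pick the least-loaded backend and count the new request."""
        with self._lock:
            backend = min(self._inflight, key=self._inflight.get)
            self._inflight[backend] += 1
            return backend

    def release(self, backend):
        """Mark one request on `backend` as finished."""
        with self._lock:
            self._inflight[backend] -= 1

    def snapshot(self):
        """Return a copy of the current load table."""
        with self._lock:
            return dict(self._inflight)


router = LeastLoadedRouter(["gpu-0", "gpu-1", "gpu-2"])
picked = [router.acquire() for _ in range(4)]  # four requests arrive
router.release(picked[0])                      # the first one completes
```

Ties go to the first backend in insertion order, so four back-to-back requests spread across all three GPUs before any GPU receives a second one.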

pbrumm

Have you tried switching to a job queue where the GPU instances pull work and keep themselves busy? That way you can autoscale the GPUs based on utilization. I find it easier to tune, and latency and backlogs are easier to monitor. It does require some async mechanisms on the client, but I have found it easier to maintain.
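The pull model suggested here can be sketched as follows. This is only an illustration under assumptions: the function name, worker names, and the `handle` callback are hypothetical, and an in-process `queue.Queue` stands in for whatever durable queue a real deployment would use. Each worker takes the next job as soon as it is free, so no router needs to know per-GPU load, and the queue depth is a direct backlog signal that could drive autoscaling.

```python
import queue
import threading


def run_pull_workers(jobs, worker_names, handle):
    """Drain `jobs` with workers that pull from one shared backlog."""
    backlog = queue.Queue()
    for job in jobs:
        backlog.put(job)

    results = {}
    results_lock = threading.Lock()

    def worker(name):
        while True:
            try:
                job = backlog.get_nowait()  # pull work only when idle
            except queue.Empty:
                return  # backlog drained, worker exits
            output = handle(job)
            with results_lock:
                results[job] = (name, output)

    threads = [threading.Thread(target=worker, args=(n,)) for n in worker_names]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # Remaining queue depth is the backlog metric one would monitor.
    return results, backlog.qsize()


results, remaining = run_pull_workers(range(6), ["gpu-0", "gpu-1"], lambda x: x * x)
```

Which worker handles which job depends on thread scheduling, so only the outputs and the final backlog are deterministic. In a real service the client would submit a job and await the result asynchronously, which is the extra client-side machinery the comment mentions.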