Skip to content(if available)orjump to list(if available)

HN

Show HN: I'm an airline pilot – I built interactive graphs/globes of my flights

Normalizing Flows Are Capable Generative Models

machinelearning.apple.com

A Brief History of Children Sent Through the Mail (2016)

smithsonianmag.com

C compiler for Web Assembly (c4wa)

James Webb Space Telescope Reveals Its First Direct Image of an Exoplanet

smithsonianmag.com

SymbolicAI: A neuro-symbolic perspective on LLMs

Reinforcement learning, explained with a minimum of math and jargon

understandingai.org

Multi-Stage Programming with Splice Variables

Qwen VLo: From "Understanding" the World to "Depicting" It

qwenlm.github.io

Facebook is starting to feed its AI with private, unpublished photos

Weird Expressions in Rust

Structuring Arrays with Algebraic Shapes

10 Years of Pomological Watercolors

parkerhiggins.net

nimbme – Nim bare-metal environment

Transmitting data via ultrasound without any special equipment

bootc-image-builder: Build your entire OS from a Containerfile

Theoretical Analysis of Positional Encodings in Transformer Models

Spark AI (YC W24) is hiring a full-stack engineer in SF (founding team)

ycombinator.com

New Process Uses Microbes to Create Valuable Materials from Urine

newscenter.lbl.gov

Rust in the Linux kernel: part 2

Show HN: Do you know RGB?

maxwellito.github.io

The Journey of Bypassing Ubuntu's Unprivileged Namespace Restriction

u1f383.github.io

Whitesmiths C compiler: One of the earliest commercial C compilers available

Reinforcement learning, explained with a minimum of math and jargon

Reinforcement learning, explained with a minimum of math and jargon

1 comments

·June 24, 2025

mnkv

reasonable post with a decent analogy explaining on-policy learning, only major thing I take issue with is

> Reinforcement learning is a technical subject—there are whole textbooks written about it.

and then linking to the still wip RLHF book instead of the book on RL: Sutton & Barto.