HN

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

2 comments

April 15, 2025

solomatov

Does anyone know whether Mamba has been tested at a really large scale? To me it looks like the most promising successor to the transformer architecture. Does anyone know why that might not be the case, or what the other alternatives are?

ed

Interesting direction for research, but not a model you’d want to use today. The paper looks at a 3B model built on Llama-3.2-3B, modified for Mamba, and compares it to a distilled version of R1 with 1.5B params.