Bridging the gap between keyword and semantic search with SPLADE (2024)

2 comments

·May 5, 2025

jbellis

I'm kind of disappointed in this article. SPLADE is a cool way to improve the results of a TF-IDF index with minimally invasive changes, and the article obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of SPLADE.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE, which tries to marry SPLADE and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1
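To make the point about weights concrete, here is a minimal sketch (not the article's code) comparing a score that only counts shared tokens with a sparse dot product that uses the per-token weights. The token weights below are made up for illustration and do not come from a real SPLADE model; `overlap_score` and `weighted_score` are hypothetical helper names.

```python
def overlap_score(query, doc):
    """Count shared tokens, ignoring weights (what you get if weights are discarded)."""
    return len(query.keys() & doc.keys())


def weighted_score(query, doc):
    """Sparse dot product: sum of query_weight * doc_weight over shared tokens."""
    return sum(w * doc[t] for t, w in query.items() if t in doc)


# Illustrative (invented) token weights, shaped like SPLADE output: token -> weight.
query = {"search": 2.0, "semantic": 1.0, "retrieval": 0.5}
doc_a = {"search": 0.25, "semantic": 0.25, "cooking": 3.0}  # mentions the terms only weakly
doc_b = {"search": 2.0, "retrieval": 1.0}                   # strongly about search

for name, doc in [("doc_a", doc_a), ("doc_b", doc_b)]:
    print(name, overlap_score(query, doc), weighted_score(query, doc))
```

Both documents share two tokens with the query, so the unweighted score cannot separate them, while the weighted score ranks `doc_b` well above `doc_a`.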

JnBrymn

You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the weight of every token. Maybe I'll make an update post to try again.
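A rough sketch of what that could look like, assuming a recent Elasticsearch version that offers the `sparse_vector` field type and a `sparse_vector` query accepting precomputed weights via `query_vector`. The field name `splade_tokens` and the helper functions are hypothetical, and the request shapes should be checked against the documentation for the version in use. The code only builds request bodies as plain dicts and does not talk to a cluster.

```python
import json

FIELD = "splade_tokens"  # hypothetical field name, not from the article


def sparse_vector_mapping(field=FIELD):
    """Index mapping with a text field and a sparse_vector field for token weights."""
    return {
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                field: {"type": "sparse_vector"},
            }
        }
    }


def to_document(text, token_weights, field=FIELD):
    """Keep each token together with its weight instead of discarding the weight.

    SPLADE weights are non-negative; zero-weight tokens carry no information,
    so they are dropped here.
    """
    kept = {tok: float(w) for tok, w in token_weights.items() if w > 0}
    return {"text": text, field: kept}


def sparse_vector_query(token_weights, field=FIELD):
    """Search body that scores documents against weighted query tokens."""
    kept = {tok: float(w) for tok, w in token_weights.items() if w > 0}
    return {"query": {"sparse_vector": {"field": field, "query_vector": kept}}}


# Invented weights for illustration only.
doc = to_document("semantic search with sparse vectors", {"search": 1.5, "the": 0.0})
body = sparse_vector_query({"search": 2.0, "retrieval": 0.5})
print(json.dumps(doc))
print(json.dumps(body))
```

These dicts could then be passed to whatever client is in use for index creation, indexing, and search.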