Skip to content(if available)orjump to list(if available)

HN

The owner of ip4.me/ip6.me, Kevin Loch, passed away

Bayleaf · Building a low-profile wireless split keyboard

Satellogic's Open Satellite Feed

tech.marksblogg.com

DiffRhythm: Fast End-to-End Full-Length Song Generation with Latent Diffusion

aslp-lab.github.io

Show HN: Appstat – Process Monitor for Windows

Foundry (YC F24) Hiring Founding Engineer to Build an Internet-Scale Web Crawler

ycombinator.com

Repairable Flatpack Toaster

Windows NT for GameCube/Wii

Enhancing Frame Detection with Retrieval Augmented Generation

Lawrence of Arabia, Paul Atreides, and the roots of Frank Herbert's Dune (2021)

TSMC expected to announce $100B investment in U.S.

DeepSeek's smallpond: Bringing Distributed Computing to DuckDB

mehdio.substack.com

The IBM 650: An appreciation from the field (1986) [pdf]

Show HN: PG-Capture – a better way to sync Postgres with Algolia (or Elastic)

pg-capture.onrender.com

SQLite-on-the-server is misunderstood: Better at hyper-scale than micro-scale

DIY "infinity contrast" TV – with 100% recycled parts [video]

An Experimental Study of Bitmap Compression vs. Inverted List Compression

Ask HN: Who is hiring? (March 2025)

Hacking the Xbox 360 Hypervisor Part 2: The Bad Update Exploit

Ask HN: What less-popular systems programming language are you using?

The Golden Age of Japanese Pencils (2022)

notes.stlartsupply.com

Comparing Fuchsia components and Linux containers [video]

Show HN: Agents.json – OpenAPI Specification for LLMs

Launch HN: Cuckoo (YC W25) – Real-time AI translator for global teams

Tips for using Gemini 2.0 for PDF ingestion

Tips for using Gemini 2.0 for PDF ingestion

5 comments

·March 4, 2025

jtrueb

Anyone have recommendations for chip datasheets? Ive explored a couple options so far, but getting some bitfields wrong is super annoying.

I see plenty of examples like the one here that are on easier extractions. A PDF to HTML or Markdown converter will probably get it right with OCR.

petercooper

No direct recommendation for that use case, but one strategy I've heard being used and that works with complex documents (or where hallucinations are Very Bad™ - like invoice processing) is using multiple techniques and models at once in a quorum approach. For example, direct ingestion of PDFs into Gemini, OCR and ingestion of text, plus perhaps using another model like GPT. If they all agree on a fact, you're (probably) good. If not, it can be bumped up to human correction.

thelittleone

Interesting, although would be great to see some comparative results, e.g., with and without the html alt tag approach.

javier123454321

Honestly though, I hope that the google notbooklm https://notebooklm.google/ doesn't go to the google graveyard. It is great for feeding a decent amount of information and helping you process it. I've found great success at it.

null

[deleted]