Skip to content(if available)orjump to list(if available)

HN

The decline of high-tech manufacturing in the United States

blog.waldrn.com

Claudia – Desktop companion for Claude code

claudiacode.com

Llama-Scan: Convert PDFs to Text W Local LLMs

The Enterprise Experience

churchofturing.github.io

Why It's OK to Block Ads (2015)

blog.practicalethics.ox.ac.uk

LL3M: Large Language 3D Modelers

threedle.github.io

ArchiveTeam has finished archiving all goo.gl short links

tracker.archiveteam.org

Derivatives, Gradients, Jacobians and Hessians

blog.demofox.org

He found a bomb under a playground – and there were 176 more

Show HN: NextDNS Adds "Bypass Age Verification"

BBC Micro, the ancestor to ARM

retrogamecoders.com

Here be dragons: Preventing static damage, latchup, and metastability in the 386

HN Search isn't ingesting new data since Friday

IQ Tests Results for AI

Two sizes fit most: PostgreSQL and ClickHouse

about.gitlab.com

Show HN: OverType – A Markdown WYSIWYG editor that's just a textarea

Does OLAP Need an ORM

Show HN: Doxx – Terminal .docx viewer inspired by Glow

The Oldest Mask in the World (Pre-Pottery Neolithic B)

MS-DOS development resources

Sunny days are warm: why LinkedIn rewards mediocrity

elliotcsmith.com

undefined.pyfy.ch

A Visual Exploration of Gaussian Processes (2019)

AI vs. Professional Authors Results

mark---lawrence.blogspot.com

Llama-Scan: Convert PDFs to Text W Local LLMs

Llama-Scan: Convert PDFs to Text W Local LLMs

5 comments

·August 17, 2025

firesteelrain

Ironically, Ollama likely is using Tesseract under the hood. Python library ocrmypdf uses Tesseract too. https://github.com/ocrmypdf/OCRmyPDF

david_draco

Looking at the code, this converts PDF pages to images, then transcribes each image. I might have expected a pdftotext post-processor. The complexity of PDF I guess ...

firesteelrain

There is a very popular Python module called ocrmypdf. I used it to help my HOA and OCR’ing of old PDFs.

https://github.com/ocrmypdf/OCRmyPDF

No LLMs required.

roscas

Almost perfect, the PDF I tested it missed only a few symbols.

But that is something I will use for sure. Thank you.

no_creativity_

[dead]