Top model scores may be skewed by Git history leaks in SWE-bench
github.com
Claude's memory architecture is the opposite of ChatGPT's
shloked.com
Bulletproof host Stark Industries evades EU sanctions
krebsonsecurity.com
Rails on SQLite: new ways to cause outages
andre.arko.net
Unusual Capabilities of Nano Banana (Examples)
github.com
NT OS Kernel Information Disclosure Vulnerability
crowdfense.com
'Robber bees' invade apiarist's shop in attempted honey heist
cbc.ca
Behind the scenes of Bun Install
bun.com
Making io_uring pervasive in QEMU [pdf]
vmsplice.net
Launch HN: Ghostship (YC S25) – AI agents that find bugs in your web app
The Helix Text Editor (2024)
jonathan-frere.com
Adam (YC W25) Is Hiring to Build the Future of CAD
ycombinator.com
Show HN: Making a cross-platform game in Go using WebRTC Datachannels
pion.ly
CRISPR offers new hope for treating diabetes
wired.com
AirPods live translation blocked for EU users with EU Apple accounts
macrumors.com
A tech-law measurement and analysis of event listeners for wiretapping
arxiv.org
Conway's Game of Life, but musical
hudsong.dev
Adjacency Matrix and std:mdspan, C++23
cppstories.com
ApeRAG: Production-ready GraphRAG with multi-modal indexing and K8s deployment
github.com
GrapheneOS and Forensic Extraction of Data (2024)
discuss.grapheneos.org
How Palantir Is Mapping Everyone's Data for the Government
techdirt.com
From burner phones to decks of cards: NYC teens adjusting to the smartphone ban
gothamist.com
Randomly selecting points inside a triangle
johndcook.com
An engineering history of the Manhattan Project
construction-physics.com
Hi HN, we're Jesse and Gautham. We're building Ghostship (https://tryghostship.dev/).
Ghostship lets you find bugs in your web app by entering in your URL and describing a user journey.
Here's a video of Ghostship in action: https://www.loom.com/share/dec264ae32f94d50adb141c9246837c3?....
For over half our lives, we've been developers and we've done tons of user-facing projects like a coding competition I built called CerealCodes or freelancing projects on Upwork. The biggest problem we faced was that we shipped bugs in edge cases we didn't test, and the process of testing was annoying to do everytime we shipped a new feature. We tried automated testing tools, but those were flaky and couldn't adapt to feature changes. They also were really annoying to set up.
Our solution is to use browser agents to help you find bugs in your web app by clicking through your product like users would. You'd enter in your URL, describe what a user would do, and Ghostship would go through and try finding bugs by going through the user journey and extrapolating edge cases by visually seeing where else to click as it goes through each step in the user journey. We then show session replays of our agents going through your web app and list out all the steps it took.
We're able to find edge cases with almost no prompting. All you need to do is enter in one URL and one user journey (if you have login credentials on your web app, enter in some test credentials).
One bug we were able to find with Ghostship was on the YC application page. Apparently you could add your education dates in reverse chronological order (April 2022 to January 2021, which makes no sense).
Another bug we were able to find was a crypto smart contract CRM dashboard we vibe coded where we found a bug involving data corruption when you tried editing a draft contract multiple times.
You can sign up here: https://playground.tryghostship.dev/ for a limited number of credits. We'd love to hear from the HN community, whether you're building a web app for fun or a developer shipping a cool user-facing product to customers. We'd love to see what bugs we can find in your web app with Ghostship!
p.s. If you want Ghostship directly in your CI/CD pipeline and run after every PR, book a demo with us.