How I use Claude Code to implement new features in an existing complex codebase

56 comments

·August 3, 2025

Visit

ofrzeta

That many sites about AI look a lot like scam sites ("make money fast") is a real turn-off -> https://www.sabrina.dev

Maybe it does have some valuable content but I will just close it.

Valodim

Wow, geez. That is terrible, I'm surprised a website like that can have any actual content as TFA

WD-42

The grifters have entered the arena.

literalAardvark

Hype is like catnip for them. They've been around for a while, many of the OG AI YouTubers didn't know anything about it when they started to pretend to educate.

brunooliv

Am I the only one who uses CC in a one-shot fashion? I rely on what has been generated by the init slash command at the root of my monorepo and then when I have a complex task I manually create a md file called CONTEXT.md inside the package/area where I want to implement the new task. I try to be as detailed as possible, think like writing a ticket for a junior dev. Then my only command for CC is something like: “Focus on the project structure described in CLAUDE.md and work through the task planned in CONTEXT.md. Follow current codebase standards, keep code reusable and easy to follow and modify.”

This is like 99% of my CC interactions working on top of a well structured codebase and it just works perfectly for almost any task I throw at it.

chrisvalleybay

I mostly do the same. My workflow is usually like this: 1. Supply CC with a simple user story with light spec / business logic, asking it to write a specification for the feature and ask me any questions that need to be answered to make it complete. 2. Build up the context with all relevant files that I want to use. In Ruby on Rails it means refercing models, views and controllers that show the structure we are going to use. 3. Write the code and tests.

manmal

A few thoughts on this:

- I wouldn’t outsource my brain to CC when it comes to checking CC’s output. Very mixed results in my experience, and it might discourage further exploration/thinking if you’ve already performed the checklist CC has given you (satisficing).

- Slash commands are the idiomatic way to memoize often used prompts, I wonder why author put them in CLAUDE.md?

- I’m also a bit skeptical that, aside from strict rules CC needs to follow, the encouragements/enchantments for writing good code author put in CLAUDE.md really work. But who knows.

- I DO like the caveats section at the end a lot. This is probably the most important piece of the article, when it comes to large codebases. Never just accept the first draft. Review everything with high suspicion and bend the output to your own style and taste. Otherwise, you‘re pushing legacy code.

null

[deleted]

cristea

I agree with the notion here that things are moving so fast that most blog posts will be outdated quickly.

However posts like this is valuable for new people to get a basic understanding of how these tools could be used in a very simple, beginner friendly, setup.

cantor_S_drug

This is equivalent of .vimrc file. ".clauderc" .

campbel

The author describes a lot of directives they found useful. Be aware Claude has the concept of `slash commands` https://docs.anthropic.com/en/docs/claude-code/slash-command...

steilpass

I stumbled upon the author’s command `qnew` to read Claude.md.

Why would you need that?

> CLAUDE.md is a special file that Claude automatically pulls into context when starting a conversation.

https://www.anthropic.com/engineering/claude-code-best-pract...

vunderba

That confused me as well. The whole point of CLAUDE.md is that it is automatically injected into the rolling context of any Claude interaction/thread.

That CLAUDE.md file that they've posted is also HUGE - Anthropic themselves recommend keeping it on the more concise side. If you need more, consider just creating dedicated subagents for "UI/UX reviewer", "search", etc.

AndyNemmity

XenophileJKO

Actually please write an article about how you are currently using it then. I've been actively using it and working with sub-agents. I feel like there are very little resources out there on more complex Claude Code setups.

I would love to see other setups, like what MCP, hooks, sub-agents, commands, etc.

AndyNemmity

ToJans

Wow, that is a huge instruction set.

I've created a (way smaller) "/hire" command that does something similar, but I should probably turn it into an agent as well, as the commao is only creating agents, and I still need to do further adaptation with individual promoting and edits

It's these little, but crucial insights that make all the difference, so thank you!

I have the exact same feeling about losing time, for me it's starting to turn into an addiction,

I'm buiding a new side product, and the sense of urgency combined with the available capability makes it hard for me to stop working.

Progress is going so fast that it feels like the competition might catch up any time now.

I now restrained myself upfront with predefined timing windows for work, so the I manage to keep my sanity & social life from disappearing...

"What a great time to be alive"

derektank

>Apologies, I'm not an article writer. I focus all my time on improving my own setup.

Maybe that's why the guides you see appear a bit out of date? Writing, all kinds of communicating, takes time, both in organizing and compiling your thoughts

weitendorf

We’ve been building in this direction for a bit over a year now. I don’t see your contact info but please reach out over email if you’re interested in checking out our approach and maybe working with us!

gexla

I think the basic issue for me is that people put this stuff out as if it's some big discovery, and yet my own usage is way different and serves me just fine thank you. These have the feel of a developer who has just discovered development and then want to tell you about all the best tools. But it's not really about showing you the best tools, is it? Rather, it's about riding the hype train and creating slop for more eyeballs.

weitendorf

Respectfully, I think you are being reflexively contrarian or overindexing on the vibe coding hype train. The way vibe coding UX works is basically just having LLMs guess what they need to do to complete a task and then sending it off to do it and try to fix anything that doesn't go according to plan along the way. Nothing big there.

I had started working on an AI devtool product a few months before Cursor took off, I didn't even know about them when I first started, and I hated that such a dumb UX was setting the narrative in this space. LLMs had essentially no ability to decompose and plan tasks at the time, and they weren't fucking sandboxing it!

Terminal agents are actually moving towards the UX I've been building/anticipating for. In March of 2024 I was playing around with GPT4 and saw it oneshot a microservice I asked it to make. I was so excited about the implications of where this stuff could go that I quit my job at google just to start building in this space.

Without getting into all the details, I am pretty convinced there must be some particular way of arranging infrastructure primitives and AI-coding tools in a way that properly decomposes and executes arbitrarily large or complicated tasks (limited only by available time and resources). Claude code is IMO getting closer to that by the week. No iterative change is crazy science or anything but there are some genuinely novel and exciting patterns for computing things underway.

bathtub365

When someone has a YouTube channel about programming with a certain level of polish they are not trying to be a professional programmer, they are trying to be a professional YouTuber.

null

[deleted]

_betty_

It's from July 6, so yeah a month in the past is totally right.

She has newer posts on sub agents

null

[deleted]

keerthiko

to be fair, the linked youtube video from the article is literally 4 weeks old, so "at least a month in the past" is probably exactly accurate if you are moving at the same pace as the author.

null

[deleted]

_andrei_

> If you don't discuss hooks and subagents, i'm not sure what you're doing right now.

Agents have their own context and can be useful for tasks that can be parallelized, which is a minority of tasks. How are they critical to better performance for you?

AndyNemmity

This is the best question anyone has asked.

Let's consider context. At some level the more context you have is good. At some level, the more irrelevant context you have is bad.

Okay. We have at top level of context, a hook that forces a system prompt on every action.

Next level we have a ./claude/CLAUDE.md then we have the project level CLAUDE.md then we have a possible not required agent setup then we have the instructions you give it

We know that CLAUDE.md gets lost in the context, at any level. The system prompt level hooks don't.

Why does the CLAUDE.md get lost? Why are we losing ability with a longer context.

The problem is irrelevant context to the action. The Documentation agent doesn't require the Golang modernization rules. The Golang agent, doesn't require the planing coordinator rules.

So the question I asked myself last weekend was, what is the experience if you split the contexts to only the required information for the task.

I did head to head battles with agents, reading in the information, versus contextual specific information. The agents with context specific destroyed the competition. Like it was another world.

So then I ran head to head tests on the type of information. Etc etc. My current setup is the best level achieved in those tests.

So my argument is that removing the context that is entirely irrelevant for the agent improves performance dramatically.

But I'm one person doing tests... it's true for me. Maybe it's not true for others. People have to explore the conception and determine that.

I can only tell you what has worked best for me, and for me, it's like a model jump in performance improvements.

dSebastien

It's been my experience too, already prior to support for sub-tasks/agents.

I've been exploring the ways in which I could "lazy load" context depending on the task at hand. I've approached this by defining an AI "receptionist", a set of roles and a set of tasks.

The receptionist is the entrypoint of the system. It knows where to find roles/tasks but only loads very basic information about the roles.

Depending on the role I ask for, it then loads the right role file, which in turn loads additional context. Then I ask for whatever i want to do, and it loads specific task files based on what I asked. And the task files load additional context etc.

This works quite well because I end up with just the right context for the task at hand. And each role/task can actually be executed as a sub-task/agent

_andrei_

> So my argument is that removing the context that is entirely irrelevant for the agent improves performance dramatically.

100% agree on building the optimal context, just have not seen parallel agents do better at sequential tasks. the documentation agent may have better initial context about how to write documentation, but it doesn't have the context of the changes, apart from what is passed to it / can explore. if we don't spawn a new session, and instead throw it a /document command - that would still get all the guidelines and rules for writing documentation, and it should have the same weight since it's at the bottom of the context.

for me the highest 'model jump level' performance booster is externalizing context and controlling the process - having claude initialize a plan file with a pre-defined template that makes it use that as its to do list and documentation place, and getting it to use that as its primary working area

mattlondon

What are hooks and subagents then?

mephitix

Maybe you should ask Claude ;)

https://docs.anthropic.com/en/docs/claude-code/hooks

https://docs.anthropic.com/en/docs/claude-code/sub-agents

chank

> https://docs.anthropic.com/en/docs/claude-code/hooks

> https://docs.anthropic.com/en/docs/claude-code/sub-agents

AndyNemmity

__loam

The capacity this segment of the industry has for just making shit up is incredible.

j_crick

Why did you replace your comments in this thread with . ?

AndyNemmity

Because I gave examples, and details, and thousands of people read them from a pastebin i used to share.

I didn't release it as open source or anything, just sharing. I don't want to take questions concerning it so I can focus on moving it forward.

Today's goal is to try to build self healing agents that automatically fix the problems they encounter so they only happen once, automating a manual process I successfully use.

Perhaps if that works out well, that is something releasable I can do in a real way as opposed to paste bin.

CyberMacGyver

Genuinely, how do you keep up to date with these features while still being focused at the task?

The more I focus the less I have time to read and experiment O want to finish it. What are your sources and how are you balancing it ?

AndyNemmity

Honestly, the release of sub-agents was the first time it felt actually meaningful.

MCP servers and the rest have not provided the type of gains sub-agents have. Hooks and subagents actually provide tangible value. Enough that it's changed my structure entirely to be tool focused, and not output focused.

croes

It will be interesting to see how all this works out when the honeymoon phase is over.

At the moment they burn a investor money to gain customers by giving us free tools and cheap LLMs.

That will end at some point.

chrisvalleybay

One perspective could be that this is a moment to leverage. Subsidized LLMs won't be around forever, so this is a great time to build something that you wouldn't have had the time to build otherwise.

HN

How I use Claude Code to implement new features in an existing complex codebase

How I use Claude Code to implement new features in an existing complex codebase