Claude for
engineering
Reading code, writing code, reviewing pull requests, debugging stack traces, generating tests, and building the little tools that make the repetitive bits go away.
I'm not an engineer but I work with a lot of them, and the gap between engineers who get value from Claude and those who don't is usually about mental model, not technical knowledge. The ones who get value treat it like a senior pair programmer with no sense of stopping. They ask for a plan before code, they iterate on a tight loop, and they never let it touch production without a review. This page is the short version of what that looks like.
Three places to start
Three workflows where Claude earns its keep on engineering work without needing any special infrastructure.
1. First-pass PR review
Paste the diff, paste the context (what the PR is trying to do, any relevant ticket or design doc), and ask Claude for a first-pass review focused on correctness, edge cases, and security. It's not a substitute for a human review. It's a way to focus your human review on the things that actually matter. The thirty-second output will catch half the issues before you've finished your coffee.
PR review prompt, annotated
Four elements, aimed specifically at making the review useful rather than noisy.
Where this falls over
Claude will nitpick style and naming if you don't explicitly tell it not to. Say "do not comment on style" up front and stick to substance.
2. Debugging a stack trace
Paste the trace, paste the relevant code, paste what you were trying to do. Ask Claude to walk you through the three most likely causes in order of likelihood, and for each, one thing you could check to confirm or rule it out. This is the pattern that replaces the "I'll just Google this error" reflex, and it's usually faster and more targeted.
Debugging the hard way vs the fast way
Same stack trace. Different amount of context. Dramatically different output.
Asks five general questions about your code, framework, and environment. Suggests "make sure the item exists before accessing it". Technically correct, practically useless.
Three ranked causes, each with a specific check:
- Most likely: ticketlib 2.4 changed the shape of a response. The 2.4 release notes mention a breaking change to the pagination response. Check if
itemis now nested underitem.datainstead of being returned directly. One-line check: log the response shape just before line 42. - Possible: a malformed row in Monday's data. The error would happen on one row and then crash the whole batch. Wrap line 42 in a try/catch that logs the row ID and skips it, and see if only some rows fail.
- Less likely: race condition between workers. Would normally show up as intermittent, not consistent. Check if this is reproducible with a single worker.
Where this falls over
If you paste the trace without the code, Claude will guess wildly. If you paste both without the context of what you were doing, it will still guess but slightly less wildly. Always include all three.
3. Scaffolding a new feature with Claude Code
For any greenfield feature, the best workflow is: describe what you want in a few sentences, ask for the plan before the code, review the plan, then let Claude Code execute step by step. The tight loop (plan, step, verify, step, verify) is the thing that stops it going off the rails. The faster you see something running, the earlier you catch problems.
Scaffolding a small tool, as a conversation
A rough reconstruction. Notice that nothing gets written until the plan is approved.
Where this falls over
Letting Claude Code run without a plan is how you get an hour of work you can't use. The three most useful words in this workflow are "show me first".
Worked examples
Before and after, on tasks from engineering work.
Prompts to steal
Copy any of these, fill in the square brackets, run.
Try this week
Pick one. All of these are under an hour.
A lot of the engineering work across Abingdon is going to end up touching Claude in one form or another. The thing that separates the people who ship something useful from the people who build a demo and move on is the habit of evaluation. Even twenty test cases is better than zero.
If you want to go broader, the Beginner guide covers the habits that apply everywhere. The prompt library has the full set of role-tagged prompts in one place.