Debug and Rebug: The Records of a Developing Developer

March 23, 2025

Claude 3.7 Sonnet with a separate "watchdog" LLM?

I am wondering about pairing Claude 3.7 Sonnet (let's call it C3.7S) with a second LLM, separate from the Claude model, that "watches" over it and flags when it goes off the rails.

I have been using Claude since the inception of my DF game project. From the first, when I was planning its structure, I used Claude (from 3.0, through 3.5, and now 3.7... I wonder what's next? Claude 3.85?). I have seen an increase in "thinking" capability, but I'm not sure I have seen an increase in the quality of the code.

Sometimes Claude "goes off the rails". Often, it will over-complicate or over-engineer a code block, and I routinely need to review its output for that. So now I'm wondering how I could have a secondary "team member" monitoring its work, specifically for writing more code than the prompt, requirement, or task calls for.
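To make the "team member" idea a bit more concrete, here is a minimal sketch of what the monitoring side could look like. It does not call any LLM: the only check is a crude line budget, and every name in it (Verdict, line_budget_watchdog, review, the 40-line default) is a hypothetical placeholder rather than part of any real tool. The point is only the shape: a watchdog is a function that takes the task and the generated code and returns a verdict, so a second model could later be plugged in behind the same signature.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Verdict:
    ok: bool      # True if the watchdog found nothing to complain about
    reason: str   # Human-readable explanation of the verdict


# A watchdog is any callable taking (task, code) and returning a Verdict.
# An LLM-backed reviewer could be wrapped to match this signature (assumption).
Watchdog = Callable[[str, str], Verdict]


def count_code_lines(code: str) -> int:
    """Count lines that are neither blank nor pure '#' comments."""
    return sum(
        1
        for line in code.splitlines()
        if line.strip() and not line.strip().startswith("#")
    )


def line_budget_watchdog(task: str, code: str, max_lines: int = 40) -> Verdict:
    """Flag output that exceeds an arbitrary line budget (a crude heuristic)."""
    n = count_code_lines(code)
    if n > max_lines:
        return Verdict(False, f"{n} code lines exceeds budget of {max_lines}")
    return Verdict(True, f"{n} code lines within budget of {max_lines}")


def review(task: str, code: str, watchdogs: List[Watchdog]) -> List[Verdict]:
    """Run every watchdog over the generated code and collect the verdicts."""
    return [watchdog(task, code) for watchdog in watchdogs]
```

A line count is obviously a blunt measure of over-engineering, but it is cheap, needs no extra hardware, and gives a first signal for when a closer manual review is worth the time.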

Sadly, I don't have the hardware to run my own LLM on my desktop (let alone my laptop). So now I wonder whether Claude's "extended thinking" could be used to catch itself going too deep into some development. This will require a lot more experimentation and tinkering.
