Claude 3.7 Sonnet with a separate "watchdog" LLM model?


I am wondering about using Claude 3.7 Sonnet (let's call it C3.7S) with another LLM, separate from the Claude model, "watching" over it to monitor when it goes off its rails.  


I have been using C3.7S since the inception of my DF game project.  From the first, when I was planning it's structure, I used Claude (from 3.0, through 3.5, and now 3.7....I wonder what's next?  Claude 3.85?).  I have seen an increase in "thinking" capability.  But I'm not sure I have seen an increase in quality of code.


Sometimes Claude "goes off the rails".  Often, it will over-complicate or over-engineer some code block.  Routinely, I need to review its output for that practice.  So I'm wondering, now, about how I could have a secondary "team member" monitoring its work.  In this case, specifically for writing more code than is necessary for the prompt, requirement, or task as specified.


I don't have the hardware, sadly, to run my own LLM on my own desktop (let alone my laptop).  An unfortunate limitation.  So now I wonder if Claude's "extended thinking" can be useful to catch its own self going too deep into some development.  This requires a lot more experimentation and tinkering.





Comments

Popular posts from this blog

Telling Rocks What To Think

WWHD?