teaching a gpt to judge itself, part one
baby pipelines, observability, and scale
Sep 30, 20253 min read7
Search for a command to run...
Series
i’m building a reasoning lab to judge gpt’s logic harder than I judge my own life. reason-saver logs, scores, and critiques completions. this is my journey through llm tooling, prompt debugging, and other ml chaos.