Series

reason-saver: built by devs with a dream to make GPT think better.

i’m building a reasoning lab to judge gpt’s logic harder than I judge my own life. reason-saver logs, scores, and critiques completions. this is my journey through llm tooling, prompt debugging, and other ml chaos.

teaching a gpt to judge itself, part zero
baby datasets, baselines, and traces
Aug 25, 20253 min read24
teaching a gpt to judge itself, part one
baby pipelines, observability, and scale
Sep 30, 20253 min read7

Command Palette