by Edwin Chen
A methodology for defining and measuring quality that moves beyond binary correctness to subjective excellence. It treats data evaluation as a search for the 'best of the best' rather than just filtering out the 'worst of the worst'.
Core Principles
- 1.Reject Binary Checklists: Don't just ask 'Does it have 8 lines?'. Ask 'Does it move the reader? Is the imagery novel?'
- 2.Signal Triangulation: Use implicit metadata (keystrokes, time-on-task, edit history) alongside explicit output to judge worker quality.
- 3.Expert-Tier Annotation: Use domain experts (Nobel physicists, teachers) who can evaluate the *reasoning* path, not just the final answer.
- +1 more...
"We basically never wanted to play the Silicon Valley game... We essentially teach AI models what's good and what's bad. People don't understand what quality even means in this space."