Small models beat large ones when failure modes are explicitly constrained — Markdown | type0 | type0