Item: do-and-judge
Rating: 5.8
Author: Implexa

do-and-judge

Execute a task with sub-agent implementation and LLM-as-a-judge verification, with an automatic retry loop

SkillRank score ↗

5.8 / 10

evaluated by implexa, claude-haiku-4-5 · 2026-06-10

do-and-judge implements a task execution pattern with meta-judge specification generation, parallel sub-agent dispatch, external verification, and a retry loop. The orchestrator only coordinates and performs no direct tool execution.

structure: 7.0
trigger phrases: 2.0
procedure: 6.0
edge cases: 5.0
documentation: 7.0

SKILL.md

do-and-judge

Task

Execute a single task by dispatching an implementation sub-agent, verifying with an independent judge, and iterating with feedback until passing or max retries exceeded.

Context

This command implements a single-task execution pattern with meta-judge → LLM-as-a-judge verification. You (the orchestrator) dispatch a meta-judge (to generate evaluation criteria) and an implementation agent in parallel, then dispatch a judge with the meta-judge's evaluation specification to verify quality. If verification fails, you launch a new implementation agent with the judge's feedback and iterate until the work passes (score ≥4) or the maximum number of retries (2) is exceeded.
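The control flow described above can be sketched in Python. This is only an illustration, not the skill's actual implementation: the names `do_and_judge`, `Verdict`, `meta_judge`, `implement`, and `judge` are hypothetical stand-ins for sub-agent dispatches. It assumes that "max retries (2)" means two retries after the first attempt, and that the meta-judge's specification is reused unchanged on every retry.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Optional

PASS_SCORE = 4   # from the text: passing means score >= 4
MAX_RETRIES = 2  # assumption: 2 retries after the first attempt


@dataclass
class Verdict:
    """Hypothetical judge output: a score plus feedback for the next attempt."""
    score: float
    feedback: str


def do_and_judge(
    task: str,
    meta_judge: Callable[[str], str],
    implement: Callable[[str, Optional[str]], str],
    judge: Callable[[str, str], Verdict],
) -> tuple[bool, str, Verdict, int]:
    """Return (passed, result, last verdict, attempts used)."""
    # The meta-judge and the first implementation agent run in parallel.
    with ThreadPoolExecutor(max_workers=2) as pool:
        spec_future = pool.submit(meta_judge, task)
        impl_future = pool.submit(implement, task, None)
        spec = spec_future.result()
        result = impl_future.result()

    attempts = 1
    verdict = judge(result, spec)
    while verdict.score < PASS_SCORE and attempts <= MAX_RETRIES:
        # Each retry is a new implementation call that receives the feedback.
        result = implement(task, verdict.feedback)
        attempts += 1
        verdict = judge(result, spec)
    return verdict.score >= PASS_SCORE, result, verdict, attempts
```

The orchestrator function never does the work itself; it only calls the three injected callables, which mirrors the rule that the orchestrator dispatches sub-agents and does nothing else.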

Key benefits:

Fresh context - Implementation agent works with a clean context window

Structured evaluation - Meta-judge produces tailored rubrics and checklists before judging

External verification - Judge applies the meta-judge specification mechanically and catches blind spots that self-critique misses

Parallel speed - Meta-judge and implementation run simultaneously

Feedback loop - Retry with specific issues identified by judge

Quality gate - Work doesn't ship until it meets threshold

CRITICAL: You are the orchestrator only - you MUST NOT perform the task yourself. If you read, write, or run bash tools, you have failed the task immediately. This is the single most critical criterion for you. If you use anything except sub-agents, you will be killed immediately! Your role is to:
