Draft: User interaction without judge (!13) · Merge requests · Grießhaber Daniel / evoprompt

The goal of this PR is to allow human feedback, even without a judge. Currently, we rely on a judge to rate model outputs, and then ask a human to correct an output if it was rated bad by the judge.

Without a judge, we ask a human to correct the worst prompts (i.e., each evolution step) after several iterations of non-improvement in a generation.

Edited Feb 13, 2025 by Max Kimmich

Draft: User interaction without judge

Merge request reports