Candidate A
Count correctness:
Evaluator setup
Model names are hidden during voting. Your voter name is only used to avoid showing the same pair twice to the same evaluator.
Elo is used internally for ranking but not shown directly to voters.
| Rank | Model | Games | W | L | T | Count labels |
|---|