winogrande
winogrande
¶
WinograndeEval(split: str = 'validation')
¶
Bases: RolloutEvaluation
Winogrande evaluation using validation split.
max_new_tokens(inference: Any) -> int
¶
Only need 1 token for A/B answer.