sst2
sst2
¶
SST2Eval(split: str = 'validation')
¶
Bases: RolloutEvaluation
SST-2 sentiment evaluation using validation split.
max_new_tokens(inference: Any) -> int
¶
Only need ~2 tokens for positive/negative answer.