harmfulqa
HarmfulQA(split: str = 'train', config: str | None = 'blue')
Bases: ChatTemplateDataset
HarmfulQA (Bhardwaj & Poria, 2023).
Multi-round conversations labeled as safe ("blue") or harmful ("red").
The config parameter selects the conversation type:
- "blue" (default): safe conversations
- "red": harmful conversations
- "all": both types interleaved
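The three config values can be illustrated with a small standalone sketch. It does not use the library's loader: `select_conversations`, its arguments, and the alternating order used for "all" are hypothetical, chosen only to show one plausible reading of "interleaved". Going by the signature above, the real call would look like `HarmfulQA(split="train", config="red")`.

```python
from itertools import chain, zip_longest

# Sentinel used to pad the shorter list when the two lists differ in length.
_MISSING = object()


def select_conversations(blue, red, config="blue"):
    """Hypothetical helper mirroring the documented config values.

    blue, red: lists of conversations (any objects).
    config: "blue", "red", or "all".
    """
    if config == "blue":
        return list(blue)
    if config == "red":
        return list(red)
    if config == "all":
        # Assumption: "interleaved" means alternating blue, red, blue, red, ...
        # with leftovers from the longer list appended in order.
        pairs = zip_longest(blue, red, fillvalue=_MISSING)
        return [c for c in chain.from_iterable(pairs) if c is not _MISSING]
    raise ValueError(f"unknown config: {config!r}")


blue_convs = ["blue-0", "blue-1"]
red_convs = ["red-0"]
mixed = select_conversations(blue_convs, red_convs, config="all")
```

The actual interleaving order used by the dataset class may differ; check the implementation if the order matters for training or evaluation.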