Skip to content

harmfulqa

harmfulqa

HarmfulQA(split: str = 'train', config: str | None = 'blue')

Bases: ChatTemplateDataset

HarmfulQA (Bhardwaj & Poria, 2023).

Multi-round conversations labeled as safe ("blue") or harmful ("red"). The config parameter selects the conversation type: - "blue" (default): safe conversations - "red": harmful conversations - "all": both types interleaved