Scientific Data

Table 7 Statistical values on the split of non-toxic dialogues by interlocutor for each category predicted by Detoxify.

From: A dataset of synthetic art dialogues with ChatGPT

Non-Toxic Split	Expert mean	Expert std	Expert min	Expert max	User mean	User std	User min	User max
toxicity	2.4E-03	2.4E-02	3.0E-04	9.9E-01	1.5E-03	1.5E-02	3.1E-04	2.4E-03
severe toxicity	4.0E-06	4.7E-05	1.0E-06	6.5E-03	2.0E-06	1.0E-05	1.0E-06	4.0E-06
obscene	2.8E-04	1.2E-02	1.8E-05	8.2E-01	5.6E-05	5.9E-04	1.8E-05	2.8E-04
threat	8.5E-05	2.7E-03	1.2E-05	4.5E-01	4.5E-05	8.5E-04	1.4E-05	8.5E-05
insult	8.1E-04	1.6E-02	7.4E-05	9.8E-01	5.0E-04	9.2E-03	7.2E-05	8.1E-04
identity attack	3.8E-04	3.8E-03	5.3E-05	5.2E-01	1.9E-04	2.8E-03	5.1E-05	3.8E-04
sexual explicit	3.8E-04	1.2E-02	9.0E-06	8.4E-01	1.9E-04	9.1E-03	7.0E-06	3.8E-04
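
As a rough illustration of how per-interlocutor statistics of this kind could be derived, the sketch below groups per-utterance category scores by speaker and reports mean, std, min and max. It is not the authors' pipeline: the `summarize` helper and the score values are hypothetical placeholders (in practice the scores would come from Detoxify), and the use of the sample standard deviation is an assumption, since the table does not say which estimator was used.

```python
from statistics import mean, stdev


def summarize(rows):
    """Group per-utterance scores by (speaker, category) and compute summary statistics."""
    grouped = {}
    for speaker, scores in rows:
        for category, value in scores.items():
            grouped.setdefault((speaker, category), []).append(value)

    summary = {}
    for key, values in grouped.items():
        summary[key] = {
            "mean": mean(values),
            # Assumption: sample standard deviation; 0.0 when only one value exists.
            "std": stdev(values) if len(values) > 1 else 0.0,
            "min": min(values),
            "max": max(values),
        }
    return summary


# Hypothetical placeholder scores, NOT values from the dataset.
rows = [
    ("Expert", {"toxicity": 0.0004, "insult": 0.0001}),
    ("Expert", {"toxicity": 0.0008, "insult": 0.0003}),
    ("User", {"toxicity": 0.0005, "insult": 0.0002}),
    ("User", {"toxicity": 0.0007, "insult": 0.0002}),
]

summary = summarize(rows)
for (speaker, category), stats in sorted(summary.items()):
    cells = "\t".join(f"{value:.1E}" for value in stats.values())
    print(f"{speaker}\t{category}\t{cells}")
```

Each printed row follows the same column order as the table (mean, std, min, max), so the output for one speaker can be compared directly against the corresponding half of a table row.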
