Auditing GPT’s Content Moderation Guardrails: Can ChatGPT Write Your Favorite TV Show?
Document Type
Conference Proceeding
Role
Contributor
Published In
Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT)
Publisher
Association for Computing Machinery
Haverford Libraries Support
APC Waiver - Association for Computing Machinery
First Page
660
Last Page
686
Publication Date
2024
Suggested Citation
Mahomed, Y., et al. (2024). "Auditing GPT's Content Moderation Guardrails: Can ChatGPT Write Your Favorite TV Show?" in Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT): 660-686. Available: https://doi.org/10.1145/3630106.3658932
COinS
