Negative preference optimization: From catastrophic collapse to effective unlearning

A dataset for testing the effectiveness of unlearning methods in large language models.

Data and Resources

Cite this as

Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei (2025). Dataset: Negative preference optimization: From catastrophic collapse to effective unlearning. https://doi.org/10.57702/vjgwd5mx

DOI retrieved: January 3, 2025

Additional Info

Field Value
Created January 3, 2025
Last update January 3, 2025
Defined In https://doi.org/10.48550/arXiv.2407.01920
Author Ruiqi Zhang
More Authors
Licong Lin
Yu Bai
Song Mei