SST-2

The dataset used for the experiments across ten models– ranging from bag-of-words models to pre-trained transformers– and find that a model having higher AUC does not necessarily have a higher selective answering capability.

Data and Resources

Cite this as

Depeng Liang, Yongdong Zhang (2024). Dataset: SST-2. https://doi.org/10.57702/xbqg3gx6

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2109.05463
Citation
  • https://doi.org/10.48550/arXiv.2305.11596
  • https://doi.org/10.48550/arXiv.2210.14576
  • https://doi.org/10.48550/arXiv.2310.07579
  • https://doi.org/10.48550/arXiv.2105.06020
  • https://doi.org/10.48550/arXiv.2210.04466
  • https://doi.org/10.48550/arXiv.2102.04761
  • https://doi.org/10.48550/arXiv.2401.03514
  • https://doi.org/10.48550/arXiv.2305.14710
  • https://doi.org/10.48550/arXiv.2009.07360
  • https://doi.org/10.48550/arXiv.2007.06898
Author Depeng Liang
More Authors
Yongdong Zhang
Homepage https://www.aclweb.org/anthology/D15-1012