TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding

TreeMix is a compositional data augmentation approach for natural language understanding. It uses constituency parse trees to decompose sentences into sub-structures (constituents) and recombines those sub-structures across sentences to generate new augmented sentences.
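The decompose-and-recombine idea can be illustrated with a minimal sketch. It is not the authors' implementation: the parse trees are hand-written rather than produced by a parser, the nested-tuple tree format and the helper names (`leaves`, `constituents`, `replace_at`, `tree_swap`) are hypothetical, and the choice of which constituents to exchange is uniformly random here, which is an assumption.

```python
import random

# Assumed tree format: (label, child, child, ...); a leaf is a plain string token.

def leaves(tree):
    """Return the tokens of a tree in left-to-right order."""
    if isinstance(tree, str):
        return [tree]
    return [tok for child in tree[1:] for tok in leaves(child)]

def constituents(tree, path=()):
    """Yield (path, subtree) for every non-leaf node strictly below the root."""
    if isinstance(tree, str):
        return
    for i, child in enumerate(tree[1:], start=1):
        if not isinstance(child, str):
            yield path + (i,), child
            yield from constituents(child, path + (i,))

def replace_at(tree, path, new_subtree):
    """Return a copy of `tree` with the node at `path` replaced by `new_subtree`."""
    if not path:
        return new_subtree
    i = path[0]
    return tree[:i] + (replace_at(tree[i], path[1:], new_subtree),) + tree[i + 1:]

def tree_swap(tree_a, tree_b, rng):
    """Replace a random constituent of tree_a with a random constituent of tree_b."""
    path_a, _ = rng.choice(list(constituents(tree_a)))
    _, sub_b = rng.choice(list(constituents(tree_b)))
    return replace_at(tree_a, path_a, sub_b)

# Hand-written example parses (illustrative only).
tree_a = ("S",
          ("NP", ("DT", "the"), ("NN", "movie")),
          ("VP", ("VBD", "was"), ("ADJP", ("JJ", "great"))))
tree_b = ("S",
          ("NP", ("DT", "this"), ("NN", "plot")),
          ("VP", ("VBD", "felt"), ("ADJP", ("JJ", "dull"))))

# Deterministic recombination: put tree_b's subject NP into tree_a.
mixed = replace_at(tree_a, (1,), tree_b[1])
print(" ".join(leaves(mixed)))

# Random recombination with a seeded generator.
augmented = tree_swap(tree_a, tree_b, random.Random(0))
print(" ".join(leaves(augmented)))
```

In this sketch every non-root node, including part-of-speech nodes, counts as a swappable constituent. A real pipeline would likely restrict the candidates (for example by span length) and would also need a rule for the label of the recombined sentence; neither is specified on this page.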

Cite this as

Le Zhang, Zichao Yang, Diyi Yang (2024). Dataset: TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding. https://doi.org/10.57702/28o1fqnz

DOI retrieved: December 16, 2024

Additional Info

Field: Value
Created: December 16, 2024
Last update: December 16, 2024
Defined In: https://doi.org/10.48550/arXiv.2205.06153
Author: Le Zhang
More Authors: Zichao Yang, Diyi Yang
Homepage: https://github.com/Magiccircuit/TreeMix