-
ParaBank 2
ParaBank 2 is a large synthetic paraphrase dataset created by translating one side of bitext into the language of the other side. -
English Controlled Paraphrase Generation
The dataset for English controlled paraphrase generation. -
Chinese Controlled Paraphrase Generation
The dataset for Chinese controlled paraphrase generation. -
Quora Question Pairs (QQP) and Twitter-URL
The dataset used in this paper is Quora Question Pairs (QQP) and Twitter-URL. -
Question Pairs (QQP) and Twitter-URL
Paraphrase generation aims to produce high-quality and diverse utterances of a given text. The dataset used in this paper is Question Pairs (QQP) and Twitter-URL.