Dataset Groups Activity Stream mT5 A multilingual version of the seq2seq architecture trained on Colossal Clean Crawled Corpus. BibTex: @dataset{Tosin_Adewumi_and_Foteini_Liwicki_and_Marcus_Liwicki_2025, abstract = {A multilingual version of the seq2seq architecture trained on Colossal Clean Crawled Corpus.}, author = {Tosin Adewumi and Foteini Liwicki and Marcus Liwicki}, doi = {10.57702/e9u6h76z}, institution = {No Organization}, keyword = {'Colossal Clean Crawled Corpus', 'Multilingual Task', 'Text-to-Text', 'mT5', 'multilingual', 'seq2seq'}, month = {jan}, publisher = {TIB}, title = {mT5}, url = {https://service.tib.eu/ldmservice/dataset/mt5}, year = {2025} }