PubMed abstracts and PubMed Central (PMC) full-text articles dataset

This dataset combines PubMed abstracts and PubMed Central (PMC) full-text articles and is used to pretrain the UBERT variants.

Cite this as

Vinh Nguyen, Hong Yung Yip, Olivier Bodenreider (2025). Dataset: PubMed abstracts and PubMed Central (PMC) full-text articles dataset. https://doi.org/10.57702/lfwcc22q

DOI retrieved: January 3, 2025

Additional Info

Field: Value
Created: January 3, 2025
Last update: January 3, 2025
Defined In: https://doi.org/10.48550/arXiv.2204.12716
Author: Vinh Nguyen
More Authors: Hong Yung Yip, Olivier Bodenreider
Homepage: https://github.com/naaclubert/UBERT