ProtDescribe

The ProtDescribe dataset used for pretraining the AMMA model, consisting of 553k sequence and function description pairs.

Data and Resources

Cite this as

Seul Lee, Minseon Kim, Dongki Kim, Eunji Ko, Sung Ju Hwang (2024). Dataset: ProtDescribe. https://doi.org/10.57702/loi0wq0t

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2405.06663
Author Seul Lee
More Authors
Minseon Kim
Dongki Kim
Eunji Ko
Sung Ju Hwang
Homepage https://www.protdescribe.org/