ProtST

The ProtST dataset is a collection of protein sequences and their corresponding biomedical text descriptions.

Data and Resources

This dataset has no data

Cite this as

Minghao Xu, Zuobai Zhang, Jiarui Lu, Zhaocheng Zhu, Yangtian Zhang, Ma Chang, Runcheng Liu, Jian Tang (2024). Dataset: ProtST. https://doi.org/10.57702/wqf3k9xu

Private DOI: This DOI is not yet resolvable. It is available for use in manuscripts and will be published when the dataset is made public.

Additional Info

Field: Value
Created: December 16, 2024
Last update: December 16, 2024
Defined In: https://doi.org/10.48550/arXiv.2404.16866
Author: Minghao Xu
More Authors: Zuobai Zhang, Jiarui Lu, Zhaocheng Zhu, Yangtian Zhang, Ma Chang, Runcheng Liu, Jian Tang
Homepage: https://doi.org/10.1038/s41586-023-05551-4