AMMA dataset

This is the dataset used in the paper for protein representation learning. It consists of 120k (sequence, structure, function) triplets.

BibTeX: