2 datasets found

Formats: JSON
Tags: sequence

  • ProtDescribe

    The ProtDescribe dataset, used to pretrain the AMMA model. It consists of 553k pairs of sequences and function descriptions.
  • AlphaFoldDB

    The dataset used in the paper on secondary-structure-guided generation of novel protein sequences with latent graph diffusion.
You can also access this registry using the API (see API Docs).