-
Physicochemical Properties of Protein Tertiary Structure Data Set
Physicochemical Properties of Protein Tertiary Structure Data Set. UCI Machine Learning Repository, https://doi.org/10.24432/C5QW3H. -
CASP dataset
The CASP dataset was used for testing. The dataset contains 96 template-free proteins and 90 template-based proteins. -
SCOP 2.06 dataset
The SCOP 2.06 dataset was used for testing. The dataset contains 4,188 domains, covering 550 folds. -
SCOP 1.75 dataset
The SCOP 1.75 dataset was used for training and validation. The dataset contains 16,712 proteins covering 7 major structural classes with total 1,195 identified folds. -
DeepSF: deep convolutional neural network for mapping protein sequences to folds
Protein fold recognition is an important problem in structural bioinformatics. Almost all traditional fold recognition methods use sequence (homology) comparison to indirectly... -
Protein Folding
The dataset used in the paper for protein folding, which is a type of bioinformatics problem. -
Random Fragments Classification of Microbial Marker Clades with Multi-class SV...
Microbial clades modeling is a challenging problem in biology based on microarray genome sequences, especially in new species gene isolates discovery and category. Marker family... -
Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectru...
The dataset used for diagnosis analysis of obesity spectrum disorders -
SCOPe dataset
Structural Classification of Proteins — extended (SCOPe) dataset