DNABERT-S: LEARNING SPECIES-AWARE DNA EMBEDDING WITH GENOME FOUNDATION MODELS

DNABERT-S is a genome foundation model designed to generate species-aware DNA embeddings, that is, embeddings in which sequences from the same species cluster together and sequences from different species are separated. It is trained with a combination of Manifold Instance Mixup (MI-Mix) and Curriculum Contrastive Learning (C²LR).
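To make the idea of combining instance mixup with contrastive learning more concrete, here is a minimal NumPy sketch. It is a generic illustration, not the DNABERT-S training code: it assumes that mixup is applied to hidden representations of two instances in a batch and that the loss is a soft-label, InfoNCE-style contrastive objective. The function names, the `temperature` value, and the toy data are all hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def log_softmax(logits):
    """Numerically stable row-wise log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))


def mixup_contrastive_loss(anchors, positives, lam, perm, temperature=0.1):
    """Soft-label contrastive loss on mixed anchor representations.

    anchors, positives: arrays of shape (n, d); row i of each is assumed
        to come from the same source (e.g. the same species).
    lam: mixing coefficient in [0, 1].
    perm: integer array of shape (n,) selecting each anchor's mixing partner.
    """
    n = anchors.shape[0]
    # Mix each anchor with its partner in representation space.
    mixed = lam * anchors + (1.0 - lam) * anchors[perm]
    # Cosine similarity between every mixed anchor and every positive.
    logits = l2_normalize(mixed) @ l2_normalize(positives).T / temperature
    # Soft targets: weight lam on the anchor's own positive,
    # weight (1 - lam) on the partner's positive.
    targets = lam * np.eye(n) + (1.0 - lam) * np.eye(n)[perm]
    return float(-(targets * log_softmax(logits)).sum(axis=1).mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy stand-ins for sequence embeddings (hypothetical data).
    anchors = rng.normal(size=(8, 16))
    positives = anchors + 0.05 * rng.normal(size=(8, 16))
    lam = rng.beta(1.0, 1.0)
    perm = rng.permutation(8)
    print(mixup_contrastive_loss(anchors, positives, lam, perm))
```

With `lam = 1.0` the mixing has no effect and the function reduces to a standard contrastive loss over anchor/positive pairs; smaller values of `lam` ask the model to recover the mixing proportions from the mixed representation.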

BibTeX: