-
Llama: Open and efficient foundation language models
The LLaMA dataset is a large language model dataset used in the paper. -
BERT: Pre-training of deep bidirectional transformers for language understanding
This paper proposes BERT, a pre-trained deep bidirectional transformer for language understanding.