StackLLaMA: An RL fine-tuned LLaMA model for Stack Exchange question and answering

The paper uses the StackExchange dataset.

BibTeX: