Evaluating large language models trained on code

The paper presents the results of the OpenAI Codex evaluation on generating Python code.

BibTex: