Dataset Groups Activity Stream Evaluating large language models trained on code The paper presents the results of the OpenAI Codex evaluation on generating Python code. BibTex: @dataset{Mark_Chen_and_Jerry_Tworek_and_Heewoo_Jun_and_Qiming_Yuan_and_Henrique_Ponde_de_Oliveira_Pinto_and_Jared_Kaplan_and_Harri_Edwards_and_Yuri_Burda_and_Nicholas_Joseph_and_Greg_Brockman_2024, abstract = {The paper presents the results of the OpenAI Codex evaluation on generating Python code.}, author = {Mark Chen and Jerry Tworek and Heewoo Jun and Qiming Yuan and Henrique Ponde de Oliveira Pinto and Jared Kaplan and Harri Edwards and Yuri Burda and Nicholas Joseph and Greg Brockman}, doi = {10.57702/wbv3e61b}, institution = {No Organization}, keyword = {'Code Generation', 'Large Language Models', 'Natural Language Processing', 'code generation', 'large language models', 'natural language processing'}, month = {dec}, publisher = {TIB}, title = {Evaluating large language models trained on code}, url = {https://service.tib.eu/ldmservice/dataset/evaluating-large-language-models-trained-on-code}, year = {2024} }