LiveCodeBench is a benchmark for evaluating Large Language Models (LLMs) on code editing tasks, including debugging, translating, polishing, and requirement switching.