Canyon (pronounced /ˈkænjən/) is a JavaScript code coverage collection platform. We address the difficulties developers and QA engineers encounter in collecting ...
🔔 The automatic evaluation on CodaLab is under construction. The MathVista dataset is derived from three newly collected datasets: IQTest, FunctionQA, and Paper, as well as 28 other source datasets.
Abstract: Large Language Models (LLMs) are increasingly used by software engineers for code generation. However, limitations of LLMs such as irrelevant or incorrect code have highlighted the need for ...