Artikel in einem Konferenzbericht,

An empirical evaluation of GitHub copilot’s code suggestions

N. Nguyen, und S. Nadi.
Proceedings of the 19th International Conference on Mining Software Repositories, Seite 1-5. ACM, (Mai 2022)
DOI: 10.1145/3524842.3528470

Zusammenfassung

GitHub and OpenAI recently launched Copilot, an ÄI pair programmer" that utilizes the power of Natural Language Processing, Static Analysis, Code Synthesis, and Artificial Intelligence. Given a natural language description of the target functionality, Copilot can generate corresponding code in several programming languages. In this paper, we perform an empirical study to evaluate the correctness and understandability of Copilot's suggested code. We use 33 LeetCode questions to create queries for Copilot in four different programming languages. We evaluate the correctness of the corresponding 132 Copilot solutions by running LeetCode's provided tests, and evaluate understandability using SonarQube's cyclomatic complexity and cognitive complexity metrics. We find that Copilot's Java suggestions have the highest correctness score (57%) while JavaScript is the lowest (27%). Overall, Copilot's suggestions have low complexity with no notable differences between the programming languages. We also find some potential Copilot shortcomings, such as generating code that can be further simplified and code that relies on undefined helper methods.

BibTeX-Schlüssel: Nguyen_2022
Eintragstyp: inproceedings
Buchtitel: Proceedings of the 19th International Conference on Mining Software Repositories
Jahr: 2022
Monat: may
Seiten: 1-5
Verlag: ACM
Reihe: MSR ’22
collection: MSR ’22
DOI: 10.1145/3524842.3528470
URL: http://dx.doi.org/10.1145/3524842.3528470

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Bitte melden Sie sich an um selbst Rezensionen oder Kommentare zu erstellen.

Zitieren Sie diese Publikation

@inproceedings{Nguyen_2022, abstract = {GitHub and OpenAI recently launched Copilot, an "AI pair programmer" that utilizes the power of Natural Language Processing, Static Analysis, Code Synthesis, and Artificial Intelligence. Given a natural language description of the target functionality, Copilot can generate corresponding code in several programming languages. In this paper, we perform an empirical study to evaluate the correctness and understandability of Copilot's suggested code. We use 33 LeetCode questions to create queries for Copilot in four different programming languages. We evaluate the correctness of the corresponding 132 Copilot solutions by running LeetCode's provided tests, and evaluate understandability using SonarQube's cyclomatic complexity and cognitive complexity metrics. We find that Copilot's Java suggestions have the highest correctness score (57%) while JavaScript is the lowest (27%). Overall, Copilot's suggestions have low complexity with no notable differences between the programming languages. We also find some potential Copilot shortcomings, such as generating code that can be further simplified and code that relies on undefined helper methods. }, added-at = {2023-12-06T06:03:50.000+0100}, author = {Nguyen, Nhan and Nadi, Sarah}, biburl = {https://www.bibsonomy.org/bibtex/29ec7db85b2c8e33d575d402a54058749/brusilovsky}, booktitle = {Proceedings of the 19th International Conference on Mining Software Repositories}, collection = {MSR ’22}, description = {An empirical evaluation of GitHub copilot's code suggestions | Proceedings of the 19th International Conference on Mining Software Repositories}, doi = {10.1145/3524842.3528470}, interhash = {558f5ce7d3ef6c9801565ab0cc29b7ad}, intrahash = {9ec7db85b2c8e33d575d402a54058749}, keywords = {code-generation llm}, month = may, pages = {1-5}, publisher = {ACM}, series = {MSR ’22}, timestamp = {2023-12-06T06:03:50.000+0100}, title = {An empirical evaluation of GitHub copilot’s code suggestions}, url = {http://dx.doi.org/10.1145/3524842.3528470}, year = 2022 }

BibSonomy

An empirical evaluation of GitHub copilot’s code suggestions

Zusammenfassung

Tags

Nutzer

Kommentare und Rezensionenanzeigen / verbergen

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf