Comparing Programming Language Models for Design Pattern Recognition

Sushant Kumar Pandey; Miroslaw Staron; Jennifer Horkoff; Mirosław Ochodek; Darko Durisic

doi:10.1109/ICSA-C63560.2024.00041

System Informacji Naukowej Politechniki Poznańskiej

Chapter

Title

Comparing Programming Language Models for Design Pattern Recognition

Authors

Sushant Kumar Pandey
Miroslaw Staron
Jennifer Horkoff
Mirosław Ochodek (WIiT) ^{[ 1 ][ 2.3 ][ P ]}
Darko Durisic

^{[ 1 ]} Institute of Computing Science, Faculty of Computing and Telecommunications, Poznan University of Technology | ^{[ P ]} employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2024

Chapter type

chapter in a scientific monograph / conference paper

Publication language

English

Abstract

Design patterns (DPs) facilitate effective software architecture and design and must be maintained and enforced in existing complex software products, for example, automotive software. Implementing DPs in source code facilitates the development of high-quality software products with less effort. However, recognizing DPs in program code is challenging, which makes it difficult to keep architectural evolution under control in large software products over time. As DPs are abstract solutions, the programs used to recognize them in source code have significant limitations. In this paper, we employ four programming language models based on Bidirectional Encoder Representations from Transformers (BERT) to study the extent to which these models can recognize an exemplar DP, in this case Singleton. We compare four language representation models (OpenAI CodeX, Facebook AI TransCoder, ACoRA/BERT, and CCFlex/bag-of-words) and compare the models' rankings against a simple base metric. We found discrepancies between the models in identifying Singletons, and found that the models are inconsistently sensitive to name and semantic changes. Specifically, CodeX recognizes the existence of Singletons better than the other models, while only ACoRA shows some signs of recognizing DP semantics.

Date made available online

21.08.2024

Pages (from-to)

183 - 190

DOI

10.1109/ICSA-C63560.2024.00041

URL

https://ieeexplore.ieee.org/document/10628366

Book

IEEE 21st International Conference on Software Architecture Companion ICSA-C 2024, 4 – 8 June 2024, Hyderabad, India

Presented at

IEEE 21st International Conference on Software Architecture Companion ICSA-C 2024, 4-8 June 2024, Hyderabad, India