Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Thongtanunam

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12807  [pdf, other

    cs.SE

    Code Ownership: The Principles, Differences, and Their Associations with Software Quality

    Authors: Patanamon Thongtanunam, Chakkrit Tantithamthavorn

    Abstract: Code ownership -- an approximation of the degree of ownership of a software component -- is one of the important software measures used in quality improvement plans. However, prior studies proposed different variants of code ownership approximations. Yet, little is known about the difference in code ownership approximations and their association with software quality. In this paper, we investigate… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: The paper has been accepted at the 35th IEEE International Symposium on Software Reliability Engineering (ISSRE2024)

  2. arXiv:2407.12241  [pdf, other

    cs.SE

    An Empirical Study of Static Analysis Tools for Secure Code Review

    Authors: Wachiraphan Charoenwet, Patanamon Thongtanunam, Van-Thuan Pham, Christoph Treude

    Abstract: Early identification of security issues in software development is vital to minimize their unanticipated impacts. Code review is a widely used manual analysis method that aims to uncover security issues along with other coding issues in software projects. While some studies suggest that automated static application security testing tools (SASTs) could enhance security issue identification, there i… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA) 2024

  3. arXiv:2407.06826  [pdf, other

    cs.AI

    VRDSynth: Synthesizing Programs for Multilingual Visually Rich Document Information Extraction

    Authors: Thanh-Dat Nguyen, Tung Do-Viet, Hung Nguyen-Duy, Tuan-Hai Luu, Hung Le, Bach Le, Patanamon, Thongtanunam

    Abstract: Businesses need to query visually rich documents (VRDs) like receipts, medical records, and insurance forms to make decisions. Existing techniques for extracting entities from VRDs struggle with new layouts or require extensive pre-training data. We introduce VRDSynth, a program synthesis method to automatically extract entity relations from multilingual VRDs without pre-training data. To capture… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted in ISSTA'24

  4. arXiv:2405.02213  [pdf, other

    cs.SE cs.AI cs.LG

    Automatic Programming: Large Language Models and Beyond

    Authors: Michael R. Lyu, Baishakhi Ray, Abhik Roychoudhury, Shin Hwei Tan, Patanamon Thongtanunam

    Abstract: Automatic programming has seen increasing popularity due to the emergence of tools like GitHub Copilot which rely on Large Language Models (LLMs). At the same time, automatically generated code faces challenges during deployment due to concerns around quality and trust. In this article, we study automated coding in a general sense and study the concerns around code quality, security and related is… ▽ More

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2405.01569  [pdf, other

    cs.SE

    A Systematic Literature Review on Reasons and Approaches for Accurate Effort Estimations in Agile

    Authors: Jirat Pasuksmit, Patanamon Thongtanunam, Shanika Karunasekera

    Abstract: Background: Accurate effort estimation is crucial for planning in Agile iterative development. Agile estimation generally relies on consensus-based methods like planning poker, which require less time and information than other formal methods (e.g., COSMIC) but are prone to inaccuracies. Understanding the common reasons for inaccurate estimations and how proposed approaches can assist practitioner… ▽ More

    Submitted 14 April, 2024; originally announced May 2024.

    Comments: Journal article

  6. arXiv:2402.09651  [pdf, other

    cs.SE cs.LG

    Practitioners' Challenges and Perceptions of CI Build Failure Predictions at Atlassian

    Authors: Yang Hong, Chakkrit Tantithamthavorn, Jirat Pasuksmit, Patanamon Thongtanunam, Arik Friedman, Xing Zhao, Anton Krasikov

    Abstract: Continuous Integration (CI) build failures could significantly impact the software development process and teams, such as delaying the release of new features and reducing developers' productivity. In this work, we report on an empirical study that investigates CI build failures throughout product development at Atlassian. Our quantitative analysis found that the repository dimension is the key fa… ▽ More

    Submitted 14 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  7. Improving Automated Code Reviews: Learning from Experience

    Authors: Hong Yi Lin, Patanamon Thongtanunam, Christoph Treude, Wachiraphan Charoenwet

    Abstract: Modern code review is a critical quality assurance process that is widely adopted in both industry and open source software environments. This process can help newcomers learn from the feedback of experienced reviewers; however, it often brings a large workload and stress to reviewers. To alleviate this burden, the field of automated code reviews aims to automate the process, teaching large langua… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted by the 21st International Conference on Mining Software Repositories (MSR 24)

  8. Encoding Version History Context for Better Code Representation

    Authors: Huy Nguyen, Christoph Treude, Patanamon Thongtanunam

    Abstract: With the exponential growth of AI tools that generate source code, understanding software has become crucial. When developers comprehend a program, they may refer to additional contexts to look for information, e.g. program documentation or historical code versions. Therefore, we argue that encoding this additional contextual information could also benefit code representation for deep learning. Re… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 5 pages (plus 1 for references), 1 figure, 3 tables, paper was accepted to 21st International Conference on Mining Software Repositories (MSR 2024)

  9. arXiv:2311.16396  [pdf, other

    cs.SE

    Toward Effective Secure Code Reviews: An Empirical Study of Security-Related Coding Weaknesses

    Authors: Wachiraphan Charoenwet, Patanamon Thongtanunam, Van-Thuan Pham, Christoph Treude

    Abstract: Identifying security issues early is encouraged to reduce the latent negative impacts on software systems. Code review is a widely-used method that allows developers to manually inspect modified code, catching security issues during a software development cycle. However, existing code review studies often focus on known vulnerabilities, neglecting coding weaknesses, which can introduce real-world… ▽ More

    Submitted 8 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  10. arXiv:2308.11161  [pdf, other

    cs.SE

    Adversarial Attacks on Code Models with Discriminative Graph Patterns

    Authors: Thanh-Dat Nguyen, Yang Zhou, Xuan Bach D. Le, Patanamon, Thongtanunam, David Lo

    Abstract: Pre-trained language models of code are now widely used in various software engineering tasks such as code generation, code completion, vulnerability detection, etc. This, in turn, poses security and reliability risks to these models. One of the important threats is \textit{adversarial attacks}, which can lead to erroneous predictions and largely affect model performance on downstream tasks. Curre… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  11. arXiv:2308.10078  [pdf, other

    cs.SE

    Repeated Builds During Code Review: An Empirical Study of the OpenStack Community

    Authors: Rungroj Maipradit, Dong Wang, Patanamon Thongtanunam, Raula Gaikovina Kula, Yasutaka Kamei, Shane McIntosh

    Abstract: Code review is a popular practice where developers critique each others' changes. Since automated builds can identify low-level issues (e.g., syntactic errors, regression bugs), it is not uncommon for software organizations to incorporate automated builds in the code review process. In such code review deployment scenarios, submitted change sets must be approved for integration by both peer code r… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: conference

  12. An Exploration of Cross-Patch Collaborations via Patch Linkage in OpenStack

    Authors: Dong Wang, Patanamon Thongtanunam, Raula Gaikovina Kula, Kenichi Matsumoto

    Abstract: Contemporary development projects benefit from code review as it improves the quality of a project. Large ecosystems of inter-dependent projects like OpenStack generate a large number of reviews, which poses new challenges for collaboration (improving patches, fixing defects). Review tools allow developers to link between patches, to indicate patch dependency, competing solutions, or provide broad… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

  13. arXiv:2209.07048  [pdf, other

    cs.SE

    Automatically Recommend Code Updates: Are We There Yet?

    Authors: Yue Liu, Chakkrit Tantithamthavorn, Yonghui Liu, Patanamon Thongtanunam, Li Li

    Abstract: In recent years, large pre-trained Language Models of Code (CodeLMs) have shown promising results on various software engineering tasks. One such task is automatic code update recommendation, which transforms outdated code snippets into their approved and revised counterparts. Although many CodeLM-based approaches have been proposed, claiming high accuracy, their effectiveness and reliability on r… ▽ More

    Submitted 12 May, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Under review at a SE journal

  14. arXiv:2205.13231  [pdf, other

    cs.SE

    Giving Back: Contributions Congruent to Library Dependency Changes in a Software Ecosystem

    Authors: Supatsara Wattanakriengkrai, Dong Wang, Raula Gaikovina Kula, Christoph Treude, Patanamon Thongtanunam, Takashi Ishio, Kenichi Matsumoto

    Abstract: Popular adoption of third-party libraries for contemporary software development has led to the creation of large inter-dependency networks, where sustainability issues of a single library can have widespread network effects. Maintainers of these libraries are often overworked, relying on the contributions of volunteers to sustain these libraries. In this work, we measure contributions that are ali… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  15. arXiv:2107.02420  [pdf, other

    cs.SE

    Towards Just-Enough Documentation for Agile Effort Estimation: What Information Should Be Documented?

    Authors: Jirat Pasuksmit, Patanamon Thongtanunam, Shanika Karunasekera

    Abstract: Effort estimation is an integral part of activities planning in Agile iterative development. An Agile team estimates the effort of a task based on the available information which is usually conveyed through documentation. However, as documentation has a lower priority in Agile, little is known about how documentation effort can be optimized while achieving accurate estimation. Hence, to help pract… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: To be published at International Conference on Software Maintenance and Evolution (ICSME) 2021

  16. arXiv:2101.04837  [pdf, other

    cs.SE

    Assessing the Students' Understanding and their Mistakes in Code Review Checklists -- An Experience Report of 1,791 Code Review Checklist Questions from 394 Students

    Authors: Chun Yong Chong, Patanamon Thongtanunam, Chakkrit Tantithamthavorn

    Abstract: Code review is a widely-used practice in software development companies to identify defects. Hence, code review has been included in many software engineering curricula at universities worldwide. However, teaching code review is still a challenging task because the code review effectiveness depends on the code reading and analytical skills of a reviewer. While several studies have investigated the… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 10 pages, accepted at the International Conference on Software Engineering: Joint Track on Software Engineering Education and Training Track (ICSE'21-JSEET)

  17. Predicting Defective Lines Using a Model-Agnostic Technique

    Authors: Supatsara Wattanakriengkrai, Patanamon Thongtanunam, Chakkrit Tantithamthavorn, Hideaki Hata, Kenichi Matsumoto

    Abstract: Defect prediction models are proposed to help a team prioritize source code areas files that need Software QualityAssurance (SQA) based on the likelihood of having defects. However, developers may waste their unnecessary effort on the whole filewhile only a small fraction of its source code lines are defective. Indeed, we find that as little as 1%-3% of lines of a file are defective. Hence, in thi… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

  18. arXiv:1903.06348  [pdf, ps, other

    cs.SE

    Automatically Generating Documentation for Lambda Expressions in Java

    Authors: Anwar Alqaimi, Patanamon Thongtanunam, Christoph Treude

    Abstract: When lambda expressions were introduced to the Java programming language as part of the release of Java 8 in 2014, they were the language's first step into functional programming. Since lambda expressions are still relatively new, not all developers use or understand them. In this paper, we first present the results of an empirical study to determine how frequently developers of GitHub repositorie… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: to appear as full paper at MSR 2019, the 16th International Conference on Mining Software Repositories

  19. arXiv:1806.10277  [pdf, other

    cs.SE

    The Impact of Human Factors on the Participation Decision of Reviewers in Modern Code Review

    Authors: Shade Ruangwan, Patanamon Thongtanunam, Akinori Ihara, Kenichi Matsumoto

    Abstract: Modern Code Review (MCR) plays a key role in software quality practices. In MCR process, a new patch (i.e., a set of code changes) is encouraged to be examined by reviewers in order to identify weaknesses in source code prior to an integration into main software repositories. To mitigate the risk of having future defects, prior work suggests that MCR should be performed with sufficient review part… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: 43 pages, under submission at Springer Journal of Empirical Software Engineering (EMSE)