ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

Kammakomati, Mehant; Pimparkhede, Sameer; Tamilselvam, Srikanth; Kumar, Prince; Bhattacharyya, Pushpak

Computer Science > Software Engineering

arXiv:2407.03387 (cs)

[Submitted on 3 Jul 2024]

Title:ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

Authors:Mehant Kammakomati, Sameer Pimparkhede, Srikanth Tamilselvam, Prince Kumar, Pushpak Bhattacharyya

View PDF HTML (experimental)

Abstract:Recent work shows Large Language Models (LLMs) struggle to understand natural language constraints for various text generation tasks in zero- and few-shot settings. While, in the code domain, there is wide usage of constraints in code format to maintain the integrity of code written in Domain-Specific Languages (DSLs), yet there has been no work evaluating LLMs with these constraints. We propose two novel tasks to assess the controllability of LLMs using hard and soft constraints represented as code across five representations. Our findings suggest that LLMs struggle to comprehend constraints in all representations irrespective of their portions in the pre-training data. While models are better at comprehending constraints in JSON, YAML, and natural language representations, they struggle with constraints represented in XML and the resource-rich language Python.

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.03387 [cs.SE]
	(or arXiv:2407.03387v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2407.03387

Submission history

From: Sameer Pimparkhede [view email]
[v1] Wed, 3 Jul 2024 08:36:13 UTC (70 KB)

Computer Science > Software Engineering

Title:ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators