Skip to main content

Showing 1–16 of 16 results for author: Tamilselvam, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03387  [pdf, other

    cs.SE cs.AI cs.CL

    ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages

    Authors: Mehant Kammakomati, Sameer Pimparkhede, Srikanth Tamilselvam, Prince Kumar, Pushpak Bhattacharyya

    Abstract: Recent work shows Large Language Models (LLMs) struggle to understand natural language constraints for various text generation tasks in zero- and few-shot settings. While, in the code domain, there is wide usage of constraints in code format to maintain the integrity of code written in Domain-Specific Languages (DSLs), yet there has been no work evaluating LLMs with these constraints. We propose t… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.11925  [pdf, other

    cs.SE cs.AI cs.CL

    DocCGen: Document-based Controlled Code Generation

    Authors: Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya

    Abstract: Recent developments show that Large Language Models (LLMs) produce state-of-the-art performance on natural language (NL) to code generation for resource-rich general-purpose languages like C++, Java, and Python. However, their practical usage for structured domain-specific languages (DSLs) such as YAML, JSON is limited due to domain-specific schema, grammar, and customizations generally unseen by… ▽ More

    Submitted 3 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2403.10205  [pdf, other

    cs.CL cs.AI

    Read between the lines -- Functionality Extraction From READMEs

    Authors: Prince Kumar, Srikanth Tamilselvam, Dinesh Garg

    Abstract: While text summarization is a well-known NLP task, in this paper, we introduce a novel and useful variant of it called functionality extraction from Git README files. Though this task is a text2text generation at an abstract level, it involves its own peculiarities and challenges making existing text2text generation systems not very useful. The motivation behind this task stems from a recent surge… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  4. arXiv:2308.01302  [pdf, other

    cs.SE

    Handling Communication via APIs for Microservices

    Authors: Vini Kanvar, Ridhi Jain, Srikanth Tamilselvam

    Abstract: Enterprises in their journey to the cloud, want to decompose their monolith applications into microservices to maximize cloud benefits. Current research focuses a lot on how to partition the monolith into smaller clusters that perform well across standard metrics like coupling, cohesion, etc. However, there is little research done on taking the partitions, identifying their dependencies between th… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 15 pages

  5. arXiv:2307.04693  [pdf, other

    cs.SE cs.AI

    COMEX: A Tool for Generating Customized Source Code Representations

    Authors: Debeshee Das, Noble Saji Mathews, Alex Mathai, Srikanth Tamilselvam, Kranthi Sedamaki, Sridhar Chimalakonda, Atul Kumar

    Abstract: Learning effective representations of source code is critical for any Machine Learning for Software Engineering (ML4SE) system. Inspired by natural language processing, large language models (LLMs) like Codex and CodeGen treat code as generic sequences of text and are trained on huge corpora of code data, achieving state of the art performance on several software engineering (SE) tasks. However, v… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: The paper has been accepted for publication at ASE 2023 (Tool Demonstrations Track)

  6. arXiv:2305.11790  [pdf, other

    cs.CL

    Prompting with Pseudo-Code Instructions

    Authors: Mayank Mishra, Prince Kumar, Riyaz Bhat, Rudra Murthy V, Danish Contractor, Srikanth Tamilselvam

    Abstract: Prompting with natural language instructions has recently emerged as a popular method of harnessing the capabilities of large language models. Given the inherent ambiguity present in natural language, it is intuitive to consider the possible advantages of prompting with less ambiguous prompt styles, such as the use of pseudo-code. In this paper we explore if prompting via pseudo-code instruction… ▽ More

    Submitted 19 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023 main track

  7. arXiv:2112.01317  [pdf, other

    cs.SE cs.AI

    Monolith to Microservices: Representing Application Software through Heterogeneous Graph Neural Network

    Authors: Alex Mathai, Sambaran Bandyopadhyay, Utkarsh Desai, Srikanth Tamilselvam

    Abstract: Monolithic software encapsulates all functional capabilities into a single deployable unit. But managing it becomes harder as the demand for new functionalities grow. Microservice architecture is seen as an alternate as it advocates building an application through a set of loosely coupled small services wherein each service owns a single functional responsibility. But the challenges associated wit… ▽ More

    Submitted 20 May, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: The paper has been accepted for publication at IJCAI-ECAI 2022 (main research track)

  8. arXiv:2102.03827  [pdf, other

    cs.SE cs.AI cs.LG

    Graph Neural Network to Dilute Outliers for Refactoring Monolith Application

    Authors: Utkarsh Desai, Sambaran Bandyopadhyay, Srikanth Tamilselvam

    Abstract: Microservices are becoming the defacto design choice for software architecture. It involves partitioning the software components into finer modules such that the development can happen independently. It also provides natural benefits when deployed on the cloud since resources can be allocated dynamically to necessary components based on demand. Therefore, enterprises as part of their journey to cl… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: Published: AAAI 2021

  9. arXiv:2011.03901  [pdf, other

    cs.AI

    Adversarial Black-Box Attacks On Text Classifiers Using Multi-Objective Genetic Optimization Guided By Deep Networks

    Authors: Alex Mathai, Shreya Khare, Srikanth Tamilselvam, Senthil Mani

    Abstract: We propose a novel genetic-algorithm technique that generates black-box adversarial examples which successfully fool neural network based text classifiers. We perform a genetic search with multi-objective optimization guided by deep learning based inferences and Seq2Seq mutation to generate semantically similar but imperceptible adversaries. We compare our approach with DeepWordBug (DWB) on SST an… ▽ More

    Submitted 9 November, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

  10. arXiv:2011.01043  [pdf, other

    cs.SE cs.AI cs.CL

    Evaluation of Siamese Networks for Semantic Code Search

    Authors: Raunak Sinha, Utkarsh Desai, Srikanth Tamilselvam, Senthil Mani

    Abstract: With the increase in the number of open repositories and discussion forums, the use of natural language for semantic code search has become increasingly common. The accuracy of the results returned by such systems, however, can be low due to 1) limited shared vocabulary between code and user query and 2) inadequate semantic understanding of user query and its relation to code syntax. Siamese netwo… ▽ More

    Submitted 12 October, 2020; originally announced November 2020.

  11. arXiv:2002.00754  [pdf, other

    cs.CL

    Benchmarking Popular Classification Models' Robustness to Random and Targeted Corruptions

    Authors: Utkarsh Desai, Srikanth Tamilselvam, Jassimran Kaur, Senthil Mani, Shreya Khare

    Abstract: Text classification models, especially neural networks based models, have reached very high accuracy on many popular benchmark datasets. Yet, such models when deployed in real world applications, tend to perform badly. The primary reason is that these models are not tested against sufficient real world natural data. Based on the application users, the vocabulary and the style of the model's input… ▽ More

    Submitted 31 January, 2020; originally announced February 2020.

  12. arXiv:1911.11433  [pdf, other

    cs.LG cs.CV cs.IR eess.IV stat.ML

    "You might also like this model": Data Driven Approach for Recommending Deep Learning Models for Unknown Image Datasets

    Authors: Ameya Prabhu, Riddhiman Dasgupta, Anush Sankaran, Srikanth Tamilselvam, Senthil Mani

    Abstract: For an unknown (new) classification dataset, choosing an appropriate deep learning architecture is often a recursive, time-taking, and laborious process. In this research, we propose a novel technique to recommend a suitable architecture from a repository of known models. Further, we predict the performance accuracy of the recommended architecture on the given unknown dataset, without the need for… ▽ More

    Submitted 20 May, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019, New in ML Group

  13. arXiv:1911.07309  [pdf, other

    cs.LG stat.ML

    Coverage Testing of Deep Learning Models using Dataset Characterization

    Authors: Senthil Mani, Anush Sankaran, Srikanth Tamilselvam, Akshay Sethi

    Abstract: Deep Neural Networks (DNNs), with its promising performance, are being increasingly used in safety critical applications such as autonomous driving, cancer detection, and secure authentication. With growing importance in deep learning, there is a requirement for a more standardized framework to evaluate and test deep learning models. The primary challenge involved in automated generation of extens… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

  14. arXiv:1908.07190  [pdf, other

    cs.LG cs.CY stat.ML

    Compliance Change Tracking in Business Process Services

    Authors: Srikanth G Tamilselvam, Ankush Gupta, Arvind Agarwal

    Abstract: Regulatory compliance is an organization's adherence to laws, regulations, guidelines and specifications relevant to its business. Compliance officers responsible for maintaining adherence constantly struggle to keep up with the large amount of changes in regulatory requirements. Keeping up with the changes entail two main tasks: fetching the regulatory announcements that actually contain changes… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

  15. arXiv:1905.02486  [pdf, other

    cs.HC cs.LG

    A Visual Programming Paradigm for Abstract Deep Learning Model Development

    Authors: Srikanth Tamilselvam, Naveen Panwar, Shreya Khare, Rahul Aralikatte, Anush Sankaran, Senthil Mani

    Abstract: Deep learning is one of the fastest growing technologies in computer science with a plethora of applications. But this unprecedented growth has so far been limited to the consumption of deep learning experts. The primary challenge being a steep learning curve for learning the programming libraries and the lack of intuitive systems enabling non-experts to consume deep learning. Towards this goal, w… ▽ More

    Submitted 19 August, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

  16. arXiv:1709.08366  [pdf, other

    cs.AI cs.CL

    "Let me convince you to buy my product ... ": A Case Study of an Automated Persuasive System for Fashion Products

    Authors: Vitobha Munigala, Srikanth Tamilselvam, Anush Sankaran

    Abstract: Persuasivenes is a creative art aimed at making people believe in certain set of beliefs. Many a times, such creativity is about adapting richness of one domain into another to strike a chord with the target audience. In this research, we present PersuAIDE! - A persuasive system based on linguistic creativity to transform given sentence to generate various forms of persuading sentences. These vari… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: ML4Creativity workshop at SIGKDD 2017