Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Selvam, P

.
  1. arXiv:2407.13739  [pdf, other

    cs.AI cs.CL cs.SE

    Scaling Granite Code Models to 128K Context

    Authors: Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn, Tim Bula, Mayank Mishra, Adriana Meza Soria, Gaoyuan Zhang, Aditya Prasad, Yikang Shen, Saptha Surendran, Shanmukha Guttula, Hima Patel, Parameswaran Selvam, Xuan-Hong Dang, Yan Koyfman, Atin Sood, Rogerio Feris, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda

    Abstract: This paper introduces long-context Granite code models that support effective context windows of up to 128K tokens. Our solution for scaling context length of Granite 3B/8B code models from 2K/4K to 128K consists of a light-weight continual pretraining by gradually increasing its RoPE base frequency with repository-level file packing and length-upsampled long-context data. Additionally, we also re… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  3. arXiv:2304.13807  [pdf, other

    cs.NE

    A Survey on Solving and Discovering Differential Equations Using Deep Neural Networks

    Authors: Hyeonjung, Jung, Jayant Gupta, Bharat Jayaprakash, Matthew Eagon, Harish Panneer Selvam, Carl Molnar, William Northrop, Shashi Shekhar

    Abstract: Ordinary and partial differential equations (DE) are used extensively in scientific and mathematical domains to model physical systems. Current literature has focused primarily on deep neural network (DNN) based methods for solving a specific DE or a family of DEs. Research communities with a history of using DE models may view DNN-based differential equation solvers (DNN-DEs) as a faster and tran… ▽ More

    Submitted 19 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Under review for ACM Computing Surveys journal. 29 pages

  4. arXiv:2303.11733  [pdf, other

    cs.PF cs.AI cs.AR cs.DC cs.LG

    DIPPM: a Deep Learning Inference Performance Predictive Model using Graph Neural Networks

    Authors: Karthick Panner Selvam, Mats Brorsson

    Abstract: Deep Learning (DL) has developed to become a corner-stone in many everyday applications that we are now relying on. However, making sure that the DL model uses the underlying hardware efficiently takes a lot of effort. Knowledge about inference characteristics can help to find the right match so that enough resources are given to the model, but not too much. We have developed a DL Inference Perfor… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  5. arXiv:2211.10527  [pdf, other

    cs.NI eess.SP

    PMNet: Robust Pathloss Map Prediction via Supervised Learning

    Authors: Ju-Hyung Lee, Omer Gokalp Serbetci, Dheeraj Panneer Selvam, Andreas F. Molisch

    Abstract: Pathloss prediction is an essential component of wireless network planning. While ray tracing based methods have been successfully used for many years, they require significant computational effort that may become prohibitive with the increased network densification and/or use of higher frequencies in 5G/B5G (beyond 5G) systems. In this paper, we propose and evaluate a data-driven and model-free p… ▽ More

    Submitted 16 May, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  6. arXiv:2105.00375  [pdf, other

    cs.AI

    Vehicle Emissions Prediction with Physics-Aware AI Models: Preliminary Results

    Authors: Harish Panneer Selvam, Yan Li, Pengyue Wang, William F. Northrop, Shashi Shekhar

    Abstract: Given an on-board diagnostics (OBD) dataset and a physics-based emissions prediction model, this paper aims to develop an accurate and computational-efficient AI (Artificial Intelligence) method that predicts vehicle emissions. The problem is of societal importance because vehicular emissions lead to climate change and impact human health. This problem is challenging because the OBD data does not… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: Accepted by Association for Advancement of Artificial Intelligence (AAAI) Fall Symposium Series 2020: Physics-Guided AI to Accelerate Scientific Discovery (https://sites.google.com/vt.edu/pgai-aaai-20)

    Journal ref: PGAI-AAAI-20(2020)

  7. Blockchain Based Accounts Payable Platform for Goods Trade

    Authors: Krishnasuri Narayanam, Seep Goel, Abhishek Singh, Yedendra Shrinivasan, Parameswaram Selvam

    Abstract: Goods trade is a supply chain transaction that involves shippers buying goods from suppliers and carriers providing goods transportation. Shippers are issued invoices from suppliers and carriers. Shippers carry out goods receiving and invoice processing before payment processing of bills for suppliers and carriers, where invoice processing includes tasks like processing claims and adjusting the bi… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  8. arXiv:2101.12582  [pdf, other

    hep-th cond-mat.dis-nn cs.CC nlin.CD quant-ph

    Circuit Complexity From Supersymmetric Quantum Field Theory With Morse Function

    Authors: Sayantan Choudhury, Sachin Panneer Selvam, K. Shirish

    Abstract: Computation of circuit complexity has gained much attention in the Theoretical Physics community in recent times to gain insights into the chaotic features and random fluctuations of fields in the quantum regime. Recent studies of circuit complexity take inspiration from Nielsen's geometric approach, which is based on the idea of optimal quantum control in which a cost function is introduced for t… ▽ More

    Submitted 11 August, 2022; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 41 pages, 10 figures, 4 tables, This project is the part of the non-profit virtual international research consortium "Quantum Aspects of Space-Time and Matter (QASTM)", Revised version, References and some of the explanations elaborated and updated, Accepted for publication in Symmetry

    Journal ref: Symmetry 14 (2022) no. 8, 1656

  9. arXiv:2012.10234  [pdf, other

    hep-th cond-mat.dis-nn gr-qc nlin.CD quant-ph

    Circuit Complexity From Cosmological Islands

    Authors: Sayantan Choudhury, Satyaki Chowdhury, Nitin Gupta, Anurag Mishara, Sachin Panneer Selvam, Sudhakar Panda, Gabriel D. Pasquino, Chiranjeeb Singha, Abinash Swain

    Abstract: Recently in various theoretical works, path-breaking progress has been made in recovering the well-known Page Curve of an evaporating black hole with Quantum Extremal Islands, proposed to solve the long-standing black hole information loss problem related to the unitarity issue. Motivated by this concept, in this paper, we study cosmological circuit complexity in the presence (or absence) of Quant… ▽ More

    Submitted 3 July, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 75 pages, 29 figures, 4 tables, Dr. Sayantan Choudhury would like to dedicate this work to his lovable father and prime inspiration Professor Manoranjan Choudhury who recently have passed away due to COVID 19. Updated and revised version, Accepted for publication in Symmetry (section: Physics and Symmetry/Asymmetry, Special issue: Manifest and Hidden Symmetries in Field and String Theories)

    Journal ref: Symmetry 13 (2021) no. 7, 1301

  10. arXiv:2009.03893  [pdf, other

    hep-th cond-mat.dis-nn gr-qc nlin.CD quant-ph

    Quantum aspects of chaos and complexity from bouncing cosmology: A study with two-mode single field squeezed state formalism

    Authors: Parth Bhargava, Sayantan Choudhury, Satyaki Chowdhury, Anurag Mishara, Sachin Panneer Selvam, Sudhakar Panda, Gabriel D. Pasquino

    Abstract: $Circuit~ Complexity$, a well known computational technique has recently become the backbone of the physics community to probe the chaotic behaviour and random quantum fluctuations of quantum fields. This paper is devoted to the study of out-of-equilibrium aspects and quantum chaos appearing in the universe from the paradigm of two well known bouncing cosmological solutions viz. $Cosine~ hyperboli… ▽ More

    Submitted 15 September, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

    Comments: 104 pages, 41 figures, 9 tables, This project is the part of the non-profit virtual international research consortium "Quantum Aspects of Space-Time and Matter (QASTM)", Accepted for publication in SciPost Physics Core

    Journal ref: SciPost Phys. Core 4, 026 (2021)