Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Ramesh, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13922  [pdf, other

    cs.CV cs.AI cs.LG

    Synthetic Counterfactual Faces

    Authors: Guruprasad V Ramesh, Harrison Rosenberg, Ashish Hooda, Shimaa Ahmed Kassem Fawaz

    Abstract: Computer vision systems have been deployed in various applications involving biometrics like human faces. These systems can identify social media users, search for missing persons, and verify identity of individuals. While computer vision models are often evaluated for accuracy on available benchmarks, more annotated data is necessary to learn about their robustness and fairness against semantic d… ▽ More

    Submitted 29 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Paper under review. Full text and results will be updated after acceptance

  2. arXiv:2405.13077  [pdf, other

    cs.CR cs.AI cs.CL

    GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation

    Authors: Govind Ramesh, Yao Dou, Wei Xu

    Abstract: Research on jailbreaking has been valuable for testing and understanding the safety and security issues of large language models (LLMs). In this paper, we introduce Iterative Refinement Induced Self-Jailbreak (IRIS), a novel approach that leverages the reflective capabilities of LLMs for jailbreaking with only black-box access. Unlike previous methods, IRIS simplifies the jailbreaking process by u… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  3. arXiv:2309.07277  [pdf, ps, other

    cs.CV cs.LG

    Limitations of Face Image Generation

    Authors: Harrison Rosenberg, Shimaa Ahmed, Guruprasad V Ramesh, Ramya Korlakai Vinayak, Kassem Fawaz

    Abstract: Text-to-image diffusion models have achieved widespread popularity due to their unprecedented image generation capability. In particular, their ability to synthesize and modify human faces has spurred research into using generated face images in both training data augmentation and model performance assessments. In this paper, we study the efficacy and shortcomings of generative models in the conte… ▽ More

    Submitted 21 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  4. arXiv:2308.02013  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Federated Representation Learning for Automatic Speech Recognition

    Authors: Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo

    Abstract: Federated Learning (FL) is a privacy-preserving paradigm, allowing edge devices to learn collaboratively without sharing data. Edge devices like Alexa and Siri are prospective sources of unlabeled audio data that can be tapped to learn robust audio representations. In this work, we bring Self-supervised Learning (SSL) and FL together to learn representations for Automatic Speech Recognition respec… ▽ More

    Submitted 7 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted at ISCA SPSC Symposium 3rd Symposium on Security and Privacy in Speech Communication, 2023

  5. arXiv:2307.00335  [pdf, other

    cs.CL cs.LG

    Single Sequence Prediction over Reasoning Graphs for Multi-hop QA

    Authors: Gowtham Ramesh, Makesh Sreedhar, Junjie Hu

    Abstract: Recent generative approaches for multi-hop question answering (QA) utilize the fusion-in-decoder method~\cite{izacard-grave-2021-leveraging} to generate a single sequence output which includes both a final answer and a reasoning path taken to arrive at that answer, such as passage titles and key facts from those passages. While such models can lead to better interpretability and high quantitative… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

  6. arXiv:2212.05409  [pdf, other

    cs.CL

    Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages

    Authors: Sumanth Doddapaneni, Rahul Aralikatte, Gowtham Ramesh, Shreya Goyal, Mitesh M. Khapra, Anoop Kunchukuttan, Pratyush Kumar

    Abstract: Building Natural Language Understanding (NLU) capabilities for Indic languages, which have a collective speaker base of more than one billion speakers is absolutely crucial. In this work, we aim to improve the NLU capabilities of Indic languages by making contributions along 3 important axes (i) monolingual corpora (ii) NLU testsets (iii) multilingual LLMs focusing on Indic languages. Specifically… ▽ More

    Submitted 24 May, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  7. arXiv:2111.03945  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Building ASR Systems for the Next Billion Users

    Authors: Tahir Javed, Sumanth Doddapaneni, Abhigyan Raman, Kaushal Santosh Bhogale, Gowtham Ramesh, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

    Abstract: Recent methods in speech and language technology pretrain very LARGE models which are fine-tuned for specific tasks. However, the benefits of such LARGE models are often limited to a few resource rich languages of the world. In this work, we make multiple contributions towards building ASR systems for low resource languages from the Indian subcontinent. First, we curate 17,000 hours of raw speech… ▽ More

    Submitted 22 December, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

  8. arXiv:2110.04711  [pdf, other

    cs.LG cs.CL

    SuperShaper: Task-Agnostic Super Pre-training of BERT Models with Variable Hidden Dimensions

    Authors: Vinod Ganesan, Gowtham Ramesh, Pratyush Kumar

    Abstract: Task-agnostic pre-training followed by task-specific fine-tuning is a default approach to train NLU models. Such models need to be deployed on devices across the cloud and the edge with varying resource and accuracy constraints. For a given task, repeating pre-training and fine-tuning across tens of devices is prohibitively expensive. We propose SuperShaper, a task agnostic pre-training approach w… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

  9. arXiv:2107.00676  [pdf, other

    cs.CL

    A Primer on Pretrained Multilingual Language Models

    Authors: Sumanth Doddapaneni, Gowtham Ramesh, Mitesh M. Khapra, Anoop Kunchukuttan, Pratyush Kumar

    Abstract: Multilingual Language Models (\MLLMs) such as mBERT, XLM, XLM-R, \textit{etc.} have emerged as a viable option for bringing the power of pretraining to a large number of languages. Given their success in zero-shot transfer learning, there has emerged a large body of work in (i) building bigger \MLLMs~covering a large number of languages (ii) creating exhaustive benchmarks covering a wider variety… ▽ More

    Submitted 23 December, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  10. arXiv:2104.05596  [pdf

    cs.CL

    Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages

    Authors: Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Srihari Nagaraj, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra

    Abstract: We present Samanantar, the largest publicly available parallel corpora collection for Indic languages. The collection contains a total of 49.7 million sentence pairs between English and 11 Indic languages (from two language families). Specifically, we compile 12.4 million sentence pairs from existing, publicly-available parallel corpora, and additionally mine 37.4 million sentence pairs from the w… ▽ More

    Submitted 12 June, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to the Transactions of the Association for Computational Linguistics (TACL)

  11. arXiv:1304.7025  [pdf, other

    cs.IT

    Recovery of bilevel causal signals with finite rate of innovation using positive sampling kernels

    Authors: Gayatri Ramesh, Elie Atallah, Qiyu Sun

    Abstract: Bilevel signal $x$ with maximal local rate of innovation $R$ is a continuous-time signal that takes only two values 0 and 1 and that there is at most one transition position in any time period of 1/R.In this note, we introduce a recovery method for bilevel causal signals $x$ with maximal local rate of innovation $R$ from their uniform samples $x*h(nT), n\ge 1$, where the sampling kernel $h$ is cau… ▽ More

    Submitted 25 April, 2013; originally announced April 2013.

  12. arXiv:0912.0602  [pdf

    cs.NI

    A Reliable and Fault Tolerant Routing for Optical WDM Networks

    Authors: G. Ramesh, S. SundaraVadivelu

    Abstract: In optical WDM networks, since each lightpath can carry a huge mount of traffic, failures may seriously damage the end user applications. Hence fault tolerance becomes an important issue on these networks. The light path which carries traffic during normal operation is called as primary path. The traffic is rerouted on a backup path in case of a failure. In this paper we propose to design a reli… ▽ More

    Submitted 3 December, 2009; originally announced December 2009.

    Comments: 7 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS November 2009, ISSN 1947 5500, http://sites.google.com/site/ijcsis/

    Report number: ISSN 1947 5500

    Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 6, No. 2, pp. 048-054, November 2009, USA