Showing 1–14 of 14 results for author: Seshadri, A

Searching in archive cs.
  1. arXiv:2407.06324  [pdf, other]

    cs.LG cs.CL cs.NE

    B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory

    Authors: Luca Zancato, Arjun Seshadri, Yonatan Dukler, Aditya Golatkar, Yantao Shen, Benjamin Bowman, Matthew Trager, Alessandro Achille, Stefano Soatto

    Abstract: We describe a family of architectures to support transductive inference by allowing memory to grow to a finite but a-priori unknown bound while making efficient use of finite resources for inference. Current architectures use such resources to represent data either eidetically over a finite span ("context" in Transformers), or fading over an infinite span (in State Space Models, or SSMs). Recent h…

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2406.08431  [pdf, other]

    cs.CV cs.AI cs.CR cs.LG

    Diffusion Soup: Model Merging for Text-to-Image Diffusion Models

    Authors: Benjamin Biggs, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

    Abstract: We present Diffusion Soup, a compartmentalization method for Text-to-Image Generation that averages the weights of diffusion models trained on sharded data. By construction, our approach enables training-free continual learning and unlearning with no additional memory or inference costs, since models corresponding to data shards can be added or removed by re-averaging. We show that Diffusion Soup…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.00172  [pdf, other]

    cs.LG cs.SI stat.ML

    Re-visiting Skip-Gram Negative Sampling: Dimension Regularization for More Efficient Dissimilarity Preservation in Graph Embeddings

    Authors: David Liu, Arjun Seshadri, Tina Eliassi-Rad, Johan Ugander

    Abstract: A wide range of graph embedding objectives decompose into two components: one that attracts the embeddings of nodes that are perceived as similar, and another that repels embeddings of nodes that are perceived as dissimilar. Because real-world graphs are sparse and the number of dissimilar pairs grows quadratically with the number of nodes, Skip-Gram Negative Sampling (SGNS) has emerged as a popul…

    Submitted 30 April, 2024; originally announced May 2024.

  4. arXiv:2312.15081  [pdf, other]

    cs.LG cs.IR stat.ML

    Learning Rich Rankings

    Authors: Arjun Seshadri, Stephen Ragain, Johan Ugander

    Abstract: Although the foundations of ranking are well established, the ranking literature has primarily been focused on simple, unimodal models, e.g. the Mallows and Plackett-Luce models, that define distributions centered around a single total ordering. Explicit mixture models have provided some tools for modelling multimodal ranking data, though learning such models from data is often difficult. In this…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 45 pages

  5. arXiv:2307.04132  [pdf, other]

    cs.CV cs.AI cs.SC

    Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition

    Authors: Amrit Diggavi Seshadri, Alessandra Russo

    Abstract: In this work, following the intuition that adverbs describing scene-sequences are best identified by reasoning over high-level concepts of object-behavior, we propose the design of a new framework that reasons over object-behaviours extracted from raw-video-clips to recognize the clip's corresponding adverb-types. Importantly, while previous works for general scene adverb-recognition assume knowle…

    Submitted 27 March, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

  6. arXiv:2212.12645  [pdf, other]

    cs.CV cs.LG

    HandsOff: Labeled Dataset Generation With No Additional Human Annotations

    Authors: Austin Xu, Mariya I. Vasileva, Achal Dave, Arjun Seshadri

    Abstract: Recent work leverages the expressive power of generative adversarial networks (GANs) to generate labeled synthetic datasets. These dataset generation methods often require new annotations of synthetic images, which forces practitioners to seek out annotators, curate a set of synthetic images, and ensure the quality of generated labels. We introduce the HandsOff framework, a technique capable of pr…

    Submitted 30 March, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: 22 pages, 20 figures. CVPR 2023

  7. arXiv:2211.14935  [pdf, other]

    cs.IR cs.AI cs.CY cs.LG

    RecXplainer: Amortized Attribute-based Personalized Explanations for Recommender Systems

    Authors: Sahil Verma, Chirag Shah, John P. Dickerson, Anurag Beniwal, Narayanan Sadagopan, Arjun Seshadri

    Abstract: Recommender systems influence many of our interactions in the digital world -- impacting how we shop for clothes, sorting what we see when browsing YouTube or TikTok, and determining which restaurants and hotels we are shown when using hospitality platforms. Modern recommender systems are large, opaque models trained on a mixture of proprietary and open-source datasets. Naturally, issues of trust…

    Submitted 29 August, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Awarded the Best Student Paper at TEA Workshop at NeurIPS 2022

  8. arXiv:2207.12033  [pdf, other]

    cs.IR

    Contrastive Learning for Interactive Recommendation in Fashion

    Authors: Karin Sevegnani, Arjun Seshadri, Tian Wang, Anurag Beniwal, Julian McAuley, Alan Lu, Gerard Medioni

    Abstract: Recommender systems and search are both indispensable in facilitating personalization and ease of browsing in online fashion platforms. However, the two tools often operate independently, failing to combine the strengths of recommender systems to accurately capture user tastes with search systems' ability to process user queries. We propose a novel remedy to this problem by automatically recommend…

    Submitted 25 July, 2022; originally announced July 2022.

  9. On the separation of correlation-assisted sum capacities of multiple access channels

    Authors: Akshay Seshadri, Felix Leditzky, Vikesh Siddhu, Graeme Smith

    Abstract: The capacity of a channel characterizes the maximum rate at which information can be transmitted through the channel asymptotically faithfully. For a channel with multiple senders and a single receiver, computing its sum capacity is possible in theory, but challenging in practice because of the nonconvex optimization involved. To address this challenge, we investigate three topics in our study. In…

    Submitted 3 August, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: v3: 70 pages, 3 figures; to appear in IEEE Transactions on Information Theory

    Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 9, pp. 5805-5844 (2023)

  10. arXiv:2110.08143  [pdf, other]

    cs.CV

    Multi-Tailed, Multi-Headed, Spatial Dynamic Memory refined Text-to-Image Synthesis

    Authors: Amrit Diggavi Seshadri, Balaraman Ravindran

    Abstract: Synthesizing high-quality, realistic images from text-descriptions is a challenging task, and current methods synthesize images from text in a multi-stage manner, typically by first generating a rough initial image and then refining image details at subsequent stages. However, existing methods that follow this paradigm suffer from three important limitations. Firstly, they synthesize initial image…

    Submitted 15 October, 2021; originally announced October 2021.

  11. arXiv:1902.03266  [pdf, other]

    cs.LG cs.GT stat.ML

    Discovering Context Effects from Raw Choice Data

    Authors: Arjun Seshadri, Alexander Peysakhovich, Johan Ugander

    Abstract: Many applications in preference learning assume that decisions come from the maximization of a stable utility function. Yet a large experimental literature shows that individual choices and judgements can be affected by "irrelevant" aspects of the context in which they are made. An important class of such contexts is the composition of the choice set. In this work, our goal is to discover such cho…

    Submitted 31 January, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 24 pages

  12. arXiv:1806.01104  [pdf]

    cs.DC

    Consolidating the innovative concepts towards Exascale computing for Co-Design of Co-Applications ll: Co-Design Automation - Workload Characterization

    Authors: Dhanasekar, Anirudh Seshadri, Sudharshan Srinivasan, Suryanarayanan, Akash Sridhar

    Abstract: Many-core co-design is a complex task in which application complexity design space, heterogeneous many-core architecture design space, parallel programming language design space, simulator design space and optimizer design space should get integrated through a binding process and these design spaces, an ensemble of what is called many-core co-design spaces. It is indispensable to build a co-design…

    Submitted 29 April, 2018; originally announced June 2018.

    Comments: Revised Submission 2

  13. arXiv:1311.1928   

    cs.DM

    On Uni Chord Free Graphs

    Authors: Mahati Kumar, S. Manasvini, N. Sadagopan, Adithya Seshadri

    Abstract: A graph is unichord free if it does not contain a cycle with exactly one chord as its subgraph. In [3], it is shown that a graph is unichord free if and only if every minimal vertex separator is a stable set. In this paper, we first show that such a graph can be recognized in polynomial time. Further, we show that the chromatic number of unichord free graphs is one of (2,3, ω(G)). We also present…

    Submitted 24 October, 2014; v1 submitted 8 November, 2013; originally announced November 2013.

    Comments: This paper has been withdrawn due to a bug in the algorithm

  14. arXiv:1212.0240  [pdf]

    cs.MA eess.SY

    Onboard Dynamic Rail Track Safety Monitoring System

    Authors: Abhisekh Jain, Arvind Seshadri, Balaji B. S, Ramviyas Parasuraman

    Abstract: This proposal aims at solving one of the long prevailing problems in the Indian Railways. This simple method of continuous monitoring and assessment of the condition of the rail tracks can prevent major disasters and save precious human lives. Our method is capable of alerting the train in case of any dislocations in the track or change in strength of the soil. Also it can avert the collisions of…

    Submitted 21 October, 2014; v1 submitted 2 December, 2012; originally announced December 2012.

    Comments: International Conference on Advanced Communication Systems 2007, Coimbatore, India