Zum Hauptinhalt springen

Showing 1–50 of 71 results for author: Garg, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.06050  [pdf, other

    cs.LG q-bio.BM

    What Ails Generative Structure-based Drug Design: Too Little or Too Much Expressivity?

    Authors: Rafał Karczewski, Samuel Kaski, Markus Heinonen, Vikas Garg

    Abstract: Several generative models with elaborate training and sampling procedures have been proposed recently to accelerate structure-based drug design (SBDD); however, perplexingly, their empirical performance turns out to be suboptimal. We seek to better understand this phenomenon from both theoretical and empirical perspectives. Since most of these models apply graph neural networks (GNNs), one may sus… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 25 pages, 11 figures

  2. arXiv:2408.04118  [pdf, ps, other

    cs.DS cs.DC cs.DM math.OC

    Reducing Matroid Optimization to Basis Search

    Authors: Robert Streit, Vijay K. Garg

    Abstract: In combinatorial optimization, matroids provide one of the most elegant structures for algorithm design. This is perhaps best identified by the Edmonds-Rado theorem relating the success of the simple greedy algorithm to the anatomy of the optimal basis of a matroid [Edm71; Rad57]. As a response, much energy has been devoted to understanding a matroid's favorable computational properties. Yet surpr… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 43 pages, 7 figures, 3 algorithms

    ACM Class: G.2.1; F.2.0

  3. arXiv:2406.09443  [pdf, other

    eess.AS cs.HC cs.LG

    Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness

    Authors: Satyam Kumar, Sai Srujana Buddi, Utkarsh Oggy Sarawgi, Vineet Garg, Shivesh Ranjan, Ognjen, Rudovic, Ahmed Hussen Abdelaziz, Saurabh Adya

    Abstract: Voice activity detection (VAD) is a critical component in various applications such as speech recognition, speech enhancement, and hands-free communication systems. With the increasing demand for personalized and context-aware technologies, the need for effective personalized VAD systems has become paramount. In this paper, we present a comparative analysis of Personalized Voice Activity Detection… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.03164  [pdf, other

    cs.LG

    Topological Neural Networks go Persistent, Equivariant, and Continuous

    Authors: Yogesh Verma, Amauri H Souza, Vikas Garg

    Abstract: Topological Neural Networks (TNNs) incorporate higher-order relational information beyond pairwise interactions, enabling richer representations than Graph Neural Networks (GNNs). Concurrently, topological descriptors based on persistent homology (PH) are being increasingly employed to augment the GNNs. We investigate the benefits of integrating these two paradigms. Specifically, we introduce TopN… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  5. arXiv:2405.17656  [pdf, other

    cs.LG q-bio.QM

    Alignment is Key for Applying Diffusion Models to Retrosynthesis

    Authors: Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg

    Abstract: Retrosynthesis, the task of identifying precursors for a given molecule, can be naturally framed as a conditional graph generation task. Diffusion models are a particularly promising modelling approach, enabling post-hoc conditioning and trading off quality for speed during generation. We show mathematically that permutation equivariant denoisers severely limit the expressiveness of graph diffusio… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 9 figures

  6. arXiv:2405.14657  [pdf, other

    cs.LG stat.ML

    Heteroscedastic Preferential Bayesian Optimization with Informative Noise Distributions

    Authors: Marshal Arijona Sinaga, Julien Martinelli, Vikas Garg, Samuel Kaski

    Abstract: Preferential Bayesian optimization (PBO) is a sample-efficient framework for learning human preferences between candidate designs. PBO classically relies on homoscedastic noise models to represent human aleatoric uncertainty. Yet, such noise fails to accurately capture the varying levels of human aleatoric uncertainty, particularly when the user possesses partial knowledge among different pairs of… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.00389  [pdf, other

    math.OC cs.LG eess.SY

    Employing Federated Learning for Training Autonomous HVAC Systems

    Authors: Fredrik Hagström, Vikas Garg, Fabricio Oliveira

    Abstract: Buildings account for 40 % of global energy consumption. A considerable portion of building energy consumption stems from heating, ventilation, and air conditioning (HVAC), and thus implementing smart, energy-efficient HVAC systems has the potential to significantly impact the course of climate change. In recent years, model-free reinforcement learning algorithms have been increasingly assessed fo… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  8. arXiv:2404.13521  [pdf, other

    cs.HC cs.AI cs.CV cs.LG

    Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces

    Authors: Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta

    Abstract: Present-day graphical user interfaces (GUIs) exhibit diverse arrangements of text, graphics, and interactive elements such as buttons and menus, but representations of GUIs have not kept up. They do not encapsulate both semantic and visuo-spatial relationships among elements. To seize machine learning's potential for GUIs more efficiently, Graph4GUI exploits graph neural networks to capture indivi… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 18 pages

  9. arXiv:2404.10024  [pdf, other

    cs.AI cs.ET cs.LG physics.ao-ph

    ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs

    Authors: Yogesh Verma, Markus Heinonen, Vikas Garg

    Abstract: Climate and weather prediction traditionally relies on complex numerical simulations of atmospheric physics. Deep learning approaches, such as transformers, have recently challenged the simulation paradigm with complex network forecasts. However, they often act as data-driven black-box models that neglect the underlying physics and lack uncertainty quantification. We address these limitations with… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted as ICLR 2024 Oral. Project website: https://yogeshverma1998.github.io/ClimODE/

  10. arXiv:2402.15864  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    Field-based Molecule Generation

    Authors: Alexandru Dumitrescu, Dani Korpela, Markus Heinonen, Yogesh Verma, Valerii Iakovlev, Vikas Garg, Harri Lähdesmäki

    Abstract: This work introduces FMG, a field-based model for drug-like molecule generation. We show how the flexibility of this method provides crucial advantages over the prevalent, point-cloud based methods, and achieves competitive molecular stability generation. We tackle optical isomerism (enantiomers), a previously omitted molecular property that is crucial for drug safety and effectiveness, and thus a… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 15 pages, 14 figures

  11. arXiv:2312.16045  [pdf, other

    cs.LG cs.AI

    Algebraic Positional Encodings

    Authors: Konstantinos Kogkalidis, Jean-Philippe Bernardy, Vikas Garg

    Abstract: We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework provides a flexible mapping from the algebraic specification of a domain to an interpretation as orthogonal operators. This design preserves the algebraic characteristics of the source domain, ensuring that the model upholds the desired st… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  12. arXiv:2311.06206  [pdf, ps, other

    cs.DC cs.DS

    Parallel Algorithms for Equilevel Predicates

    Authors: Vijay K. Garg, Robert P. Streit

    Abstract: We define a new class of predicates called equilevel predicates on a distributive lattice which eases the analysis of parallel algorithms. Many combinatorial problems such as the vertex cover problem, the bipartite matching problem, and the minimum spanning tree problem can be modeled as detecting an equilevel predicate. The problem of detecting an equilevel problem is NP-complete, but equilevel p… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: To appear in ICDCN 2024

  13. arXiv:2311.06152  [pdf, other

    cs.LG cs.AI

    Going beyond persistent homology using persistent homology

    Authors: Johanna Immonen, Amauri H. Souza, Vikas Garg

    Abstract: Representational limits of message-passing graph neural networks (MP-GNNs), e.g., in terms of the Weisfeiler-Leman (WL) test for isomorphism, are well understood. Augmenting these graph models with topological features via persistent homology (PH) has gained prominence, but identifying the class of attributed graphs that PH can recognize remains open. We introduce a novel concept of color-separati… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  14. Streaming Anchor Loss: Augmenting Supervision with Temporal Significance

    Authors: Utkarsh Oggy Sarawgi, John Berkowitz, Vineet Garg, Arnav Kundu, Minsik Cho, Sai Srujana Buddi, Saurabh Adya, Ahmed Tewfik

    Abstract: Streaming neural network models for fast frame-wise responses to various speech and sensory signals are widely adopted on resource-constrained platforms. Hence, increasing the learning capacity of such streaming models (i.e., by adding more parameters) to improve the predictive power may not be viable for real-world tasks. In this work, we propose a new loss, Streaming Anchor Loss (SAL), to better… ▽ More

    Submitted 18 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Published at IEEE ICASSP 2024, please see https://ieeexplore.ieee.org/abstract/document/10447222

    ACM Class: I.2.6; I.5.1; I.5.4; I.6.5

    Journal ref: In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6110-6114). IEEE

  15. arXiv:2309.16115  [pdf, other

    cs.LG

    Compositional Sculpting of Iterative Generative Processes

    Authors: Timur Garipov, Sebastiaan De Peuter, Ge Yang, Vikas Garg, Samuel Kaski, Tommi Jaakkola

    Abstract: High training costs of generative models and the need to fine-tune them for specific tasks have created a strong interest in model reuse and composition. A key challenge in composing iterative generative processes, such as GFlowNets and diffusion models, is that to realize the desired target distribution, all steps of the generative process need to be coordinated, and satisfy delicate balance cond… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Extended version of NeurIPS 2023 paper

  16. arXiv:2309.16060  [pdf, other

    eess.AS cs.SD

    Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study

    Authors: Avamarie Brueggeman, Takuya Higuchi, Masood Delfarah, Stephen Shum, Vineet Garg

    Abstract: Noise robustness is a key aspect of successful speech applications. Speech enhancement (SE) has been investigated to improve automatic speech recognition accuracy; however, its effectiveness for keyword spotting (KWS) is still under-investigated. In this paper, we conduct a comprehensive study on single-channel speech enhancement for keyword spotting on the Google Speech Command (GSC) dataset. To… ▽ More

    Submitted 21 February, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  17. arXiv:2309.04842  [pdf, other

    cs.CL cs.HC cs.SD eess.AS

    Leveraging Large Language Models for Exploiting ASR Uncertainty

    Authors: Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed Tewfik

    Abstract: While large language models excel in a variety of natural language processing (NLP) tasks, to perform well on spoken language understanding (SLU) tasks, they must either rely on off-the-shelf automatic speech recognition (ASR) systems for transcription, or be equipped with an in-built speech modality. This work focuses on the former scenario, where LLM's accuracy on SLU tasks is constrained by the… ▽ More

    Submitted 12 September, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Added references

  18. arXiv:2306.01005  [pdf, other

    cs.LG cs.AI q-bio.BM

    AbODE: Ab Initio Antibody Design using Conjoined ODEs

    Authors: Yogesh Verma, Markus Heinonen, Vikas Garg

    Abstract: Antibodies are Y-shaped proteins that neutralize pathogens and constitute the core of our adaptive immune system. De novo generation of new antibodies that target specific antigens holds the key to accelerating vaccine discovery. However, this co-design of the amino acid sequence and the 3D structure subsumes and accentuates some central challenges from multiple tasks, including protein folding (s… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  19. arXiv:2303.12808  [pdf, other

    cs.CL cs.AI

    PACO: Provocation Involving Action, Culture, and Oppression

    Authors: Vaibhav Garg, Ganning Xu, Munindar P. Singh

    Abstract: In India, people identify with a particular group based on certain attributes such as religion. The same religious groups are often provoked against each other. Previous studies show the role of provocation in increasing tensions between India's two prominent religious groups: Hindus and Muslims. With the advent of the Internet, such provocation also surfaced on social media platforms such as What… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  20. arXiv:2303.10795  [pdf, other

    cs.CR

    iRogue: Identifying Rogue Behavior from App Reviews

    Authors: Vaibhav Garg, Hui Guo, Nirav Ajmeri, Saikath Bhattacharya, Munindar P. Singh

    Abstract: An app user can access information of other users or third parties. We define rogue mobile apps as those that enable a user (abuser) to access information of another user or third party (victim), in a way that violates the victim's privacy expectations. Such apps are dual-use and their identification is nontrivial. We propose iRogue, an approach for identifying rogue apps based on their reviews, p… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  21. arXiv:2303.10573  [pdf, other

    cs.CL

    Extracting Incidents, Effects, and Requested Advice from MeToo Posts

    Authors: Vaibhav Garg, Jiaqing Yuan, Rujie Xi, Munindar P. Singh

    Abstract: Survivors of sexual harassment frequently share their experiences on social media, revealing their feelings and emotions and seeking advice. We observed that on Reddit, survivors regularly share long posts that describe a combination of (i) a sexual harassment incident, (ii) its effect on the survivor, including their feelings and emotions, and (iii) the advice being sought. We term such posts MeT… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  22. arXiv:2210.06032  [pdf, other

    cs.LG cs.ET q-bio.BM stat.ML

    Modular Flows: Differential Molecular Generation

    Authors: Yogesh Verma, Samuel Kaski, Markus Heinonen, Vikas Garg

    Abstract: Generating new molecules is fundamental to advancing critical applications such as drug discovery and material synthesis. Flows can generate molecules effectively by inverting the encoding process, however, existing flow models either require artifactual dequantization or specific node/edge orderings, lack desiderata such as permutation invariance, or induce discrepancy between the encoding and th… ▽ More

    Submitted 13 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022. More info at: https://yogeshverma1998.github.io/ModFlow/

  23. arXiv:2209.15059  [pdf, other

    cs.LG

    Provably expressive temporal graph networks

    Authors: Amauri H. Souza, Diego Mesquita, Samuel Kaski, Vikas Garg

    Abstract: Temporal graph networks (TGNs) have gained prominence as models for embedding dynamic interactions, but little is known about their theoretical underpinnings. We establish fundamental results about the representational power and limits of the two main categories of TGNs: those that aggregate temporal walks (WA-TGNs), and those that augment local message passing with recurrent memory modules (MP-TG… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  24. arXiv:2208.01370  [pdf, other

    cs.DS cs.DC

    Lattice Linear Predicate Algorithms for the Constrained Stable Marriage Problem with Ties

    Authors: Vijay K. Garg

    Abstract: We apply Lattice-Linear Predicate Detection Technique to derive parallel and distributed algorithms for various variants of the stable matching problem. These problems are: (a) the constrained stable marriage problem (b) the super stable marriage problem in presence of ties, and (c) the strongly stable marriage in presence of ties. All these problems are solved using the Lattice-Linear Predicate (… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:1812.10431

  25. arXiv:2205.09838  [pdf, ps, other

    cs.LG stat.ML

    Why GANs are overkill for NLP

    Authors: David Alvarez-Melis, Vikas Garg, Adam Tauman Kalai

    Abstract: This work offers a novel theoretical perspective on why, despite numerous attempts, adversarial approaches to generative modeling (e.g., GANs) have not been as popular for certain generation tasks, particularly sequential tasks such as Natural Language Generation, as they have in others, such as Computer Vision. In particular, on sequential data such as text, maximum-likelihood approaches are sign… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  26. arXiv:2203.15975  [pdf, other

    eess.AS cs.HC cs.LG cs.SD

    Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

    Authors: Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik

    Abstract: We address the problem of detecting speech directed to a device that does not contain a specific wake-word. Specifically, we focus on audio coming from a touch-based invocation. Mitigating virtual assistants (VAs) activation due to accidental button presses is critical for user experience. While the majority of approaches to false trigger mitigation (FTM) are designed to detect the presence of a t… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Submitted to INTERSPEECH 2022

  27. arXiv:2111.15140  [pdf, other

    cs.CV

    Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems

    Authors: Sahib Majithia, Sandeep N. Parameswaran, Sadbhavana Babar, Vikram Garg, Astitva Srivastava, Avinash Sharma

    Abstract: In this paper, we develop a robust 3D garment digitization solution that can generalize well on real-world fashion catalog images with cloth texture occlusions and large body pose variations. We assumed fixed topology parametric template mesh models for known types of garments (e.g., T-shirts, Trousers) and perform mapping of high-quality texture from an input catalog image to UV map panels corres… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

  28. arXiv:2111.05120  [pdf

    eess.SP cs.AI cs.LG

    A Deep Learning Technique using Low Sampling rate for residential Non Intrusive Load Monitoring

    Authors: Ronak Aghera, Sahil Chilana, Vishal Garg, Raghunath Reddy

    Abstract: Individual device loads and energy consumption feedback is one of the important approaches for pursuing users to save energy in residences. This can help in identifying faulty devices and wasted energy by devices when left On unused. The main challenge is to identity and estimate the energy consumption of individual devices without intrusive sensors on each device. Non-intrusive load monitoring (N… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

  29. arXiv:2110.15559  [pdf, ps, other

    cs.GT

    Minimal Envy Matchings in the Hospitals/Residents Problem with Lower Quotas

    Authors: Changyong Hu, Vijay K. Garg

    Abstract: In the Hospitals/Residents problem, every hospital has an upper quota that limits the number of residents assigned to it. While, in some applications, each hospital also has a lower quota for the number of residents it receives. In this setting, a stable matching may not exist. Envy-freeness is introduced as a relaxation of stability that allows blocking pairs involving a resident and an empty pos… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

  30. arXiv:2110.07981  [pdf, other

    cs.LG cs.AI

    Reappraising Domain Generalization in Neural Networks

    Authors: Sarath Sivaprasad, Akshay Goindani, Vaibhav Garg, Ritam Basu, Saiteja Kosgi, Vineet Gandhi

    Abstract: Given that Neural Networks generalize unreasonably well in the IID setting (with benign overfitting and betterment in performance with more parameters), OOD presents a consistent failure case to better the understanding of how they learn. This paper focuses on Domain Generalization (DG), which is perceived as the front face of OOD generalization. We find that the presence of multiple domains incen… ▽ More

    Submitted 28 April, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  31. arXiv:2110.04656  [pdf, other

    cs.SD cs.LG eess.AS

    Streaming on-device detection of device directed speech from voice and touch-based invocation

    Authors: Ognjen Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar

    Abstract: When interacting with smart devices such as mobile phones or wearables, the user typically invokes a virtual assistant (VA) by saying a keyword or by pressing a button on the device. However, in many cases, the VA can accidentally be invoked by the keyword-like speech or accidental button press, which may have implications on user experience and privacy. To this end, we propose an acoustic false-t… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

  32. arXiv:2105.09602  [pdf, ps, other

    cs.DM cs.DS

    Characterization of Super-stable Matchings

    Authors: Changyong Hu, Vijay K. Garg

    Abstract: An instance of the super-stable matching problem with incomplete lists and ties is an undirected bipartite graph $G = (A \cup B, E)$, with an adjacency list being a linearly ordered list of ties. Ties are subsets of vertices equally good for a given vertex. An edge $(x,y) \in E \backslash M$ is a blocking edge for a matching $M$ if by getting matched to each other neither of the vertices $x$ and… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  33. arXiv:2105.06598  [pdf, other

    eess.AS cs.HC cs.LG cs.SD

    Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

    Authors: Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir

    Abstract: We present a unified and hardware efficient architecture for two stage voice trigger detection (VTD) and false trigger mitigation (FTM) tasks. Two stage VTD systems of voice assistants can get falsely activated to audio segments acoustically similar to the trigger phrase of interest. FTM systems cancel such activations by using post trigger audio context. Traditional FTM systems rely on automatic… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  34. arXiv:2104.07574  [pdf, ps, other

    cs.DC

    Who Needs Consensus? A Distributed Monetary System Between Rational Agents via Hearsay

    Authors: Yanni Georghiades, Robert Streit, Vijay Garg

    Abstract: We propose a novel distributed monetary system called Hearsay that tolerates both Byzantine and rational behavior without the need for agents to reach consensus on executed transactions. Recent work [5, 10, 15] has shown that distributed monetary systems do not require consensus and can operate using a broadcast primitive with weaker guarantees, such as reliable broadcast. However, these protocols… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  35. arXiv:2103.06264  [pdf, other

    cs.DC cs.DS

    A Lattice Linear Predicate Parallel Algorithm for the Dynamic Programming Problems

    Authors: Vijay K. Garg

    Abstract: It has been shown that the parallel Lattice Linear Predicate (LLP) algorithm solves many combinatorial optimization problems such as the shortest path problem, the stable marriage problem and the market clearing price problem. In this paper, we give the parallel LLP algorithm for many dynamic programming problems. In particular, we show that the LLP algorithm solves the longest subsequence problem… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

  36. arXiv:2010.15446  [pdf, other

    eess.AS cs.HC cs.LG cs.SD

    Progressive Voice Trigger Detection: Accuracy vs Latency

    Authors: Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg

    Abstract: We present an architecture for voice trigger detection for virtual assistants. The main idea in this work is to exploit information in words that immediately follow the trigger phrase. We first demonstrate that by including more audio context after a detected trigger phrase, we can indeed get a more accurate decision. However, waiting to listen to more audio each time incurs a latency increase. Pr… ▽ More

    Submitted 2 March, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: Camera Ready Version: ICASSP 2021

  37. arXiv:2008.11837  [pdf, other

    cs.DC

    Amortized Constant Round Atomic Snapshot in Message-Passing Systems

    Authors: Vijay Garg, Saptaparni Kumar, Lewis Tseng, Xiong Zheng

    Abstract: We study the lattice agreement (LA) and atomic snapshot problems in asynchronous message-passing systems where up to $f$ nodes may crash. Our main result is a crash-tolerant atomic snapshot algorithm with \textit{amortized constant round complexity}. To the best of our knowledge, the best prior result is given by Delporte et al. [TPDS, 18] with amortized $O(n)$ complexity if there are more scans t… ▽ More

    Submitted 29 August, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

  38. arXiv:2008.02323  [pdf, other

    eess.AS cs.HC cs.LG cs.SD

    Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering

    Authors: Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir

    Abstract: We consider the design of two-pass voice trigger detection systems. We focus on the networks in the second pass that are used to re-score candidate segments obtained from the first-pass. Our baseline is an acoustic model(AM), with BiLSTM layers, trained by minimizing the CTC loss. We replace the BiLSTM layers with self-attention layers. Results on internal evaluation sets show that self-attention… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: INTERSPEECH, 2020

  39. arXiv:2007.07121  [pdf, ps, other

    cs.GT cs.DM

    Improved Paths to Stability for the Stable Marriage Problem

    Authors: Vijay Kumar Garg, Changyong Hu

    Abstract: The stable marriage problem requires one to find a marriage with no blocking pair. Given a matching that is not stable, Roth and Vande Vate have shown that there exists a sequence of matchings that leads to a stable matching in which each successive matching is obtained by satisfying a blocking pair. The sequence produced by Roth and Vande Vate's algorithm is of length $O(n^3)$ where $n$ is the nu… ▽ More

    Submitted 16 May, 2023; v1 submitted 14 July, 2020; originally announced July 2020.

  40. arXiv:2002.06779  [pdf, other

    cs.DC

    Byzantine Lattice Agreement in Asynchronous Systems

    Authors: Xiong Zheng, Vijay Garg

    Abstract: We study the Byzantine lattice agreement (BLA) problem in asynchronous distributed message passing systems. In the BLA problem, each process proposes a value from a join semi-lattice and needs to output a value also in the lattice such that all output values of correct processes lie on a chain despite the presence of Byzantine processes. We present an algorithm for this problem with round complexi… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

  41. arXiv:2002.06157  [pdf, ps, other

    cs.LG stat.ML

    Generalization and Representational Limits of Graph Neural Networks

    Authors: Vikas K. Garg, Stefanie Jegelka, Tommi Jaakkola

    Abstract: We address two fundamental questions about graph neural networks (GNNs). First, we prove that several important graph properties cannot be computed by GNNs that rely entirely on local information. Such GNNs include the standard message passing models, and more powerful spatial variants that exploit local graph structure (e.g., via relative orientation of messages, or local port ordering) to distin… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

  42. arXiv:2002.05660  [pdf, other

    cs.LG stat.ML

    Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

    Authors: Vikas K. Garg, Adam Kalai, Katrina Ligett, Zhiwei Steven Wu

    Abstract: Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a meta-distribution over data distributions, and those data distributions may even have different supports. In our model, the training data given to a learning algorithm c… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  43. arXiv:2001.03133  [pdf, other

    cs.DM cs.MA

    A Generalization of Teo and Sethuraman's Median Stable Marriage Theorem

    Authors: Vijay K. Garg

    Abstract: Let $L$ be any finite distributive lattice and $B$ be any boolean predicate defined on $L$ such that the set of elements satisfying $B$ is a sublattice of $L$. Consider any subset $M$ of $L$ of size $k$ of elements of $L$ that satisfy $B$. Then, we show that $k$ generalized median elements generated from $M$ also satisfy $B$. We call this result generalized median theorem on finite distributive la… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: 5 pages

  44. arXiv:1910.14141  [pdf, other

    cs.DC

    Byzantine Lattice Agreement in Synchronous Systems

    Authors: Xiong Zheng, Vijay Garg

    Abstract: In this paper, we study the Byzantine lattice agreement problem in synchronous systems. The lattice agreement problem in crash failure model has been studied both in synchronous and asynchronous systems, which leads to the current best upper bound of $O(\log f)$ rounds in both systems. However, very few algorithmic results are known for the lattice agreement problem in Byzantine failure model. The… ▽ More

    Submitted 16 February, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

  45. arXiv:1910.13386  [pdf, ps, other

    cs.DS cs.DC

    NC Algorithms for Popular Matchings in One-Sided Preference Systems and Related Problems

    Authors: Changyong Hu, Vijay K. Garg

    Abstract: The popular matching problem is of matching a set of applicants to a set of posts, where each applicant has a preference list, ranking a non-empty subset of posts in the order of preference, possibly with ties. A matching M is popular if there is no other matching M' such that more applicants prefer M' to M. We give the first NC algorithm to solve the popular matching problem without ties. We also… ▽ More

    Submitted 20 December, 2019; v1 submitted 23 October, 2019; originally announced October 2019.

  46. arXiv:1908.10408  [pdf, other

    cs.LG cs.IR stat.ML

    Multiresolution Transformer Networks: Recurrence is Not Essential for Modeling Hierarchical Structure

    Authors: Vikas K. Garg, Inderjit S. Dhillon, Hsiang-Fu Yu

    Abstract: The architecture of Transformer is based entirely on self-attention, and has been shown to outperform models that employ recurrence on sequence transduction tasks such as machine translation. The superior performance of Transformer has been attributed to propagating signals over shorter distances, between positions in the input and the output, compared to the recurrent architectures. We establish… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: Initial version

  47. arXiv:1906.12159  [pdf, other

    cs.CV cs.LG

    Teaching DNNs to design fast fashion

    Authors: Abhinav Ravi, Arun Patro, Vikram Garg, Anoop Kolar Rajagopal, Aruna Rajan, Rajdeep Hazra Banerjee

    Abstract: $ $"Fast Fashion" spearheads the biggest disruption in fashion that enabled to engineer resilient supply chains to quickly respond to changing fashion trends. The conventional design process in commercial manufacturing is often fed through "trends" or prevailing modes of dressing around the world that indicate sudden interest in a new form of expression, cyclic patterns, and popular modes of expre… ▽ More

    Submitted 3 July, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 8 pages, 9 figures, KDD conference

  48. arXiv:1905.12169  [pdf, other

    cs.LG stat.ML

    Strategic Prediction with Latent Aggregative Games

    Authors: Vikas K. Garg, Tommi Jaakkola

    Abstract: We introduce a new class of context dependent, incomplete information games to serve as structured prediction models for settings with significant strategic interactions. Our games map the input context to outcomes by first condensing the input into private player types that specify the utilities, weighted interactions, as well as the initial strategies for the players. The game is played over mul… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  49. arXiv:1905.12158  [pdf, other

    cs.LG stat.ML

    Solving graph compression via optimal transport

    Authors: Vikas K. Garg, Tommi Jaakkola

    Abstract: We propose a new approach to graph compression by appeal to optimal transport. The transport problem is seeded with prior information about node importance, attributes, and edges in the graph. The transport formulation can be setup for either directed or undirected graphs, and its dual characterization is cast in terms of distributions over the nodes. The compression pertains to the support of nod… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  50. arXiv:1905.03111  [pdf, other

    cs.DC

    Parallel and Distributed Algorithms for the housing allocation Problem

    Authors: Xiong Zheng, Vijay Garg

    Abstract: We give parallel and distributed algorithms for the housing allocation problem. In this problem, there is a set of agents and a set of houses. Each agent has a strict preference list for a subset of houses. We need to find a matching such that some criterion is optimized. One such criterion is Pareto Optimality. A matching is Pareto optimal if no coalition of agents can be strictly better off by e… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.