Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Dwivedi, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14418  [pdf, other

    cs.CL cs.AI

    MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues

    Authors: Kuluhan Binici, Abhinav Ramesh Kashyap, Viktor Schlegel, Andy T. Liu, Vijay Prakash Dwivedi, Thanh-Tung Nguyen, Xiaoxue Gao, Nancy F. Chen, Stefan Winkler

    Abstract: Automatic Speech Recognition (ASR) systems are pivotal in transcribing speech into text, yet the errors they introduce can significantly degrade the performance of downstream tasks like summarization. This issue is particularly pronounced in clinical dialogue summarization, a low-resource domain where supervised data for fine-tuning is scarce, necessitating the use of ASR models as black-box solut… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.12095  [pdf, other

    cs.CL cs.AI cs.LG

    uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

    Authors: Aishik Nagar, Yutong Liu, Andy T. Liu, Viktor Schlegel, Vijay Prakash Dwivedi, Arun-Kumar Kaliya-Perumal, Guna Pratheep Kalanchiam, Yili Tang, Robby T. Tan

    Abstract: Medical abstractive summarization faces the challenge of balancing faithfulness and informativeness. Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness. While recent advancements in techniques like in-context learning (ICL) and fine-tuning have improved medical summarization, they often overlook crucial aspects such as fai… ▽ More

    Submitted 25 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 12 pages

  3. arXiv:2406.03699  [pdf, other

    cs.CL

    M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

    Authors: Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler

    Abstract: There is vivid research on adapting Large Language Models (LLMs) to perform a variety of tasks in high-stakes domains such as healthcare. Despite their popularity, there is a lack of understanding of the extent and contributing factors that allow LLMs to recall relevant knowledge and combine it with presented information in the clinical and biomedical domain: a fundamental pre-requisite for succes… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  4. arXiv:2312.13533  [pdf, other

    cs.CL

    Automated Clinical Coding for Outpatient Departments

    Authors: Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Tsung-Han Yang, Vijay Prakash Dwivedi, Wei-Hsian Yin, Jeng Wei, Stefan Winkler

    Abstract: Computerised clinical coding approaches aim to automate the process of assigning a set of codes to medical records. While there is active research pushing the state of the art on clinical coding for hospitalized patients, the outpatient setting -- where doctors tend to non-hospitalised patients -- is overlooked. Although both settings can be formalised as a multi-label classification task, they pr… ▽ More

    Submitted 24 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 9 pages, preprint under review

  5. arXiv:2312.11109  [pdf, other

    cs.LG

    Graph Transformers for Large Graphs

    Authors: Vijay Prakash Dwivedi, Yozen Liu, Anh Tuan Luu, Xavier Bresson, Neil Shah, Tong Zhao

    Abstract: Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the computational feasibility of the global attention mechanism is possible. The next goal is to scale up these architectures to handle very large graphs on the sc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  6. arXiv:2306.06493  [pdf, other

    cs.NE

    RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on Edge

    Authors: Adithya Krishna, Srikanth Rohit Nudurupati, Chandana D G, Pritesh Dwivedi, André van Schaik, Mahesh Mehendale, Chetan Singh Thakur

    Abstract: Deep Neural Network (DNN) based inference at the edge is challenging as these compute and data-intensive algorithms need to be implemented at low cost and low power while meeting the latency constraints of the target applications. Sparsity, in both activations and weights inherent to DNNs, is a key knob to leverage. In this paper, we present RAMAN, a Re-configurable and spArse tinyML Accelerator f… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

  7. arXiv:2305.15747  [pdf, other

    cs.LG

    Union Subgraph Neural Networks

    Authors: Jiaxing Xu, Aihu Zhang, Qingtian Bian, Vijay Prakash Dwivedi, Yiping Ke

    Abstract: Graph Neural Networks (GNNs) are widely used for graph representation learning in many application domains. The expressiveness of vanilla GNNs is upper-bounded by 1-dimensional Weisfeiler-Leman (1-WL) test as they operate on rooted subtrees through iterative message passing. In this paper, we empower GNNs by injecting neighbor-connectivity information extracted from a new type of substructure. We… ▽ More

    Submitted 9 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2304.11325  [pdf, ps, other

    cs.CC cs.SC math.AC

    Deterministic identity testing paradigms for bounded top-fanin depth-4 circuits

    Authors: Pranjal Dutta, Prateek Dwivedi, Nitin Saxena

    Abstract: Polynomial Identity Testing (PIT) is a fundamental computational problem. The famous depth-$4$ reduction result by Agrawal and Vinay (FOCS 2008) has made PIT for depth-$4$ circuits an enticing pursuit. A restricted depth-4 circuit computing a $n$-variate degree-$d$ polynomial of the form $\sum_{i = 1}^{k} \prod_{j} g_{ij}$, where $°g_{ij} \leq δ$ is called $Σ^{[k]}ΠΣΠ^{[δ]}$ circuit. On further re… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: A preliminary version appeared in 36th Computational Complexity Conference (CCC), 2021

    ACM Class: F.2.1

  9. arXiv:2209.06321  [pdf, other

    cs.CL cs.AI cs.HC

    Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

    Authors: Anna Gottardi, Osman Ipek, Giuseppe Castellucci, Shui Hu, Lavina Vaz, Yao Lu, Anju Khatri, Anjali Chadha, Desheng Zhang, Sattvik Sahai, Prerna Dwivedi, Hangjie Shi, Lucy Hu, Andy Huang, Luke Dai, Bofei Yang, Varun Somani, Pankaj Rajan, Ron Rezac, Michael Johnston, Savanna Stiff, Leslie Ball, David Carmel, Yang Liu, Dilek Hakkani-Tur , et al. (5 additional authors not shown)

    Abstract: Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as co… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 14 pages, Proceedings of Alexa Prize Taskbot (Alexa Prize 2021)

    ACM Class: I.2.7; J.0; H.5.1; H.5.2

  10. arXiv:2206.08164  [pdf, other

    cs.LG

    Long Range Graph Benchmark

    Authors: Vijay Prakash Dwivedi, Ladislav Rampášek, Mikhail Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, Dominique Beaini

    Abstract: Graph Neural Networks (GNNs) that are based on the message passing (MP) paradigm generally exchange information between 1-hop neighbors to build node representations at each layer. In principle, such networks are not able to capture long-range interactions (LRI) that may be desired or necessary for learning a given task on graphs. Recently, there has been an increasing interest in development of T… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Added reference to Tönshoff et al., 2023 in Sec. 4.1; NeurIPS 2022 Track on D&B; Open-sourced at: https://github.com/vijaydwivedi75/lrgb

  11. arXiv:2205.12454  [pdf, other

    cs.LG

    Recipe for a General, Powerful, Scalable Graph Transformer

    Authors: Ladislav Rampášek, Mikhail Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, Dominique Beaini

    Abstract: We propose a recipe on how to build a general, powerful, scalable (GPS) graph Transformer with linear complexity and state-of-the-art results on a diverse set of benchmarks. Graph Transformers (GTs) have gained popularity in the field of graph representation learning with a variety of recent publications but they lack a common foundation about what constitutes a good positional or structural encod… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NeurIPS 2022

  12. arXiv:2201.04563  [pdf, ps, other

    cs.SI cs.DS

    Inexact Graph Matching Using Centrality Measures

    Authors: Shri Prakash Dwivedi

    Abstract: Graph matching is the process of computing the similarity between two graphs. Depending on the requirement, it can be exact or inexact. Exact graph matching requires a strict correspondence between nodes of two graphs, whereas inexact matching allows some flexibility or tolerance during the graph matching. In this chapter, we describe an approximate inexact graph matching by reducing the size of t… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

    Comments: 12 pages, 8 figures

  13. arXiv:2110.07875  [pdf, other

    cs.LG

    Graph Neural Networks with Learnable Structural and Positional Representations

    Authors: Vijay Prakash Dwivedi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: Graph neural networks (GNNs) have become the standard learning architectures for graphs. GNNs have been applied to numerous domains ranging from quantum chemistry, recommender systems to knowledge graphs and natural language processing. A major issue with arbitrary graphs is the absence of canonical positional information of nodes, which decreases the representation power of GNNs to distinguish e.… ▽ More

    Submitted 10 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Code at https://github.com/vijaydwivedi75/gnn-lspe

    Journal ref: ICLR 2022 (https://openreview.net/pdf?id=wTTjnvGphYj)

  14. arXiv:2012.15279  [pdf, other

    cs.DS cs.CV

    Some Algorithms on Exact, Approximate and Error-Tolerant Graph Matching

    Authors: Shri Prakash Dwivedi

    Abstract: The graph is one of the most widely used mathematical structures in engineering and science because of its representational power and inherent ability to demonstrate the relationship between objects. The objective of this work is to introduce the novel graph matching techniques using the representational power of the graph and apply it to structural pattern recognition applications. We present an… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: Ph.D. Thesis, Indian Institute of Technology (BHU), Varanasi, July 2019. (Adviser: Dr. R.S. Singh)

  15. arXiv:2012.09699  [pdf, other

    cs.LG

    A Generalization of Transformer Networks to Graphs

    Authors: Vijay Prakash Dwivedi, Xavier Bresson

    Abstract: We propose a generalization of transformer neural network architecture for arbitrary graphs. The original transformer was designed for Natural Language Processing (NLP), which operates on fully connected graphs representing all connections between the words in a sequence. Such architecture does not leverage the graph connectivity inductive bias, and can perform poorly when the graph topology is im… ▽ More

    Submitted 24 January, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: AAAI 2021 Workshop on Deep Learning on Graphs: Methods and Applications (DLG-AAAI 2021); Code at https://github.com/graphdeeplearning/graphtransformer

  16. arXiv:2007.08004  [pdf, ps, other

    eess.AS cs.SD

    Data augmentation enhanced speaker enrollment for text-dependent speaker verification

    Authors: Achintya Kumar Sarkar, Himangshu Sarma, Priyanka Dwivedi, Zheng-Hua Tan

    Abstract: Data augmentation is commonly used for generating additional data from the available training data to achieve a robust estimation of the parameters of complex models like the one for speaker verification (SV), especially for under-resourced applications. SV involves training speaker-independent (SI) models and speaker-dependent models where speakers are represented by models derived from an SI mod… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Journal ref: Proc. of ICEPE 2020

  17. arXiv:2003.00982  [pdf, other

    cs.LG stat.ML

    Benchmarking Graph Neural Networks

    Authors: Vijay Prakash Dwivedi, Chaitanya K. Joshi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: In the last few years, graph neural networks (GNNs) have become the standard toolkit for analyzing and learning from data on graphs. This emerging field has witnessed an extensive growth of promising techniques that have been applied with success to computer science, mathematics, biology, physics and chemistry. But for any successful field to become mainstream and reliable, benchmarks must be deve… ▽ More

    Submitted 27 December, 2022; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Benchmarking framework on GitHub at https://github.com/graphdeeplearning/benchmarking-gnns

    Journal ref: Journal of Machine Learning Research (JMLR), 2022

  18. arXiv:1803.06555  [pdf, other

    cs.AI cs.IR

    Tell Me Why Is It So? Explaining Knowledge Graph Relationships by Finding Descriptive Support Passages

    Authors: Sumit Bhatia, Purusharth Dwivedi, Avneet Kaur

    Abstract: We address the problem of finding descriptive explanations of facts stored in a knowledge graph. This is important in high-risk domains such as healthcare, intelligence, etc. where users need additional information for decision making and is especially crucial for applications that rely on automatically constructed knowledge bases where machine learned systems extract facts from an input corpus an… ▽ More

    Submitted 17 March, 2018; originally announced March 2018.

    Comments: 12 pages

  19. Computing Multiplicative Order and Primitive Root in Finite Cyclic Group

    Authors: Shri Prakash Dwivedi

    Abstract: Multiplicative order of an element $a$ of group $G$ is the least positive integer $n$ such that $a^n=e$, where $e$ is the identity element of $G$. If the order of an element is equal to $|G|$, it is called generator or primitive root. This paper describes the algorithms for computing multiplicative order and primitive root in $\mathbb{Z}^*_{p}$, we also present a logarithmic improvement over class… ▽ More

    Submitted 21 August, 2014; originally announced August 2014.

    Comments: 8 pages

  20. GCD Computation of n Integers

    Authors: Shri Prakash Dwivedi

    Abstract: Greatest Common Divisor (GCD) computation is one of the most important operation of algorithmic number theory. In this paper we present the algorithms for GCD computation of $n$ integers. We extend the Euclid's algorithm and binary GCD algorithm to compute the GCD of more than two integers.

    Submitted 25 July, 2014; originally announced July 2014.

    Comments: RAECS 2014

  21. An Efficient Multiplication Algorithm Using Nikhilam Method

    Authors: Shri Prakash Dwivedi

    Abstract: Multiplication is one of the most important operation in computer arithmetic. Many integer operations such as squaring, division and computing reciprocal require same order of time as multiplication whereas some other operations such as computing GCD and residue operation require at most a factor of $\log n$ time more than multiplication. We propose an integer multiplication algorithm using Nikhil… ▽ More

    Submitted 10 July, 2013; originally announced July 2013.

    Comments: Extended version to appear in ITC 2013

  22. arXiv:1212.3502  [pdf, other

    cs.OS

    Adaptive Scheduling in Real-Time Systems Through Period Adjustment

    Authors: Shri Prakash Dwivedi

    Abstract: Real time system technology traditionally developed for safety critical systems, has now been extended to support multimedia systems and virtual reality. A large number of real-time application, related to multimedia and adaptive control system, require more flexibility than classical real-time theory usually permits. This paper proposes an efficient adaptive scheduling framework in real-time syst… ▽ More

    Submitted 14 December, 2012; originally announced December 2012.

    Comments: 8 pages, 5 figures