Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Agrawal, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12216  [pdf, other

    cs.IR

    Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation

    Authors: Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

    Abstract: Large Language Models (LLMs) are proficient at generating coherent and contextually relevant text but face challenges when addressing knowledge-intensive queries in domain-specific and factual question-answering tasks. Retrieval-augmented generation (RAG) systems mitigate this by incorporating external knowledge sources, such as structured knowledge graphs (KGs). However, LLMs often struggle to pr… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2404.13528  [pdf, other

    cs.LG cs.AI cs.DC

    SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

    Authors: Wei Niu, Md Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren

    Abstract: This work is motivated by recent developments in Deep Neural Networks, particularly the Transformer architectures underlying applications such as ChatGPT, and the need for performing inference on mobile devices. Focusing on emerging transformers (specifically the ones with computationally efficient Swin-like architectures) and large models (e.g., Stable Diffusion and LLMs) based on transformers, w… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  3. arXiv:2403.01152  [pdf, other

    cs.CL cs.AI

    A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

    Authors: Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

    Abstract: We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text. While these LLMs have revolutionized text generation across various domains, they also pose significant risks to the information ecosystem, such as the potential for generating convincing propaganda, misinformation, and disinformation at scale. This paper offers a review… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  4. arXiv:2403.00176  [pdf, other

    cs.LG cs.AI cs.PL

    SoD$^2$: Statically Optimizing Dynamic Deep Neural Network

    Authors: Wei Niu, Gagan Agrawal, Bin Ren

    Abstract: Though many compilation and runtime systems have been developed for DNNs in recent years, the focus has largely been on static DNNs. Dynamic DNNs, where tensor shapes and sizes and even the set of operators used are dependent upon the input and/or execution, are becoming common. This paper presents SoD$^2$, a comprehensive framework for optimizing Dynamic DNNs. The basis of our approach is a class… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  5. arXiv:2311.07914  [pdf, other

    cs.CL cs.LG

    Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

    Authors: Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

    Abstract: The contemporary LLMs are prone to producing hallucinations, stemming mainly from the knowledge gaps within the models. To address this critical limitation, researchers employ diverse strategies to augment the LLMs by incorporating external knowledge, aiming to reduce hallucinations and enhance reasoning accuracy. Among these strategies, leveraging knowledge graphs as a source of external informat… ▽ More

    Submitted 15 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted Paper in NAACL 2024

  6. arXiv:2308.03927  [pdf, other

    cs.CR

    ForensiBlock: A Provenance-Driven Blockchain Framework for Data Forensics and Auditability

    Authors: Asma Jodeiri Akbarfam, Mahdieh Heidaripour, Hoda Maleki, Gokila Dorai, Gagan Agrawal

    Abstract: Maintaining accurate provenance records is paramount in digital forensics, as they underpin evidence credibility and integrity, addressing essential aspects like accountability and reproducibility. Blockchains have several properties that can address these requirements. Previous systems utilized public blockchains, i.e., treated blockchain as a black box, and benefiting from the immutability prope… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2112.15530  [pdf, other

    cs.LG cs.SI

    Scalable Deep Graph Clustering with Random-walk based Self-supervised Learning

    Authors: Xiang Li, Dong Li, Ruoming Jin, Gagan Agrawal, Rajiv Ramnath

    Abstract: Web-based interactions can be frequently represented by an attributed graph, and node clustering in such graphs has received much attention lately. Multiple efforts have successfully applied Graph Convolutional Networks (GCN), though with some limits on accuracy as GCNs have been shown to suffer from over-smoothing issues. Though other methods (particularly those based on Laplacian Smoothing) have… ▽ More

    Submitted 17 January, 2023; v1 submitted 31 December, 2021; originally announced December 2021.

  8. arXiv:2109.02084  [pdf, other

    eess.IV cs.CV cs.LG

    (M)SLAe-Net: Multi-Scale Multi-Level Attention embedded Network for Retinal Vessel Segmentation

    Authors: Shreshth Saini, Geetika Agrawal

    Abstract: Segmentation plays a crucial role in diagnosis. Studying the retinal vasculatures from fundus images help identify early signs of many crucial illnesses such as diabetic retinopathy. Due to the varying shape, size, and patterns of retinal vessels, along with artefacts and noises in fundus images, no one-stage method can accurately segment retinal vessels. In this work, we propose a multi-scale, mu… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: 5 pages, 4 figures, Accepted and Presented in 9TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (IEEE-ICHI 2021), Victoria, British Columbia, Canada

  9. arXiv:2108.13342  [pdf, other

    cs.LG cs.AI

    DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion

    Authors: Wei Niu, Jiexiong Guan, Yanzhi Wang, Gagan Agrawal, Bin Ren

    Abstract: Deep Neural Networks (DNNs) have emerged as the core enabler of many major applications on mobile devices. To achieve high accuracy, DNN models have become increasingly deep with hundreds or even thousands of operator layers, leading to high memory and computational requirements for inference. Operator fusion (or kernel/layer fusion) is key optimization in many state-of-the-art DNN execution frame… ▽ More

    Submitted 30 November, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

  10. arXiv:2107.06419  [pdf, other

    cs.LG cs.AR

    FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks

    Authors: Sheng-Chun Kao, Suvinay Subramanian, Gaurav Agrawal, Amir Yazdanbakhsh, Tushar Krishna

    Abstract: Attention mechanisms, primarily designed to capture pairwise correlations between words, have become the backbone of machine learning, expanding beyond natural language processing into other domains. This growth in adaptation comes at the cost of prohibitively large memory requirements and computational complexity, especially at higher number of input elements. This limitation is due to inherently… ▽ More

    Submitted 23 September, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

  11. arXiv:2007.06134  [pdf, other

    cs.LG cs.DC stat.ML

    Adaptive Periodic Averaging: A Practical Approach to Reducing Communication in Distributed Learning

    Authors: Peng Jiang, Gagan Agrawal

    Abstract: Stochastic Gradient Descent (SGD) is the key learning algorithm for many machine learning tasks. Because of its computational costs, there is a growing interest in accelerating SGD on HPC resources like GPU clusters. However, the performance of parallel SGD is still bottlenecked by the high communication costs even with a fast connection among the machines. A simple approach to alleviating this pr… ▽ More

    Submitted 19 January, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

  12. arXiv:1910.12446  [pdf, other

    cs.SI cs.CL cs.IR

    Towards Successful Social Media Advertising: Predicting the Influence of Commercial Tweets

    Authors: Renhao Cui, Gagan Agrawal, Rajiv Ramnath

    Abstract: Businesses communicate using Twitter for a variety of reasons -- to raise awareness of their brands, to market new products, to respond to community comments, and to connect with their customers and potential customers in a targeted manner. For businesses to do this effectively, they need to understand which content and structural elements about a tweet make it influential, that is, widely liked,… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

  13. arXiv:1908.02551  [pdf, ps, other

    cs.SI cs.AI cs.CL

    Tweets Can Tell: Activity Recognition using Hybrid Long Short-Term Memory Model

    Authors: Renhao Cui, Gagan Agrawal, Rajiv Ramnath

    Abstract: This paper presents techniques to detect the "offline" activity a person is engaged in when she is tweeting (such as dining, shopping or entertainment), in order to create a dynamic profile of the user, for uses such as better targeting of advertisements. To this end, we propose a hybrid LSTM model for rich contextual learning, along with studies on the effects of applying and combining multiple L… ▽ More

    Submitted 9 July, 2019; originally announced August 2019.

  14. arXiv:1906.12018  [pdf, ps, other

    cs.DB cs.DC

    Pruned Landmark Labeling Meets Vertex Centric Computation: A Surprisingly Happy Marriage!

    Authors: Ruoming Jin, Zhen Peng, Wendell Wu, Feodor Dragan, Gagan Agrawal, Bin Ren

    Abstract: In this paper, we study how the Pruned Landmark Labeling (PPL) algorithm can be parallelized in a scalable fashion, producing the same results as the sequential algorithm. More specifically, we parallelize using a Vertex-Centric (VC) computational model on a modern SIMD powered multicore architecture. We design a new VC-PLL algorithm that resolves the apparent mismatch between the inherent sequent… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  15. arXiv:1704.04760  [pdf

    cs.AR cs.LG cs.NE

    In-Datacenter Performance Analysis of a Tensor Processing Unit

    Authors: Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg , et al. (50 additional authors not shown)

    Abstract: Many architects believe that major improvements in cost-energy-performance must now come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor Processing Unit (TPU)---deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN). The heart of the TPU is a 65,536 8-bit MAC matrix multiply unit that offers a peak throughput of 92 TeraOp… ▽ More

    Submitted 16 April, 2017; originally announced April 2017.

    Comments: 17 pages, 11 figures, 8 tables. To appear at the 44th International Symposium on Computer Architecture (ISCA), Toronto, Canada, June 24-28, 2017

  16. arXiv:1610.05116  [pdf, ps, other

    cs.DC

    Fault Tolerant Frequent Pattern Mining

    Authors: Sameh Shohdy, Abhinav Vishnu, Gagan Agrawal

    Abstract: FP-Growth algorithm is a Frequent Pattern Min- ing (FPM) algorithm that has been extensively used to study correlations and patterns in large scale datasets. While several researchers have designed distributed memory FP-Growth algorithms, it is pivotal to consider fault tolerant FP-Growth, which can address the increasing fault rates in large scale systems. In this work, we propose a novel paralle… ▽ More

    Submitted 17 October, 2016; originally announced October 2016.

    Comments: 10 Pages, High Performance Computing Conference (HIPC 2016)

  17. arXiv:1602.06460  [pdf, other

    cs.MA

    Wheeled Robots playing Chain Catch: Strategies and Evaluation

    Authors: Garima Agrawal, Kamalakar Karlapalem

    Abstract: Robots playing games that humans are adept in is a challenge. We studied robotic agents playing Chain Catch game as a Multi-Agent System (MAS). Our game starts with a traditional Catch game similar to Pursuit evasion, and further extends it to form a growing chain of predator agents to chase remaining preys. Hence Chain Catch is a combination of two challenges - pursuit domain and robotic chain fo… ▽ More

    Submitted 20 February, 2016; originally announced February 2016.