Showing 1–50 of 63 results for author: Agrawal, D

  1. arXiv:2408.11239  [pdf, other]

    cs.LG cs.AI cs.CL cs.IT cs.NE

    A Little Confidence Goes a Long Way

    Authors: John Scoville, Shang Gao, Devanshu Agrawal, Javed Qadrud-Din

    Abstract: We introduce a group of related methods for binary classification tasks using probes of the hidden state activations in large language models (LLMs). Performance is on par with the largest and most advanced LLMs currently available, but requiring orders of magnitude fewer computational resources and not requiring labeled data. This approach involves translating class labels into a semantically ric…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 13 pages, 2 figures

  2. arXiv:2408.00884  [pdf, other]

    cs.DB cs.CL

    Hybrid Querying Over Relational Databases and Large Language Models

    Authors: Fuheng Zhao, Divyakant Agrawal, Amr El Abbadi

    Abstract: Database queries traditionally operate under the closed-world assumption, providing no answers to questions that require information beyond the data stored in the database. Hybrid querying using SQL offers an alternative by integrating relational databases with large language models (LLMs) to answer beyond-database questions. In this paper, we present the first cross-domain benchmark, SWAN, contai…

    Submitted 1 August, 2024; originally announced August 2024.

  3. arXiv:2407.05467  [pdf, other]

    cs.DC cs.AI

    The infrastructure powering IBM's Gen AI model development

    Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla, Lan Hoang, Danny Barnett, I-Hsin Chung, Apoorve Mohan, Ming-Hung Chen, Lixiang Luo, Robert Walkup, Constantinos Evangelinos, Shweta Salaria, Marc Dombrowa, Yoonho Park, Apo Kayi, Liran Schour, Alim Alim, Ali Sydney, Pavlos Maniotis, Laurent Schares, Bernard Metzler, Bengi Karacali-Akyamac, Sophia Wen, Tatsuhiro Chiba, et al. (121 additional authors not shown)

    Abstract: AI Infrastructure plays a key role in the speed and cost-competitiveness of developing and deploying advanced AI models. The current demand for powerful AI infrastructure for model training is driven by the emergence of generative AI and foundational models, where on occasion thousands of GPUs must cooperate on a single training job for the model to be trained in a reasonable time. Delivering effi…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Corresponding Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla

  4. A Constructive Method for Designing Safe Multirate Controllers for Differentially-Flat Systems

    Authors: Devansh R. Agrawal, Hardik Parwana, Ryan K. Cosner, Ugo Rosolia, Aaron D. Ames, Dimitra Panagou

    Abstract: We present a multi-rate control architecture that leverages fundamental properties of differential flatness to synthesize controllers for safety-critical nonlinear dynamical systems. We propose a two-layer architecture, where the high-level generates reference trajectories using a linear Model Predictive Controller, and the low-level tracks this reference using a feedback controller. The novelty l…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, accepted at IEEE Control Systems Letters 2021

    Journal ref: IEEE Control Systems Letters, Vol 6, Page 2138--2143, 2021

  5. arXiv:2403.17917  [pdf, ps, other]

    eess.SY cs.RO

    Multi-Agent Clarity-Aware Dynamic Coverage with Gaussian Processes

    Authors: Devansh R. Agrawal, Dimitra Panagou

    Abstract: This paper presents two algorithms for multi-agent dynamic coverage in spatiotemporal environments, where the coverage algorithms are informed by the method of data assimilation. In particular, we show that by considering the information assimilation algorithm, here a Numerical Gaussian Process Kalman Filter, the influence of measurements taken at one position on the uncertainty of the estimate at…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 8 pages, 2 figures, submitted to CDC 2024

  6. arXiv:2403.05004  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Can't Remember Details in Long Documents? You Need Some R&R

    Authors: Devanshu Agrawal, Shang Gao, Martin Gajek

    Abstract: Long-context large language models (LLMs) hold promise for tasks such as question-answering (QA) over long documents, but they tend to miss important information in the middle of context documents (arXiv:2307.03172v3). Here, we introduce $\textit{R&R}$ -- a combination of two novel prompt-based methods called $\textit{reprompting}$ and $\textit{in-context retrieval}$ (ICR) -- to alleviate this eff…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages, 1 figure, 9 tables. For associated code repository see https://github.com/casetext/r-and-r

  7. arXiv:2402.12881  [pdf, other]

    cs.CL

    TEXT2AFFORD: Probing Object Affordance Prediction abilities of Language Models solely from Text

    Authors: Sayantan Adak, Daivik Agrawal, Animesh Mukherjee, Somak Aditya

    Abstract: We investigate the knowledge of object affordances in pre-trained language models (LMs) and pre-trained Vision-Language models (VLMs). A growing body of literature shows that PTLMs fail inconsistently and non-intuitively, demonstrating a lack of reasoning and grounding. To take a first step toward quantifying the effect of grounding (or lack thereof), we curate a novel and comprehensive dataset of…

    Submitted 23 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  8. arXiv:2402.05254  [pdf, other]

    cs.RO eess.SY

    Online and Certifiably Correct Visual Odometry and Mapping

    Authors: Devansh R Agrawal, Rajiv Govindjee, Jiangbo Yu, Anurekha Ravikumar, Dimitra Panagou

    Abstract: This paper proposes two new algorithms for certified perception in safety-critical robotic applications. The first is a Certified Visual Odometry algorithm, which uses a RGBD camera with bounded sensor noise to construct a visual odometry estimate with provable error bounds. The second is a Certified Mapping algorithm which, using the same RGBD images, constructs a Signed Distance Field of the obs…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures

  9. arXiv:2312.10321  [pdf, other]

    cs.DB cs.CL

    LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?

    Authors: Fuheng Zhao, Lawrence Lim, Ishtiyaque Ahmad, Divyakant Agrawal, Amr El Abbadi

    Abstract: Judging the equivalence between two SQL queries is a fundamental problem with many practical applications in data management and SQL generation (i.e., evaluating the quality of generated SQL queries in text-to-SQL task). While the research community has reasoned about SQL equivalence for decades, it poses considerable difficulties and no complete solutions exist. Recently, Large Language Models (L…

    Submitted 19 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  10. arXiv:2311.09564  [pdf, other]

    cs.CL cs.AI

    LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

    Authors: Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral

    Abstract: Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been systematically explored. Assessing these models on long sequences is crucial since prior work in the general domain has demonstrated performance degradation of LLMs on longer texts. Motivated by this,…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages

  11. arXiv:2310.06933  [pdf, other]

    cs.RO

    Eclares: Energy-Aware Clarity-Driven Ergodic Search

    Authors: Kaleb Ben Naveed, Devansh Agrawal, Christopher Vermillion, Dimitra Panagou

    Abstract: Planning informative trajectories while considering the spatial distribution of the information over the environment, as well as constraints such as the robot's limited battery capacity, makes the long-time horizon persistent coverage problem complex. Ergodic search methods consider the spatial distribution of environmental information while optimizing robot trajectories; however, current methods…

    Submitted 12 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Submitted to International Conference of Robotics and Automation (ICRA) 2024

  12. arXiv:2309.12623  [pdf, other]

    cs.DB cs.DS

    The SpaceSaving$\pm$ Family of Algorithms for Data Streams with Bounded Deletions

    Authors: Fuheng Zhao, Divyakant Agrawal, Amr El Abbadi, Claire Mathieu, Ahmed Metwally, Michel de Rougemont

    Abstract: In this paper, we present an advanced analysis of near optimal algorithms that use limited space to solve the frequency estimation, heavy hitters, frequent items, and top-k approximation in the bounded deletion model. We define the family of SpaceSaving$\pm$ algorithms and explain why the original SpaceSaving$\pm$ algorithm only works when insertions and deletions are not interleaved. Next, we pro…

    Submitted 8 August, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

  13. arXiv:2308.12560  [pdf, other]

    cs.CV

    NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

    Authors: Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai

    Abstract: We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene. Compared to prior work, our framework significantly reduces blending artifacts when inserting multiple dynamic objects into a 3D scene at novel views and times; achieves comparable PSNR without the need for additional ground truth modalities like optical flow…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in ICCV Computer Vision for Metaverse Workshop 2023 (code is available at https://github.com/dakshitagrawal/NoVA)

  14. arXiv:2305.05074  [pdf, other]

    cs.DB cs.DS cs.IR

    Autumn: A Scalable Read Optimized LSM-tree based Key-Value Stores with Fast Point and Range Read Speed

    Authors: Fuheng Zhao, Zach Miller, Leron Reznikov, Divyakant Agrawal, Amr El Abbadi

    Abstract: The Log Structured Merge Trees (LSM-tree) based key-value stores are widely used in many storage systems to support a variety of operations such as updates, point reads, and range reads. Traditionally, LSM-tree's merge policy organizes data into multiple levels of exponentially increasing capacity to support high-speed writes. However, we contend that the traditional merge policies are not optimiz…

    Submitted 30 June, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

  15. arXiv:2304.02578  [pdf, ps, other]

    cs.RO

    Sensor-based Planning and Control for Robotic Systems: Introducing Clarity and Perceivability

    Authors: Devansh R Agrawal, Dimitra Panagou

    Abstract: We introduce an information measure, termed clarity, motivated by information entropy, and show that it has intuitive properties relevant to dynamic coverage control and informative path planning. Clarity defines the quality of the information we have about a variable of interest in an environment on a scale of [0, 1], and has useful properties for control and planning such as: (I) clarity lower b…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 6 pages, 4 figures. Submitted for review at LCSS/CDC 2023

  16. arXiv:2303.04614  [pdf, other]

    cs.LG stat.ML

    Densely Connected $G$-invariant Deep Neural Networks with Signed Permutation Representations

    Authors: Devanshu Agrawal, James Ostrowski

    Abstract: We introduce and investigate, for finite groups $G$, $G$-invariant deep neural network ($G$-DNN) architectures with ReLU activation that are densely connected, i.e., include all possible skip connections. In contrast to other $G$-invariant architectures in the literature, the preactivations of the $G$-DNNs presented here are able to transform by \emph{signed} permutation representations (signed pe…

    Submitted 17 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 40 pages, 2 figures, 4 tables. For associated code repository see https://github.com/dagrawa2/gdnn_code

  17. Safe and Robust Observer-Controller Synthesis using Control Barrier Functions

    Authors: Devansh R. Agrawal, Dimitra Panagou

    Abstract: This paper addresses the synthesis of safety-critical controllers using estimate feedback. We propose an observer-controller interconnection to ensure that the nonlinear system remains safe despite bounded disturbances on the system dynamics and measurements that correspond to partial state information. The co-design of observers and controllers is critical, since even in undisturbed cases, observ…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures. Accepted at LCSS, CDC 2023

    Journal ref: IEEE Control Systems Letters 7 (2022): 127-132

  18. gatekeeper: Online Safety Verification and Control for Nonlinear Systems in Dynamic Environments

    Authors: Devansh R Agrawal, Ruichang Chen, Dimitra Panagou

    Abstract: This paper presents the gatekeeper algorithm, a real-time and computationally-lightweight method that ensures that trajectories of a nonlinear system satisfy safety constraints despite sensing limitations. gatekeeper integrates with existing path planners and feedback controllers by introducing an additional verification step to ensure that proposed trajectories can be executed safely, despite non…

    Submitted 14 August, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted at IEEE T-RO 2024. Accepted at IROS 2023. 17 pages, 10 figures

  19. arXiv:2209.03076  [pdf]

    cs.CV stat.ML

    Plant Species Classification Using Transfer Learning by Pretrained Classifier VGG-19

    Authors: Thiru Siddharth, Bhupendra Singh Kirar, Dheeraj Kumar Agrawal

    Abstract: Deep learning is currently the most important branch of machine learning, with applications in speech recognition, computer vision, image classification, and medical imaging analysis. Plant recognition is one of the areas where image classification can be used to identify plant species through their leaves. Botanists devote a significant amount of time to recognizing plant species by personally in…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: Under review process in 'IETE Journal of Research'

  20. arXiv:2205.11388  [pdf, other]

    cs.CL cs.LG

    StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models

    Authors: Adam Liška, Tomáš Kočiský, Elena Gribovskaya, Tayfun Terzi, Eren Sezener, Devang Agrawal, Cyprien de Masson d'Autume, Tim Scholtes, Manzil Zaheer, Susannah Young, Ellen Gilsenan-McMahon, Sophia Austin, Phil Blunsom, Angeliki Lazaridou

    Abstract: Knowledge and language understanding of models evaluated through question answering (QA) has been usually studied on static snapshots of knowledge, like Wikipedia. However, our world is dynamic, evolves over time, and our models' knowledge becomes outdated. To study how semi-parametric QA models and their underlying parametric language models (LMs) adapt to evolving knowledge, we construct a new l…

    Submitted 23 May, 2022; originally announced May 2022.

  21. arXiv:2205.09873  [pdf, other]

    cs.DS cs.DB cs.NI

    Differentially Private Linear Sketches: Efficient Implementations and Applications

    Authors: Fuheng Zhao, Dan Qiao, Rachel Redberg, Divyakant Agrawal, Amr El Abbadi, Yu-Xiang Wang

    Abstract: Linear sketches have been widely adopted to process fast data streams, and they can be used to accurately answer frequency estimation, approximate top K items, and summarize data distributions. When data are sensitive, it is desirable to provide privacy guarantees for linear sketches to preserve private information while delivering useful results with theoretical bounds. We show that linear sketch…

    Submitted 14 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  22. arXiv:2205.09219  [pdf, other]

    cs.LG stat.ML

    A Classification of $G$-invariant Shallow Neural Networks

    Authors: Devanshu Agrawal, James Ostrowski

    Abstract: When trying to fit a deep neural network (DNN) to a $G$-invariant target function with $G$ a group, it only makes sense to constrain the DNN to be $G$-invariant as well. However, there can be many different ways to do this, thus raising the problem of "$G$-invariant neural architecture design": What is the optimal $G$-invariant architecture for a given problem? Before we can consider the optimiz…

    Submitted 7 January, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 32 pages, 8 figures; accepted at NeurIPS 2022

  23. arXiv:2205.04534  [pdf, other]

    cs.DC

    The Bedrock of Byzantine Fault Tolerance: A Unified Platform for BFT Protocol Design and Implementation

    Authors: Mohammad Javad Amiri, Chenyuan Wu, Divyakant Agrawal, Amr El Abbadi, Boon Thau Loo, Mohammad Sadoghi

    Abstract: Byzantine Fault-Tolerant (BFT) protocols have recently been extensively used by decentralized data management systems with non-trustworthy infrastructures, e.g., permissioned blockchains. BFT protocols cover a broad spectrum of design dimensions from infrastructure settings such as the communication topology, to more technical features such as commitment strategy and even fundamental social choice…

    Submitted 3 August, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

  24. SpaceSaving$^\pm$: An Optimal Algorithm for Frequency Estimation and Frequent items in the Bounded Deletion Model

    Authors: Fuheng Zhao, Divyakant Agrawal, Amr El Abbadi, Ahmed Metwally

    Abstract: In this paper, we propose the first deterministic algorithms to solve the frequency estimation and frequent item problems in the bounded deletion model. We establish the space lower bound for solving the deterministic frequent items problem in the bounded deletion model, and propose the Lazy SpaceSaving$^\pm$ and SpaceSaving$^\pm$ algorithms with optimal space bound. We then develop an efficient i…

    Submitted 8 December, 2021; v1 submitted 6 December, 2021; originally announced December 2021.

  25. arXiv:2111.13310  [pdf, other]

    cs.HC

    Evaluating Trust in the Context of Conversational Information Systems for new users of the Internet

    Authors: Anurag Aribandi, Divyanshu Agrawal, Dipanjan Chakraborty

    Abstract: Most online information sources are text-based and in Western Languages like English. However, many new and first time users of the Internet are in contexts with low English proficiency and are unable to access vital information online. Several researchers have focused on building conversational information systems over voice for this demographic, and also highlighted the importance of building tr…

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: 8 pages, 2 figures

  26. arXiv:2110.08403  [pdf, other]

    cs.SE

    Nalanda: A Socio-Technical Graph for Building Software Analytics Tools at Enterprise Scale

    Authors: Chandra Maddila, Suhas Shanbhogue, Apoorva Agrawal, Thomas Zimmermann, Chetan Bansal, Nicole Forsgren, Divyanshu Agrawal, Kim Herzig, Arie van Deursen

    Abstract: Software development is information-dense knowledge work that requires collaboration with other developers and awareness of artifacts such as work items, pull requests, and files. With the speed of development increasing, information overload is a challenge for people developing and maintaining these systems. Finding information and people is difficult for software engineers, especially when they…

    Submitted 19 September, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  27. arXiv:2108.11573  [pdf, other]

    eess.IV cs.CV

    NeighCNN: A CNN based SAR Speckle Reduction using Feature preserving Loss Function

    Authors: Praveen Ravirathinam, Darshan Agrawal, J. Jennifer Ranjani

    Abstract: Coherent imaging systems like synthetic aperture radar are susceptible to multiplicative noise that makes applications like automatic target recognition challenging. In this paper, NeighCNN, a deep learning-based speckle reduction algorithm that handles multiplicative noise with relatively simple convolutional neural network architecture, is proposed. We have designed a loss function which is an u…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 5 pages

  28. arXiv:2107.10836  [pdf, other]

    cs.DB

    Qanaat: A Scalable Multi-Enterprise Permissioned Blockchain System with Confidentiality Guarantees

    Authors: Mohammad Javad Amiri, Boon Thau Loo, Divyakant Agrawal, Amr El Abbadi

    Abstract: Today's large-scale data management systems need to address distributed applications' confidentiality and scalability requirements among a set of collaborative enterprises. This paper presents Qanaat, a scalable multi-enterprise permissioned blockchain system that guarantees the confidentiality of enterprises in collaboration workflows. Qanaat presents data collections that enable any subset of en…

    Submitted 17 July, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

    Journal ref: Proceedings of the VLDB Endowment 15, no. 11 (2022)

  29. arXiv:2105.11132  [pdf, other]

    cs.AI

    Partition Function Estimation: A Quantitative Study

    Authors: Durgesh Agrawal, Yash Pote, Kuldeep S Meel

    Abstract: Probabilistic graphical models have emerged as a powerful modeling tool for several real-world scenarios where one needs to reason under uncertainty. A graphical model's partition function is a central quantity of interest, and its computation is key to several probabilistic reasoning tasks. Given the #P-hardness of computing the partition function, several techniques have been proposed over the y…

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: 10 pages, 3 figures, 2 tables, to be published in IJCAI-21

  30. arXiv:2102.01951  [pdf, other]

    cs.CL cs.AI

    Mind the Gap: Assessing Temporal Generalization in Neural Language Models

    Authors: Angeliki Lazaridou, Adhiguna Kuncoro, Elena Gribovskaya, Devang Agrawal, Adam Liska, Tayfun Terzi, Mai Gimenez, Cyprien de Masson d'Autume, Tomas Kocisky, Sebastian Ruder, Dani Yogatama, Kris Cao, Susannah Young, Phil Blunsom

    Abstract: Our world is open-ended, non-stationary, and constantly evolving; thus what we talk about and how we talk about it change over time. This inherent dynamic nature of language contrasts with the current static language modelling paradigm, which trains and evaluates models on utterances from overlapping time periods. Despite impressive recent progress, we demonstrate that Transformer-XL language mode…

    Submitted 26 October, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: To appear as a Spotlight at NeurIPS 2021

  31. arXiv:2010.12770  [pdf, other]

    cs.CL

    Conversational Semantic Parsing for Dialog State Tracking

    Authors: Jianpeng Cheng, Devang Agrawal, Hector Martinez Alonso, Shruti Bhargava, Joris Driesen, Federico Flego, Shaona Ghosh, Dain Kaplan, Dimitri Kartsaklis, Lin Li, Dhivya Piraviperumal, Jason D Williams, Hong Yu, Diarmuid O Seaghdha, Anders Johannsen

    Abstract: We consider a new perspective on dialog state tracking (DST), the task of estimating a user's goal through the course of a dialog. By formulating DST as a semantic parsing task over hierarchical representations, we can incorporate semantic compositionality, cross-domain knowledge sharing and co-reference. We present TreeDST, a dataset of 27k conversations annotated with tree-structured dialog stat…

    Submitted 13 May, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper at EMNLP 2020

  32. arXiv:2006.08067  [pdf, ps, other]

    cs.DC cs.DB

    CoT: Decentralized Elastic Caches for Cloud Environments

    Authors: Victor Zakhary, Lawrence Lim, Divyakant Agrawal, Amr El Abbadi

    Abstract: Distributed caches are widely deployed to serve social networks and web applications at billion-user scales. This paper presents Cache-on-Track (CoT), a decentralized, elastic, and predictive caching framework for cloud environments. CoT proposes a new cache replacement policy specifically tailored for small front-end caches that serve skewed workloads. Front-end servers use a heavy hitter trackin…

    Submitted 18 June, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

  33. arXiv:2005.01038  [pdf, other]

    cs.DB cs.CR cs.DC

    SEPAR: Towards Regulating Future of Work Multi-Platform Crowdworking Environments with Privacy Guarantees

    Authors: Mohammad Javad Amiri, Joris Duguépéroux, Tristan Allard, Divyakant Agrawal, Amr El Abbadi

    Abstract: Crowdworking platforms provide the opportunity for diverse workers to execute tasks for different requesters. The popularity of the "gig" economy has given rise to independent platforms that provide competing and complementary services. Workers as well as requesters with specific tasks may need to work for or avail from the services of multiple platforms resulting in the rise of multi-platform cro…

    Submitted 21 October, 2020; v1 submitted 3 May, 2020; originally announced May 2020.

  34. arXiv:2001.06933  [pdf, other]

    cs.DB cs.CR

    Fides: Managing Data on Untrusted Infrastructure

    Authors: Sujaya Maiyya, Danny Hyun Bum Cho, Divyakant Agrawal, Amr El Abbadi

    Abstract: Significant amounts of data are currently being stored and managed on third-party servers. It is impractical for many small scale enterprises to own their private datacenters, hence renting third-party servers is a viable solution for such businesses. But the increasing number of malicious attacks, both internal and external, as well as buggy software on third-party servers is causing clients to l…

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 14 pages, 15 figures/graphs

  35. arXiv:2001.00921  [pdf, other]

    stat.ML cs.LG

    Wide Neural Networks with Bottlenecks are Deep Gaussian Processes

    Authors: Devanshu Agrawal, Theodore Papamarkou, Jacob Hinkle

    Abstract: There has recently been much work on the "wide limit" of neural networks, where Bayesian neural networks (BNNs) are shown to converge to a Gaussian process (GP) as all hidden layers are sent to infinite width. However, these results do not apply to architectures that require one or more of the hidden layers to remain narrow. In this paper, we consider the wide limit of BNNs where some hidden layer…

    Submitted 6 July, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

  36. arXiv:1912.04820  [pdf, other]

    cs.DB

    ChainifyDB: How to Blockchainify any Data Management System

    Authors: Felix Martin Schuhknecht, Ankur Sharma, Jens Dittrich, Divya Agrawal

    Abstract: Today's permissioned blockchain systems come in a stand-alone fashion and require the users to integrate yet another full-fledged transaction processing system into their already complex data management landscape. This seems odd as blockchains and traditional DBMSs share large parts of their processing stack. Thus, rather than replacing the established data systems altogether, we advocate to simpl…

    Submitted 10 December, 2019; originally announced December 2019.

  37. arXiv:1910.00765  [pdf, other]

    cs.DB cs.DC

    SharPer: Sharding Permissioned Blockchains Over Network Clusters

    Authors: Mohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi

    Abstract: Scalability is one of the main roadblocks to business adoption of blockchain systems. Despite recent intensive research on using sharding techniques to enhance the scalability of blockchain systems, existing solutions do not efficiently address cross-shard transactions. In this paper, we introduce SharPer, a permissioned blockchain system that improves scalability by clustering (partitioning) the…

    Submitted 16 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  38. arXiv:1909.06494  [pdf, ps, other]

    cs.DB cs.DC

    Transactional Smart Contracts in Blockchain Systems

    Authors: Victor Zakhary, Divyakant Agrawal, Amr El Abbadi

    Abstract: This paper presents TXSC, a framework that provides smart contract developers with transaction primitives. These primitives allow developers to write smart contracts without the need to reason about the anomalies that can arise due to concurrent smart contract function executions.

    Submitted 13 September, 2019; originally announced September 2019.

  39. arXiv:1908.03919  [pdf, other]

    cs.CV cs.LG

    GAN-Tree: An Incrementally Learned Hierarchical Generative Framework for Multi-Modal Data Distributions

    Authors: Jogendra Nath Kundu, Maharshi Gor, Dakshit Agrawal, R. Venkatesh Babu

    Abstract: Despite the remarkable success of generative adversarial networks, their performance seems less impressive for diverse training sets, requiring learning of discontinuous mapping functions. Though multi-mode prior or multi-generator models have been proposed to alleviate this problem, such approaches may fail depending on the empirically chosen initial mode components. In contrast to such bottom-up…

    Submitted 16 September, 2019; v1 submitted 11 August, 2019; originally announced August 2019.

    Comments: ICCV 2019 (code available at https://github.com/val-iisc/GANTree)

  40. arXiv:1906.07850  [pdf, other]

    cs.DC cs.DB

    SeeMoRe: A Fault-Tolerant Protocol for Hybrid Cloud Environments

    Authors: Mohammad Javad Amiri, Sujaya Maiyya, Divyakant Agrawal, Amr El Abbadi

    Abstract: Large scale data management systems utilize State Machine Replication to provide fault tolerance and to enhance performance. Fault-tolerant protocols are extensively used in the distributed database infrastructure of large enterprises such as Google, Amazon, and Facebook, as well as permissioned blockchain systems like IBM's Hyperledger Fabric. However, and in spite of years of intensive research,…

    Submitted 18 June, 2019; originally announced June 2019.

  41. arXiv:1905.09359  [pdf, other]

    cs.DB cs.CR cs.DC

    Towards Global Asset Management in Blockchain Systems

    Authors: Victor Zakhary, Mohammad Javad Amiri, Sujaya Maiyya, Divyakant Agrawal, Amr El Abbadi

    Abstract: Permissionless blockchains (e.g., Bitcoin, Ethereum, etc.) have shown a wide success in implementing global scale peer-to-peer cryptocurrency systems. In such blockchains, new currency units are generated through the mining process and are used in addition to transaction fees to incentivize miners to maintain the blockchain. Although it is clear how currency units are generated and transacted on, i…

    Submitted 22 May, 2019; originally announced May 2019.

  42. arXiv:1905.02847  [pdf, other]

    cs.DB cs.CR cs.DC

    Atomic Commitment Across Blockchains

    Authors: Victor Zakhary, Divyakant Agrawal, Amr El Abbadi

    Abstract: The recent adoption of blockchain technologies and open permissionless networks suggests the importance of peer-to-peer atomic cross-chain transaction protocols. Users should be able to atomically exchange tokens and assets without depending on centralized intermediaries such as exchanges. Recent peer-to-peer atomic cross-chain swap protocols use hashlocks and timelocks to ensure that participants…

    Submitted 18 June, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

  43. arXiv:1902.01457  [pdf, other]

    cs.DC cs.DB

    ParBlockchain: Leveraging Transaction Parallelism in Permissioned Blockchain Systems

    Authors: Mohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi

    Abstract: Many existing blockchains do not adequately address all the characteristics of distributed system applications and suffer from serious architectural limitations resulting in performance and confidentiality issues. While recent permissioned blockchain systems have tried to overcome these limitations, their focus has mainly been on workloads with no contention, i.e., no conflicting transactions. In…

    Submitted 4 February, 2019; originally announced February 2019.

  44. arXiv:1901.10133  [pdf, ps, other]

    cs.IR cs.CL

    Structuring an unordered text document

    Authors: Shashank Yadav, Tejas Shimpi, C. Ravindranath Chowdary, Prashant Sharma, Deepansh Agrawal, Shivang Agarwal

    Abstract: Segmenting an unordered text document into different sections is a very useful task in many text processing applications like multiple document summarization, question answering, etc. This paper proposes structuring of an unordered text document based on the keywords in the document. We test our approach on Wikipedia documents using both statistical and predictive methods such as the TextRank algo…

    Submitted 29 January, 2019; originally announced January 2019.

  45. arXiv:1810.13177  [pdf, other]

    cs.DC

    How to Databasify a Blockchain: the Case of Hyperledger Fabric

    Authors: Ankur Sharma, Felix Martin Schuhknecht, Divya Agrawal, Jens Dittrich

    Abstract: Within the last few years, countless blockchain systems have emerged on the market, each one claiming to revolutionize distributed transaction processing in one way or another. Many blockchain features, such as Byzantine fault tolerance (BFT), are indeed valuable additions in modern environments. However, despite all the hype around the technology, many of the challenges t…

    Submitted 31 October, 2018; originally announced October 2018.

  46. arXiv:1807.03380  [pdf, other]

    cs.CV

    An Attention Model for group-level emotion recognition

    Authors: Aarush Gupta, Dakshit Agrawal, Hardik Chauhan, Jose Dolz, Marco Pedersoli

    Abstract: In this paper we propose a new approach for classifying the global emotion of images containing groups of people. To achieve this task, we consider two different and complementary sources of information: (i) a global representation of the entire image and (ii) a local representation where only faces are considered. While the global representation of the image is learned with a convolutional neural netw…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: 5 pages, 3 figures, 2 tables

  47. A Cost-based Optimizer for Gradient Descent Optimization

    Authors: Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, Saravanan Thirumuruganathan, Sanjay Chawla, Divy Agrawal

    Abstract: As the use of machine learning (ML) permeates into diverse application domains, there is an urgent need to support a declarative framework for ML. Ideally, a user will specify an ML task in a high-level and easy-to-use language and the framework will invoke the appropriate algorithms and system configurations to execute it. An important observation towards designing such a framework is that many M…

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: Accepted at SIGMOD 2017

  48. arXiv:1508.00703  [pdf, other]

    cs.DB cs.LG

    Parameter Database: Data-centric Synchronization for Scalable Machine Learning

    Authors: Naman Goel, Divyakant Agrawal, Sanjay Chawla, Ahmed Elmagarmid

    Abstract: We propose a new data-centric synchronization framework for carrying out machine learning (ML) tasks in a distributed environment. Our framework exploits the iterative nature of ML algorithms and relaxes the application-agnostic bulk synchronization parallel (BSP) paradigm that has previously been used for distributed machine learning. Data-centric synchronization complements function-centric s…

    Submitted 4 August, 2015; originally announced August 2015.

    Report number: QCRI-TR-2015-003

  49. arXiv:1503.03653  [pdf, other]

    cs.DB

    Adaptive Logging for Distributed In-memory Databases

    Authors: Chang Yao, Divyakant Agrawal, Gang Chen, Beng Chin Ooi, Sai Wu

    Abstract: A new type of log, the command log, is being employed to replace the traditional data log (e.g., ARIES log) in in-memory databases. Instead of recording how the tuples are updated, a command log only tracks the transactions being executed, thereby effectively reducing the size of the log and improving the performance. Command logging, on the other hand, increases the cost of recovery, because a…

    Submitted 27 April, 2015; v1 submitted 12 March, 2015; originally announced March 2015.

    Comments: 13 pages

  50. arXiv:1503.03642  [pdf, other]

    cs.DB

    DGCC: A New Dependency Graph based Concurrency Control Protocol for Multicore Database Systems

    Authors: Chang Yao, Divyakant Agrawal, Pengfei Chang, Gang Chen, Beng Chin Ooi, Weng-Fai Wong, Meihui Zhang

    Abstract: Multicore CPUs and large memories are increasingly becoming the norm in modern computer systems. However, current database management systems (DBMSs) are generally ineffective in exploiting the parallelism of such systems. In particular, contention can lead to a dramatic fall in performance. In this paper, we propose a new concurrency control protocol called DGCC (Dependency Graph based Concurrenc…

    Submitted 12 March, 2015; originally announced March 2015.

    Comments: 12 pages