Zum Hauptinhalt springen

Showing 1–44 of 44 results for author: Jaiswal, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13689  [pdf, other

    cs.CY cs.AI cs.CV

    Shaded Route Planning Using Active Segmentation and Identification of Satellite Images

    Authors: Longchao Da, Rohan Chhibba, Rushabh Jaiswal, Ariane Middel, Hua Wei

    Abstract: Heatwaves pose significant health risks, particularly due to prolonged exposure to high summer temperatures. Vulnerable groups, especially pedestrians and cyclists on sun-exposed sidewalks, motivate the development of a route planning method that incorporates somatosensory temperature effects through shade ratio consideration. This paper is the first to introduce a pipeline that utilizes segmentat… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Paper accepted to CIKM24 demo track

    MSC Class: 68T45; 68U35 ACM Class: I.2.10; I.4.8

  2. arXiv:2405.13351  [pdf, other

    quant-ph cs.DS

    Quantum (Inspired) $D^2$-sampling with Applications

    Authors: Ragesh Jaiswal, Poojan Shah

    Abstract: $D^2$-sampling is a fundamental component of sampling-based clustering algorithms such as $k$-means++. Given a dataset $V \subset \mathbb{R}^d$ with $N$ points and a center set $C \subset \mathbb{R}^d$, $D^2$-sampling refers to picking a point from $V$ where the sampling probability of a point is proportional to its squared distance from the nearest center in $C$. Starting with empty $C… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.08167

  3. arXiv:2405.02718  [pdf, other

    eess.SP cs.IT

    Zak-OTFS: Pulse Shaping and the Tradeoff between Time/Bandwidth Expansion and Predictability

    Authors: Jinu Jayachandran, Rahul Kumar Jaiswal, Saif Khan Mohammed, Ronny Hadani, Ananthanarayanan Chockalingam, Robert Calderbank

    Abstract: The Zak-OTFS input/output (I/O) relation is predictable and non-fading when the delay and Doppler periods are greater than the effective channel delay and Doppler spreads, a condition which we refer to as the crystallization condition. When the crystallization condition is satisfied, we describe how to integrate sensing and communication within a single Zak-OTFS subframe by transmitting a pilot in… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

    Authors: Avinash Anand, Raj Jaiswal, Pijush Bhuyan, Mohit Gupta, Siddhesh Bangar, Md. Modassir Imam, Rajiv Ratn Shah, Shin'ichi Satoh

    Abstract: The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive capabilities of various systems such as search engines and Knowledge Graphs. Addressing the two main problems, namely table detection (TD) and table structure recognition… ▽ More

    Submitted 19 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 8 pages, 2 figures, Workshop of 1st MMIR Deep Multimodal Learning for Information Retrieval

  5. RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization

    Authors: Avinash Anand, Raj Jaiswal, Mohit Gupta, Siddhesh S Bangar, Pijush Bhuyan, Naman Lal, Rajeev Singh, Ritika Jha, Rajiv Ratn Shah, Shin'ichi Satoh

    Abstract: Large ground-truth datasets and recent advances in deep learning techniques have been useful for layout detection. However, because of the restricted layout diversity of these datasets, training on them requires a sizable number of annotated instances, which is both expensive and time-consuming. As a result, differences between the source and target domains may significantly impact how well these… ▽ More

    Submitted 19 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 8 pages, 6 figures, MMAsia 2023 Proceedings of the 5th ACM International Conference on Multimedia in Asia

    Journal ref: In Proceedings of the 5th ACM International Conference on Multimedia in Asia 2023. Association for Computing Machinery, NY, USA, Article 74, pp. 1-6

  6. arXiv:2402.15121  [pdf, other

    cs.AR cs.ET eess.IV

    Toward High Performance, Programmable Extreme-Edge Intelligence for Neuromorphic Vision Sensors utilizing Magnetic Domain Wall Motion-based MTJ

    Authors: Md Abdullah-Al Kaiser, Gourav Datta, Peter A. Beerel, Akhilesh R. Jaiswal

    Abstract: The desire to empower resource-limited edge devices with computer vision (CV) must overcome the high energy consumption of collecting and processing vast sensory data. To address the challenge, this work proposes an energy-efficient non-von-Neumann in-pixel processing solution for neuromorphic vision sensors employing emerging (X) magnetic domain wall magnetic tunnel junction (MDWMTJ) for the firs… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures, 2 table

  7. arXiv:2401.06714  [pdf, other

    cs.DS

    FPT Approximation for Capacitated Sum of Radii

    Authors: Ragesh Jaiswal, Amit Kumar, Jatin Yadav

    Abstract: We consider the capacitated clustering problem in general metric spaces where the goal is to identify $k$ clusters and minimize the sum of the radii of the clusters (we call this the Capacitated-$k$-sumRadii problem). We are interested in fixed-parameter tractable (FPT) approximation algorithms where the running time is of the form $f(k) \cdot \text{poly}(n)$, where $f(k)$ can be an exponential fu… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  8. arXiv:2310.16844  [pdf, other

    cs.AR eess.IV

    Hardware-Algorithm Co-design Enabling Processing-in-Pixel-in-Memory (P2M) for Neuromorphic Vision Sensors

    Authors: Md Abdullah-Al Kaiser, Akhilesh R. Jaiswal

    Abstract: The high volume of data transmission between the edge sensor and the cloud processor leads to energy and throughput bottlenecks for resource-constrained edge devices focused on computer vision. Hence, researchers are investigating different approaches (e.g., near-sensor processing, in-sensor processing, in-pixel processing) by executing computations closer to the sensor to reduce the transmission… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures, 1 table

  9. arXiv:2308.08167  [pdf, ps, other

    quant-ph cs.DS cs.LG

    A Quantum Approximation Scheme for k-Means

    Authors: Ragesh Jaiswal

    Abstract: We give a quantum approximation scheme (i.e., $(1 + \varepsilon)$-approximation for every $\varepsilon > 0$) for the classical $k$-means clustering problem in the QRAM model with a running time that has only polylogarithmic dependence on the number of data points. More specifically, given a dataset $V$ with $N$ points in $\mathbb{R}^d$ stored in QRAM data structure, our quantum algorithm runs in t… ▽ More

    Submitted 24 May, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: An extended version of this paper can be found here arXiv:2405.13351

  10. arXiv:2305.16890  [pdf, ps, other

    cs.DS cs.LG

    Universal Weak Coreset

    Authors: Ragesh Jaiswal, Amit Kumar

    Abstract: Coresets for $k$-means and $k$-median problems yield a small summary of the data, which preserve the clustering cost with respect to any set of $k$ centers. Recently coresets have also been constructed for constrained $k$-means and $k$-median problems. However, the notion of coresets has the drawback that (i) they can only be applied in settings where the input points are allowed to have weights,… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  11. arXiv:2305.00175  [pdf, ps, other

    cs.DS

    Clustering What Matters in Constrained Settings

    Authors: Ragesh Jaiswal, Amit Kumar

    Abstract: Constrained clustering problems generalize classical clustering formulations, e.g., $k$-median, $k$-means, by imposing additional constraints on the feasibility of clustering. There has been significant recent progress in obtaining approximation algorithms for these problems, both in the metric and the Euclidean settings. However, the outlier version of these problems, where the solution is allowe… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  12. Technology-Circuit-Algorithm Tri-Design for Processing-in-Pixel-in-Memory (P2M)

    Authors: Md Abdullah-Al Kaiser, Gourav Datta, Sreetama Sarkar, Souvik Kundu, Zihan Yin, Manas Garg, Ajey P. Jacob, Peter A. Beerel, Akhilesh R. Jaiswal

    Abstract: The massive amounts of data generated by camera sensors motivate data processing inside pixel arrays, i.e., at the extreme-edge. Several critical developments have fueled recent interest in the processing-in-pixel-in-memory paradigm for a wide range of visual machine intelligence tasks, including (1) advances in 3D integration technology to enable complex processing inside each pixel in a 3D integ… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: GLSVLSI '23: Great Lakes Symposium on VLSI 2023 Proceedings

  13. A Context-Switching/Dual-Context ROM Augmented RAM using Standard 8T SRAM

    Authors: Md Abdullah-Al Kaiser, Edwin Tieu, Ajey P. Jacob, Akhilesh R. Jaiswal

    Abstract: The landscape of emerging applications has been continually widening, encompassing various data-intensive applications like artificial intelligence, machine learning, secure encryption, Internet-of-Things, etc. A sustainable approach toward creating dedicated hardware platforms that can cater to multiple applications often requires the underlying hardware to context-switch or support more than one… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: GLSVLSI '23: Great Lakes Symposium on VLSI 2023 Proceedings

  14. arXiv:2301.13005  [pdf

    cs.DC cs.CR

    Farm Environmental Data Analyzer using a Decentralised system and R

    Authors: Aryan Bagade, Rupesh C. Jaiswal

    Abstract: Data/Web Hosting is a service that lets enterprises or selves present their data on the internet that users can access. The firm providing such services are web/data host. Apart from that, such services require incessant support, and not everyone can afford a particular centralized data host service. The peer-to-peer(P2P) protocol, the Interplanetary file system(IPFS), is augmenting into a legitim… ▽ More

    Submitted 17 December, 2022; originally announced January 2023.

  15. arXiv:2212.10881  [pdf, other

    cs.CV

    In-Sensor & Neuromorphic Computing are all you need for Energy Efficient Computer Vision

    Authors: Gourav Datta, Zeyu Liu, Md Abdullah-Al Kaiser, Souvik Kundu, Joe Mathai, Zihan Yin, Ajey P. Jacob, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: Due to the high activation sparsity and use of accumulates (AC) instead of expensive multiply-and-accumulates (MAC), neuromorphic spiking neural networks (SNNs) have emerged as a promising low-power alternative to traditional DNNs for several computer vision (CV) applications. However, most existing SNNs require multiple time steps for acceptable inference accuracy, hindering real-time deployment… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  16. arXiv:2210.05451  [pdf, other

    cs.CV eess.IV

    Enabling ISP-less Low-Power Computer Vision

    Authors: Gourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: In order to deploy current computer vision (CV) models on resource-constrained low-power devices, recent works have proposed in-sensor and in-pixel computing approaches that try to partly/fully bypass the image signal processor (ISP) and yield significant bandwidth reduction between the image sensor and the CV processing unit by downsampling the activation maps in the initial convolutional neural… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  17. arXiv:2203.05696  [pdf, other

    eess.IV cs.CV

    Toward Efficient Hyperspectral Image Processing inside Camera Pixels

    Authors: Gourav Datta, Zihan Yin, Ajey Jacob, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: Hyperspectral cameras generate a large amount of data due to the presence of hundreds of spectral bands as opposed to only three channels (red, green, and blue) in traditional cameras. This requires a significant amount of data transmission between the hyperspectral image sensor and a processor used to classify/detect/track the images, frame by frame, expending high energy and causing bandwidth an… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 6 pages, 3 figures

  18. arXiv:2203.04737  [pdf, other

    cs.LG cs.AR cs.CV

    P2M: A Processing-in-Pixel-in-Memory Paradigm for Resource-Constrained TinyML Applications

    Authors: Gourav Datta, Souvik Kundu, Zihan Yin, Ravi Teja Lakkireddy, Joe Mathai, Ajey Jacob, Peter A. Beerel, Akhilesh R. Jaiswal

    Abstract: The demand to process vast amounts of data generated from state-of-the-art high resolution cameras has motivated novel energy-efficient on-device AI solutions. Visual data in such cameras are usually captured in the form of analog voltages by a sensor pixel array, and then converted to the digital domain for subsequent AI processing using analog-to-digital converters (ADC). Recent research has tri… ▽ More

    Submitted 16 March, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: 15 pages, 8 figures

  19. arXiv:2110.14242  [pdf, other

    cs.DS cs.LG

    Tight FPT Approximation for Constrained k-Center and k-Supplier

    Authors: Dishant Goyal, Ragesh Jaiswal

    Abstract: In this work, we study a range of constrained versions of the $k$-supplier and $k$-center problems such as: capacitated, fault-tolerant, fair, etc. These problems fall under a broad framework of constrained clustering. A unified framework for constrained clustering was proposed by Ding and Xu [SODA 2015] in context of the $k$-median and $k$-means objectives. In this work, we extend this framework… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  20. arXiv:2109.14801  [pdf, other

    cs.ET cond-mat.dis-nn cond-mat.mes-hall

    Benchmarking a Probabilistic Coprocessor

    Authors: Jan Kaiser, Risi Jaiswal, Behtash Behin-Aein, Supriyo Datta

    Abstract: Computation in the past decades has been driven by deterministic computers based on classical deterministic bits. Recently, alternative computing paradigms and domain-based computing like quantum computing and probabilistic computing have gained traction. While quantum computers based on q-bits utilize quantum effects to advance computation, probabilistic computers based on probabilistic (p-)bits… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

  21. arXiv:2107.11979  [pdf, other

    cs.NE

    HYPER-SNN: Towards Energy-efficient Quantized Deep Spiking Neural Networks for Hyperspectral Image Classification

    Authors: Gourav Datta, Souvik Kundu, Akhilesh R. Jaiswal, Peter A. Beerel

    Abstract: Hyper spectral images (HSI) provide rich spectral and spatial information across a series of contiguous spectral bands. However, the accurate processing of the spectral and spatial correlation between the bands requires the use of energy-expensive 3-D Convolutional Neural Networks (CNNs). To address this challenge, we propose the use of Spiking Neural Networks (SNNs) that are generated from iso-ar… ▽ More

    Submitted 28 July, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

  22. arXiv:2107.07342  [pdf, other

    cs.LG eess.IV physics.optics

    Probabilistic analysis of solar cell optical performance using Gaussian processes

    Authors: Rahul Jaiswal, Manel Martínez-Ramón, Tito Busani

    Abstract: This work investigates application of different machine learning based prediction methodologies to estimate the performance of silicon based textured cells. Concept of confidence bound regions is introduced and advantages of this concept are discussed in detail. Results show that reflection profiles and depth dependent optical generation profiles can be accurately estimated using Gaussian processe… ▽ More

    Submitted 26 June, 2021; originally announced July 2021.

  23. arXiv:2106.06755  [pdf, ps, other

    cs.DS cs.LG

    Tight FPT Approximation for Socially Fair Clustering

    Authors: Dishant Goyal, Ragesh Jaiswal

    Abstract: In this work, we study the socially fair $k$-median/$k$-means problem. We are given a set of points $P$ in a metric space $\mathcal{X}$ with a distance function $d(.,.)$. There are $\ell$ groups: $P_1,\dotsc,P_{\ell} \subseteq P$. We are also given a set $F$ of feasible centers in $\mathcal{X}$. The goal in the socially fair $k$-median problem is to find a set $C \subseteq F$ of $k$ centers that m… ▽ More

    Submitted 13 September, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: The new version gives tight approximation results. However, the old version uses techniques that work in the streaming setting albeit at the cost of weaker approximation guarantees. So, readers interested in the streaming setting may want to see the older version

  24. Hardness of Approximation of Euclidean $k$-Median

    Authors: Anup Bhattacharya, Dishant Goyal, Ragesh Jaiswal

    Abstract: The Euclidean $k$-median problem is defined in the following manner: given a set $\mathcal{X}$ of $n$ points in $\mathbb{R}^{d}$, and an integer $k$, find a set $C \subset \mathbb{R}^{d}$ of $k$ points (called centers) such that the cost function $Φ(C,\mathcal{X}) \equiv \sum_{x \in \mathcal{X}} \min_{c \in C} \|x-c\|_{2}$ is minimized. The Euclidean $k$-means problem is defined similarly by repla… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

  25. arXiv:2007.11773  [pdf, other

    cs.DS

    FPT Approximation for Constrained Metric $k$-Median/Means

    Authors: Dishant Goyal, Ragesh Jaiswal, Amit Kumar

    Abstract: The Metric $k$-median problem over a metric space $(\mathcal{X}, d)$ is defined as follows: given a set $L \subseteq \mathcal{X}$ of facility locations and a set $C \subseteq \mathcal{X}$ of clients, open a set $F \subseteq L$ of $k$ facilities such that the total service cost, defined as $Φ(F, C) \equiv \sum_{x \in C} \min_{f \in F} d(x, f)$, is minimised. The metric $k$-means problem is defined… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

  26. arXiv:1909.11744  [pdf, ps, other

    cs.DS

    Streaming PTAS for Binary $\ell_0$-Low Rank Approximation

    Authors: Anup Bhattacharya, Dishant Goyal, Ragesh Jaiswal, Amit Kumar

    Abstract: We give a 3-pass, polylog-space streaming PTAS for the constrained binary $k$-means problem and a 4-pass, polylog-space streaming PTAS for the binary $\ell_0$-low rank approximation problem. The connection between the above two problems has recently been studied. We design a streaming PTAS for the former and use this connection to obtain streaming PTAS for the latter. This is the first constant pa… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  27. arXiv:1909.07515  [pdf, ps, other

    cs.DS

    Multiplicative Rank-1 Approximation using Length-Squared Sampling

    Authors: Ragesh Jaiswal, Amit Kumar

    Abstract: We show that the span of $Ω(\frac{1}{\varepsilon^4})$ rows of any matrix $A \subset \mathbb{R}^{n \times d}$ sampled according to the length-squared distribution contains a rank-$1$ matrix $\tilde{A}$ such that $||A - \tilde{A}||_F^2 \leq (1 + \varepsilon) \cdot ||A - π_1(A)||_F^2$, where $π_1(A)$ denotes the best rank-$1$ approximation of $A$ under the Frobenius norm. Length-squared sampling has… ▽ More

    Submitted 28 October, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: A section on open problems added in the new version

  28. arXiv:1909.07511  [pdf, other

    cs.DS

    Streaming PTAS for Constrained k-Means

    Authors: Dishant Goyal, Ragesh Jaiswal, Amit Kumar

    Abstract: We generalise the results of Bhattacharya et al. (Journal of Computing Systems, 62(1):93-115, 2018) for the list-$k$-means problem defined as -- for a (unknown) partition $X_1, ..., X_k$ of the dataset $X \subseteq \mathbb{R}^d$, find a list of $k$-center sets (each element in the list is a set of $k$ centers) such that at least one of $k$-center sets $\{c_1, ..., c_k\}$ in the list gives an… ▽ More

    Submitted 18 February, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: Changes from previous version: (i) added discussion on coreset, and (ii) fixed few typos

  29. arXiv:1907.09664  [pdf, other

    cs.ET cond-mat.dis-nn cond-mat.mes-hall

    Autonomous Probabilistic Coprocessing with Petaflips per Second

    Authors: Brian Sutton, Rafatul Faria, Lakshmi A. Ghantasala, Risi Jaiswal, Kerem Y. Camsari, Supriyo Datta

    Abstract: In this paper we present a concrete design for a probabilistic (p-) computer based on a network of p-bits, robust classical entities fluctuating between -1 and +1, with probabilities that are controlled through an input constructed from the outputs of other p-bits. The architecture of this probabilistic computer is similar to a stochastic neural network with the p-bit playing the role of a binary… ▽ More

    Submitted 22 August, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: 13 pages, 8 figures, 1 table

    Journal ref: IEEE Access (2020)

  30. arXiv:1812.03385  [pdf

    cs.CV

    Biometric Recognition System (Algorithm)

    Authors: Rahul Kumar Jaiswal, Gaurav Saxena

    Abstract: Fingerprints are the most widely deployed form of biometric identification. No two individuals share the same fingerprint because they have unique biometric identifiers. This paper presents an efficient fingerprint verification algorithm which improves matching accuracy. Fingerprint images get degraded and corrupted due to variations in skin and impression conditions. Thus, image enhancement techn… ▽ More

    Submitted 8 December, 2018; originally announced December 2018.

    Comments: Conference

  31. arXiv:1712.06865  [pdf, ps, other

    cs.DS

    Approximate Correlation Clustering Using Same-Cluster Queries

    Authors: Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal

    Abstract: Ashtiani et al. (NIPS 2016) introduced a semi-supervised framework for clustering (SSAC) where a learner is allowed to make same-cluster queries. More specifically, in their model, there is a query oracle that answers queries of the form given any two vertices, do they belong to the same optimal cluster?. Ashtiani et al. showed the usefulness of such a query framework by giving a polynomial time a… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: To appear in LATIN 2018

  32. arXiv:1704.05232  [pdf, other

    cs.DS

    On the k-Means/Median Cost Function

    Authors: Anup Bhattacharya, Yoav Freund, Ragesh Jaiswal

    Abstract: In this work, we study the $k$-means cost function. Given a dataset $X \subseteq \mathbb{R}^d$ and an integer $k$, the goal of the Euclidean $k$-means problem is to find a set of $k$ centers $C \subseteq \mathbb{R}^d$ such that $Φ(C, X) \equiv \sum_{x \in X} \min_{c \in C} ||x - c||^2$ is minimized. Let $Δ(X,k) \equiv \min_{C \subseteq \mathbb{R}^d} Φ(C, X)$ denote the cost of the optimal $k$-mean… ▽ More

    Submitted 9 September, 2021; v1 submitted 18 April, 2017; originally announced April 2017.

    Comments: This update includes minor improvements and a new section on Dimension Estimation

    ACM Class: I.5.3; H.3.3; F.2

  33. arXiv:1704.01862  [pdf, ps, other

    cs.DS

    Approximate Clustering with Same-Cluster Queries

    Authors: Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal, Amit Kumar

    Abstract: Ashtiani et al. proposed a Semi-Supervised Active Clustering framework (SSAC), where the learner is allowed to make adaptive queries to a domain expert. The queries are of the kind "do two given points belong to the same optimal cluster?" There are many clustering contexts where such same-cluster queries are feasible. Ashtiani et al. exhibited the power of such queries by showing that any instance… ▽ More

    Submitted 4 October, 2017; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: Updated version has results for faulty queries

  34. arXiv:1504.02564  [pdf, ps, other

    cs.DS

    Faster Algorithms for the Constrained k-means Problem

    Authors: Anup Bhattacharya, Ragesh Jaiswal, Amit Kumar

    Abstract: The classical center based clustering problems such as $k$-means/median/center assume that the optimal clusters satisfy the locality property that the points in the same cluster are close to each other. A number of clustering problems arise in machine learning where the optimal clusters do not follow such a locality property. Consider a variant of the $k$-means problem that may be regarded as a ge… ▽ More

    Submitted 10 April, 2015; originally announced April 2015.

  35. arXiv:1407.1689  [pdf, other

    cs.DS

    Sampling in Space Restricted Settings

    Authors: Anup Bhattacharya, Davis Issac, Ragesh Jaiswal, Amit Kumar

    Abstract: Space efficient algorithms play a central role in dealing with large amount of data. In such settings, one would like to analyse the large data using small amount of "working space". One of the key steps in many algorithms for analysing large data is to maintain a (or a small number) random sample from the data points. In this paper, we consider two space restricted settings -- (i) streaming model… ▽ More

    Submitted 15 January, 2015; v1 submitted 7 July, 2014; originally announced July 2014.

  36. arXiv:1404.5169  [pdf, ps, other

    cs.CC

    A note on the relation between XOR and Selective XOR Lemmas

    Authors: Ragesh Jaiswal

    Abstract: Given an unpredictable Boolean function $f: \{0, 1\}^n \rightarrow \{0, 1\}$, the standard Yao's XOR lemma is a statement about the unpredictability of computing $\oplus_{i \in [k]}f(x_i)$ given $x_1, ..., x_k \in \{0, 1\}^n$, whereas the Selective XOR lemma is a statement about the unpredictability of computing $\oplus_{i \in S}f(x_i)$ given $x_1, ..., x_k \in \{0, 1\}^n$ and… ▽ More

    Submitted 15 August, 2019; v1 submitted 21 April, 2014; originally announced April 2014.

    Comments: The previous version has been significantly simplified to highlight the main result

  37. arXiv:1401.3685  [pdf, ps, other

    cs.DS

    Improved analysis of D2-sampling based PTAS for k-means and other Clustering problems

    Authors: Ragesh Jaiswal, Mehul Kumar, Pulkit Yadav

    Abstract: We give an improved analysis of the simple $D^2$-sampling based PTAS for the $k$-means clustering problem given by Jaiswal, Kumar, and Sen (Algorithmica, 2013). The improvement on the running time is from $O\left(nd \cdot 2^{\tilde{O}(k^2/ε)}\right)$ to $O\left(nd \cdot 2^{\tilde{O}(k/ε)}\right)$.

    Submitted 15 January, 2014; originally announced January 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1201.4206

  38. arXiv:1401.2912  [pdf, other

    cs.DS

    A tight lower bound instance for k-means++ in constant dimension

    Authors: Anup Bhattacharya, Ragesh Jaiswal, Nir Ailon

    Abstract: The k-means++ seeding algorithm is one of the most popular algorithms that is used for finding the initial $k$ centers when using the k-means heuristic. The algorithm is a simple sampling procedure and can be described as follows: Pick the first center randomly from the given points. For $i > 1$, pick a point to be the $i^{th}$ center with probability proportional to the square of the Euclidean di… ▽ More

    Submitted 13 January, 2014; v1 submitted 13 January, 2014; originally announced January 2014.

    Comments: To appear in TAMC 2014. arXiv admin note: text overlap with arXiv:1306.4207

  39. arXiv:1308.1351   

    cs.DS

    An $O^*(1.0821^n)$-Time Algorithm for Computing Maximum Independent Set in Graphs with Bounded Degree 3

    Authors: Davis Issac, Ragesh Jaiswal

    Abstract: We give an $O^*(1.0821^n)$-time, polynomial space algorithm for computing Maximum Independent Set in graphs with bounded degree 3. This improves all the previous running time bounds known for the problem.

    Submitted 17 June, 2022; v1 submitted 6 August, 2013; originally announced August 2013.

    Comments: While working on an updated version, we observed a bug in one of the cases of our extensive case analysis. We are withdrawing this paper while we work to fix the bug. We will add an updated version once we manage to fix the bug

  40. arXiv:1306.4207  [pdf, other

    cs.DS

    A bad 2-dimensional instance for k-means++

    Authors: Ragesh Jaiswal, Prachi Jain, Saumya Yadav

    Abstract: The k-means++ seeding algorithm is one of the most popular algorithms that is used for finding the initial $k$ centers when using the k-means heuristic. The algorithm is a simple sampling procedure and can be described as follows: {quote} Pick the first center randomly from among the given points. For $i > 1$, pick a point to be the $i^{th}$ center with probability proportional to the square of th… ▽ More

    Submitted 18 June, 2013; originally announced June 2013.

  41. Reconstruction and Analysis of Cancer-specific Gene Regulatory Networks from Gene Expression Profiles

    Authors: Khalid Raza, Rajni Jaiswal

    Abstract: The main goal of Systems Biology research is to reconstruct biological networks for its topological analysis so that reconstructed networks can be used for the identification of various kinds of disease. The availability of high-throughput data generated by microarray experiments fueled researchers to use whole-genome gene expression profiles to understand cancer and to reconstruct key cancer-spec… ▽ More

    Submitted 30 June, 2013; v1 submitted 23 May, 2013; originally announced May 2013.

    Comments: 10 pages, 1 figure, 2 tables

    Journal ref: International Journal on Bioinformatics & Biosciences (IJBB), 3(2):25-34, June 2013

  42. arXiv:1202.6680  [pdf, other

    cs.CC cs.DM math.PR

    On the Distribution of the Fourier Spectrum of Halfspaces

    Authors: Ilias Diakonikolas, Ragesh Jaiswal, Rocco A. Servedio, Li-Yang Tan, Andrew Wan

    Abstract: Bourgain showed that any noise stable Boolean function $f$ can be well-approximated by a junta. In this note we give an exponential sharpening of the parameters of Bourgain's result under the additional assumption that $f$ is a halfspace.

    Submitted 29 February, 2012; originally announced February 2012.

  43. arXiv:1201.4206  [pdf, ps, other

    cs.DS

    A simple D^2-sampling based PTAS for k-means and other Clustering Problems

    Authors: Ragesh Jaiswal, Amit Kumar, Sandeep Sen

    Abstract: Given a set of points $P \subset \mathbb{R}^d$, the $k$-means clustering problem is to find a set of $k$ {\em centers} $C = \{c_1,...,c_k\}, c_i \in \mathbb{R}^d,$ such that the objective function $\sum_{x \in P} d(x,C)^2$, where $d(x,C)$ denotes the distance between $x$ and the closest center in $C$, is minimized. This is one of the most prominent objective functions that have been studied with r… ▽ More

    Submitted 20 January, 2012; originally announced January 2012.

    ACM Class: I.5.3

  44. arXiv:0902.3757  [pdf, ps, other

    cs.CC

    Bounded Independence Fools Halfspaces

    Authors: Ilias Diakonikolas, Parikshit Gopalan, Ragesh Jaiswal, Rocco Servedio, Emanuele Viola

    Abstract: We show that any distribution on {-1,1}^n that is k-wise independent fools any halfspace h with error \eps for k = O(\log^2(1/\eps) /\eps^2). Up to logarithmic factors, our result matches a lower bound by Benjamini, Gurel-Gurevich, and Peled (2007) showing that k = Ω(1/(\eps^2 \cdot \log(1/\eps))). Using standard constructions of k-wise independent distributions, we obtain the first explicit pse… ▽ More

    Submitted 21 February, 2009; originally announced February 2009.