Showing 1–16 of 16 results for author: Bera, S K

  1. DeMEtRIS: Counting (near)-Cliques by Crawling

    Authors: Suman K. Bera, Jayesh Choudhari, Shahrzad Haddadan, Sara Ahmadian

    Abstract: We study the problem of approximately counting cliques and near cliques in a graph, where the access to the graph is only available through crawling its vertices; thus typically seeing only a small portion of it. This model, known as the random walk model or the neighborhood query model, has been introduced recently and captures real-life scenarios in which the entire graph is too massive to be sto…

    Submitted 7 December, 2022; originally announced December 2022.

  2. arXiv:2211.06352  [pdf, other]

    cs.SI cs.DM cs.DS

    Spectral Triadic Decompositions of Real-World Networks

    Authors: Sabyasachi Basu, Suman Kalyan Bera, C. Seshadhri

    Abstract: A fundamental problem in mathematics and network analysis is to find conditions under which a graph can be partitioned into smaller pieces. The most important tool for this partitioning is the Fiedler vector or discrete Cheeger inequality. These results relate the graph spectrum (eigenvalues of the normalized adjacency matrix) to the ability to break a graph into two pieces, with few edge deletion…

    Submitted 8 May, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

  3. arXiv:2204.08106  [pdf, other]

    cs.DS

    A New Dynamic Algorithm for Densest Subhypergraphs

    Authors: Suman K. Bera, Sayan Bhattacharya, Jayesh Choudhari, Prantar Ghosh

    Abstract: Computing a dense subgraph is a fundamental problem in graph mining, with a diverse set of applications ranging from electronic commerce to community detection in social networks. In many of these applications, the underlying context is better modelled as a weighted hypergraph that keeps evolving with time. This motivates the problem of maintaining the densest subhypergraph of a weighted hypergr…

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: Extended abstract appears in TheWebConf (previously WWW) 2022

  4. arXiv:2010.08083  [pdf, ps, other]

    cs.DS cs.CC

    Near-Linear Time Homomorphism Counting in Bounded Degeneracy Graphs: The Barrier of Long Induced Cycles

    Authors: Suman K. Bera, Noujan Pashanasangi, C. Seshadhri

    Abstract: Counting homomorphisms of a constant sized pattern graph $H$ in an input graph $G$ is a fundamental computational problem. There is a rich history of studying the complexity of this problem, under various constraints on the input $G$ and the pattern $H$. Given the significance of this problem and the large sizes of modern inputs, we investigate when near-linear time algorithms are possible. We foc…

    Submitted 18 November, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: To be published in Symposium on Discrete Algorithms (SODA) 2021. Added conclusion section in the new version.

  5. arXiv:2010.05998  [pdf, ps, other]

    cs.CC cs.DS math.CO

    Counting Subgraphs in Degenerate Graphs

    Authors: Suman K. Bera, Lior Gishboliner, Yevgeny Levanzov, C. Seshadhri, Asaf Shapira

    Abstract: We consider the problem of counting the number of copies of a fixed graph $H$ within an input graph $G$. This is one of the most well-studied algorithmic graph problems, with many theoretical and practical applications. We focus on solving this problem when the input $G$ has bounded degeneracy. This is a rich family of graphs, containing all graphs without a fixed minor (e.g. planar graphs), as we…

    Submitted 9 December, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  6. arXiv:2006.12589  [pdf, other]

    cs.LG cs.DS stat.ML

    Distributional Individual Fairness in Clustering

    Authors: Nihesh Anderson, Suman K. Bera, Syamantak Das, Yang Liu

    Abstract: In this paper, we initiate the study of fair clustering that ensures distributional similarity among similar individuals. In response to improving fairness in machine learning, recent papers have investigated fairness in clustering algorithms and have focused on the paradigm of statistical parity/group fairness. These efforts attempt to minimize bias against some protected groups in the population…

    Submitted 22 June, 2020; originally announced June 2020.

  7. arXiv:2006.11947  [pdf, other]

    cs.DS

    How to Count Triangles, without Seeing the Whole Graph

    Authors: Suman K. Bera, C. Seshadhri

    Abstract: Triangle counting is a fundamental problem in the analysis of large graphs. There is a rich body of work on this problem, in varying streaming and distributed models, yet all these algorithms require reading the whole input graph. In many scenarios, we do not have access to the whole graph, and can only sample a small portion of the graph (typically through crawling). In such a setting, how can we…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in KDD 2020

  8. arXiv:2005.08642  [pdf]

    cs.CV cs.LG cs.NE

    Atom Search Optimization with Simulated Annealing -- a Hybrid Metaheuristic Approach for Feature Selection

    Authors: Kushal Kanti Ghosh, Ritam Guha, Soulib Ghosh, Suman Kumar Bera, Ram Sarkar

    Abstract: 'Hybrid meta-heuristics' is one of the most interesting recent trends in the field of optimization and feature selection (FS). In this paper, we have proposed a binary variant of Atom Search Optimization (ASO) and its hybrid with Simulated Annealing called ASO-SA techniques for FS. In order to map the real values used by ASO to the binary domain of FS, we have used two different transfer functions…

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 39 pages, submitted to Expert Systems with Applications, Elsevier

  9. arXiv:2003.13151  [pdf, ps, other]

    cs.DS

    How the Degeneracy Helps for Triangle Counting in Graph Streams

    Authors: Suman K. Bera, C. Seshadhri

    Abstract: We revisit the well-studied problem of triangle count estimation in graph streams. Given a graph represented as a stream of $m$ edges, our aim is to compute a $(1\pm\varepsilon)$-approximation to the triangle count $T$, using a small space algorithm. For arbitrary order and a constant number of passes, the space complexity is known to be essentially $\Theta(\min(m^{3/2}/T, m/\sqrt{T}))$ (McGregor et al…

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in PODS'2020

  10. arXiv:1911.05896  [pdf, ps, other]

    cs.DS

    Linear Time Subgraph Counting, Graph Degeneracy, and the Chasm at Size Six

    Authors: Suman K. Bera, Noujan Pashanasangi, C. Seshadhri

    Abstract: We consider the problem of counting all $k$-vertex subgraphs in an input graph, for any constant $k$. This problem (denoted sub-cnt$_k$) has been studied extensively in both theory and practice. In a classic result, Chiba and Nishizeki (SICOMP 85) gave linear time algorithms for clique and 4-cycle counting for bounded degeneracy graphs. This is a rich class of sparse graphs that contains, for exam…

    Submitted 27 November, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: The previous version did not handle the case of k=8. We corrected that in this version.

  11. arXiv:1905.00566  [pdf, other]

    cs.DS

    Graph Coloring via Degeneracy in Streaming and Other Space-Conscious Models

    Authors: Suman K. Bera, Amit Chakrabarti, Prantar Ghosh

    Abstract: We study the problem of coloring a given graph using a small number of colors in several well-established models of computation for big data. These include the data streaming model, the general graph query model, the massively parallel computation (MPC) model, and the CONGESTED-CLIQUE and the LOCAL models of distributed computation. On the one hand, we give algorithms with sublinear complexity, fo…

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: 26 pages

  12. arXiv:1901.02393  [pdf, other]

    cs.DS cs.LG

    Fair Algorithms for Clustering

    Authors: Suman K. Bera, Deeparnab Chakrabarty, Nicolas J. Flores, Maryam Negahbani

    Abstract: We study the problem of finding low-cost Fair Clusterings in data where each data point may belong to many protected groups. Our work significantly generalizes the seminal work of Chierichetti et al. (NIPS 2017) as follows. - We allow the user to specify the parameters that define fair representation. More precisely, these parameters define the maximum over- and minimum under-representation of a…

    Submitted 17 June, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

  13. arXiv:1807.07640  [pdf, ps, other]

    cs.DS

    Coloring in Graph Streams

    Authors: Suman Kalyan Bera, Prantar Ghosh

    Abstract: In this paper, we initiate the study of the vertex coloring problem of a graph in the semi-streaming model. In this model, the input graph is defined by a stream of edges, arriving in adversarial order, and any algorithm must process the edges in the order of arrival using space linear (up to polylogarithmic factors) in the number of vertices of the graph. In the offline setting, there is a simple…

    Submitted 25 July, 2018; v1 submitted 19 July, 2018; originally announced July 2018.

  14. arXiv:1309.5904  [pdf, ps, other]

    cs.LG

    Fenchel Duals for Drifting Adversaries

    Authors: Suman K Bera, Anamitra R Choudhury, Syamantak Das, Sambuddha Roy, Jayram S. Thatchachar

    Abstract: We describe a primal-dual framework for the design and analysis of online convex optimization algorithms for drifting regret. Existing literature shows (nearly) optimal drifting regret bounds only for the $\ell_2$ and the $\ell_1$-norms. Our work provides a connection between these algorithms and the Online Mirror Descent (OMD) updates; one key insight that results from our work is that i…

    Submitted 23 September, 2013; originally announced September 2013.

  15. arXiv:1212.3964  [pdf, other]

    cs.IR

    Advanced Bloom Filter Based Algorithms for Efficient Approximate Data De-Duplication in Streams

    Authors: Suman K. Bera, Sourav Dutta, Ankur Narang, Souvik Bhattacherjee

    Abstract: Applications involving telecommunication call data records, web pages, online transactions, medical records, stock markets, climate warning systems, etc., necessitate efficient management and processing of such massively exponential amount of data from diverse sources. De-duplication or Intelligent Compression in streaming scenarios for approximate identification and elimination of duplicates from…

    Submitted 17 December, 2012; originally announced December 2012.

    Comments: 41 pages

  16. arXiv:1112.1945  [pdf, ps, other]

    cs.DS

    Approximation Algorithms for Edge Partitioned Vertex Cover Problems

    Authors: Suman Kalyan Bera, Shalmoli Gupta, Amit Kumar, Sambuddha Roy

    Abstract: We consider a natural generalization of the Partial Vertex Cover problem. Here an instance consists of a graph $G = (V,E)$, a positive cost function $c: V \to \mathbb{Z}^{+}$, a partition $P_1, \ldots, P_r$ of the edge set $E$, and a parameter $k_i$ for each partition $P_i$. The goal is to find a minimum cost set of vertices which cover at least $k_i$ edges from the partition $P_i$. We call this the Partition Vertex…

    Submitted 10 October, 2012; v1 submitted 8 December, 2011; originally announced December 2011.