Showing 1–37 of 37 results for author: Gopalan, P

Searching in archive cs.
  1. arXiv:2402.07821  [pdf, other]

    cs.LG cs.CC cs.DS math.ST stat.ML

    On Computationally Efficient Multi-Class Calibration

    Authors: Parikshit Gopalan, Lunjia Hu, Guy N. Rothblum

    Abstract: Consider a multi-class labelling problem, where the labels can take values in $[k]$, and a predictor predicts a distribution over the labels. In this work, we study the following foundational question: Are there notions of multi-class calibration that give strong guarantees of meaningful predictions and can be achieved in time and sample complexities polynomial in $k$? Prior notions of calibration…

    Submitted 8 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: In COLT 2024

  2. arXiv:2401.14645  [pdf, ps, other]

    cs.LG cs.CC cs.DS

    Omnipredictors for Regression and the Approximate Rank of Convex Functions

    Authors: Parikshit Gopalan, Princewill Okoroafor, Prasad Raghavendra, Abhishek Shetty, Mihir Singhal

    Abstract: Consider the supervised learning setting where the goal is to learn to predict labels $\mathbf y$ given points $\mathbf x$ from a distribution. An \textit{omnipredictor} for a class $\mathcal L$ of loss functions and a class $\mathcal C$ of hypotheses is a predictor whose predictions incur less expected loss than the best hypothesis in $\mathcal C$ for every loss in $\mathcal L$. Since the work of…

    Submitted 25 January, 2024; originally announced January 2024.

  3. arXiv:2306.10615  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Agnostically Learning Single-Index Models using Omnipredictors

    Authors: Aravind Gollakota, Parikshit Gopalan, Adam R. Klivans, Konstantinos Stavropoulos

    Abstract: We give the first result for agnostically learning Single-Index Models (SIMs) with arbitrary monotone and Lipschitz activations. All prior work either held only in the realizable setting or required the activation to be known. Moreover, we only require the marginal to have bounded second moments, whereas all prior work required stronger distributional assumptions (such as anticoncentration or boun…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: 21 pages

  4. arXiv:2305.18764  [pdf, other]

    cs.LG math.ST stat.ML

    When Does Optimizing a Proper Loss Yield Calibration?

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Preetum Nakkiran

    Abstract: Optimizing proper loss functions is popularly believed to yield predictors with good calibration properties; the intuition being that for such losses, the global optimum is to predict the ground-truth probabilities, which is indeed calibrated. However, typical machine learning models are trained to approximately minimize loss over restricted families of predictors, that are unlikely to contain the…

    Submitted 8 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: In NeurIPS 2023. Selected for spotlight presentation

  5. arXiv:2304.09424  [pdf, other]

    cs.LG cs.AI stat.ML

    Loss Minimization Yields Multicalibration for Large Neural Networks

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Adam Tauman Kalai, Preetum Nakkiran

    Abstract: Multicalibration is a notion of fairness for predictors that requires them to provide calibrated predictions across a large set of protected groups. Multicalibration is known to be a distinct goal than loss minimization, even for simple predictors such as linear functions. In this work, we consider the setting where the protected groups can be represented by neural networks of size $k$, and the…

    Submitted 7 December, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: In ITCS 2024

  6. arXiv:2302.06726  [pdf, other]

    cs.LG

    Swap Agnostic Learning, or Characterizing Omniprediction via Multicalibration

    Authors: Parikshit Gopalan, Michael P. Kim, Omer Reingold

    Abstract: We introduce and study Swap Agnostic Learning. The problem can be phrased as a game between a predictor and an adversary: first, the predictor selects a hypothesis $h$; then, the adversary plays in response, and for each level set of the predictor $\{x \in \mathcal{X} : h(x) = v\}$ selects a (different) loss-minimizing hypothesis $c_v \in \mathcal{C}$; the predictor wins if $h$ competes with the a…

    Submitted 21 January, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    MSC Class: 68T05; 68Q32

  7. arXiv:2211.16886  [pdf, other]

    cs.LG

    A Unifying Theory of Distance from Calibration

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Preetum Nakkiran

    Abstract: We study the fundamental question of how to define and measure the distance from calibration for probabilistic predictors. While the notion of perfect calibration is well-understood, there is no consensus on how to quantify the distance from perfect calibration. Numerous calibration measures have been proposed in the literature, but it is unclear how they compare to each other, and many popular me…

    Submitted 31 March, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: In STOC 2023

  8. arXiv:2210.08649  [pdf, other]

    cs.LG

    Loss Minimization through the Lens of Outcome Indistinguishability

    Authors: Parikshit Gopalan, Lunjia Hu, Michael P. Kim, Omer Reingold, Udi Wieder

    Abstract: We present a new perspective on loss minimization and the recent notion of Omniprediction through the lens of Outcome Indistinguishability. For a collection of losses and hypothesis class, omniprediction requires that a predictor provide a loss-minimization guarantee simultaneously for every loss in the collection compared to the best (loss-specific) hypothesis in the class. We present a generic t…

    Submitted 8 December, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

  9. arXiv:2203.01255  [pdf, other]

    cs.LG cs.CC

    Low-Degree Multicalibration

    Authors: Parikshit Gopalan, Michael P. Kim, Mihir Singhal, Shengjia Zhao

    Abstract: Introduced as a notion of algorithmic fairness, multicalibration has proved to be a powerful and versatile concept with implications far beyond its original intent. This stringent notion -- that predictions be well-calibrated across a rich class of intersecting subpopulations -- provides its strong guarantees at a cost: the computational and sample complexity of learning multicalibrated predictors…

    Submitted 16 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: Appears at COLT'22

  10. arXiv:2202.13576  [pdf, other]

    cs.LG cs.IT stat.ML

    KL Divergence Estimation with Multi-group Attribution

    Authors: Parikshit Gopalan, Nina Narodytska, Omer Reingold, Vatsal Sharan, Udi Wieder

    Abstract: Estimating the Kullback-Leibler (KL) divergence between two distributions given samples from them is well-studied in machine learning and information theory. Motivated by considerations of multi-group fairness, we seek KL divergence estimates that accurately reflect the contributions of sub-populations to the overall divergence. We model the sub-populations coming from a rich (possibly infinite) f…

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: 20 pages, 4 figures

  11. arXiv:2109.05389  [pdf, other]

    cs.LG stat.ML

    Omnipredictors

    Authors: Parikshit Gopalan, Adam Tauman Kalai, Omer Reingold, Vatsal Sharan, Udi Wieder

    Abstract: Loss minimization is a dominant paradigm in machine learning, where a predictor is trained to minimize some loss function that depends on an uncertain event (e.g., "will it rain tomorrow?"). Different loss functions imply different learning algorithms and, at times, very different predictors. While widespread and appealing, a clear drawback of this approach is that the loss function may not be kn…

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 35 pages, 1 figure

  12. arXiv:2103.05853  [pdf, ps, other]

    cs.LG stat.ML

    Multicalibrated Partitions for Importance Weights

    Authors: Parikshit Gopalan, Omer Reingold, Vatsal Sharan, Udi Wieder

    Abstract: The ratio between the probability that two distributions $R$ and $P$ give to points $x$ are known as importance weights or propensity scores and play a fundamental role in many different fields, most notably, statistics and machine learning. Among its applications, importance weights are central to domain adaptation, anomaly detection, and estimations of various divergences such as the KL divergen…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 27 pages

  13. arXiv:2006.12018  [pdf, other]

    cs.CR cs.DB

    Overlook: Differentially Private Exploratory Visualization for Big Data

    Authors: Pratiksha Thaker, Mihai Budiu, Parikshit Gopalan, Udi Wieder, Matei Zaharia

    Abstract: Data exploration systems that provide differential privacy must manage a privacy budget that measures the amount of privacy lost across multiple queries. One effective strategy to manage the privacy budget is to compute a one-time private synopsis of the data, to which users can make an unlimited number of queries. However, existing systems using synopses are built for offline use cases, where a s…

    Submitted 22 June, 2020; originally announced June 2020.

  14. arXiv:1912.03582  [pdf, other]

    cs.LG stat.ML

    PIDForest: Anomaly Detection via Partial Identification

    Authors: Parikshit Gopalan, Vatsal Sharan, Udi Wieder

    Abstract: We consider the problem of detecting anomalies in a large dataset. We propose a framework called Partial Identification which captures the intuition that anomalies are easy to distinguish from the overwhelming majority of points by relatively few attribute values. Formalizing this intuition, we propose a geometric anomaly measure for a point that we call PIDScore, which measures the minimum densit…

    Submitted 7 December, 2019; originally announced December 2019.

  15. arXiv:1911.07378  [pdf, ps, other]

    cs.DS cs.CC math.PR

    Finding Skewed Subcubes Under a Distribution

    Authors: Parikshit Gopalan, Roie Levin, Udi Wieder

    Abstract: Say that we are given samples from a distribution $ψ$ over an $n$-dimensional space. We expect or desire $ψ$ to behave like a product distribution (or a $k$-wise independent distribution over its marginals for small $k$). We propose the problem of enumerating/list-decoding all large subcubes where the distribution $ψ$ deviates markedly from what we expect; we refer to such subcubes as skewed subcu…

    Submitted 12 November, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

  16. Hillview: A trillion-cell spreadsheet for big data

    Authors: Mihai Budiu, Parikshit Gopalan, Lalith Suresh, Udi Wieder, Han Kruiger, Marcos K. Aguilera

    Abstract: Hillview is a distributed spreadsheet for browsing very large datasets that cannot be handled by a single machine. As a spreadsheet, Hillview provides a high degree of interactivity that permits data analysts to explore information quickly along many dimensions while switching visualizations on a whim. To provide the required responsiveness, Hillview introduces visualization sketches, or vizketche…

    Submitted 10 July, 2019; originally announced July 2019.

  17. arXiv:1804.03065  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Anomaly Detection via Matrix Sketching

    Authors: Vatsal Sharan, Parikshit Gopalan, Udi Wieder

    Abstract: We consider the problem of finding anomalies in high-dimensional data using popular PCA based anomaly scores. The naive algorithms for computing these scores explicitly compute the PCA of the covariance matrix which uses space quadratic in the dimensionality of the data. We give the first streaming algorithms that use space that is linear or sublinear in the dimension. We prove general results sho…

    Submitted 27 November, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: Updates for NeurIPS'18 camera-ready

  18. arXiv:1803.03620  [pdf, other]

    cs.DC

    Stable and Consistent Membership at Scale with Rapid

    Authors: Lalith Suresh, Dahlia Malkhi, Parikshit Gopalan, Ivan Porto Carreiro, Zeeshan Lokhandwala

    Abstract: We present the design and evaluation of Rapid, a distributed membership service. At Rapid's core is a scheme for multi-process cut detection (CD) that revolves around two key insights: (i) it suspects a failure of a process only after alerts arrive from multiple sources, and (ii) when a group of processes experience problems, it detects failures of the entire group, rather than conclude about each…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 15 pages

  19. arXiv:1605.05412  [pdf, other]

    cs.IT

    Maximally Recoverable Codes for Grid-like Topologies

    Authors: Parikshit Gopalan, Guangda Hu, Swastik Kopparty, Shubhangi Saraf, Carol Wang, Sergey Yekhanin

    Abstract: The explosion in the volumes of data being stored online has resulted in distributed storage systems transitioning to erasure coding based schemes. Yet, the codes being deployed in practice are fairly short. In this work, we address what we view as the main coding theoretic barrier to deploying longer codes in storage: at large lengths, failures are not independent and correlated failures are inev…

    Submitted 20 September, 2016; v1 submitted 17 May, 2016; originally announced May 2016.

  20. arXiv:1604.07432  [pdf, ps, other]

    cs.CC

    Degree and Sensitivity: tails of two distributions

    Authors: Parikshit Gopalan, Rocco Servedio, Avishay Tal, Avi Wigderson

    Abstract: The sensitivity of a Boolean function f is the maximum over all inputs x, of the number of sensitive coordinates of x. The well-known sensitivity conjecture of Nisan (see also Nisan and Szegedy) states that every sensitivity-s Boolean function can be computed by a polynomial over the reals of degree poly(s). The best known upper bounds on degree, however, are exponential rather than polynomial in…

    Submitted 25 April, 2016; originally announced April 2016.

    Comments: The conference version of this paper will appear in CCC'2016

    MSC Class: 68Q10; 68Q15

  21. arXiv:1508.02420  [pdf, ps, other]

    cs.CC

    Smooth Boolean functions are easy: efficient algorithms for low-sensitivity functions

    Authors: Parikshit Gopalan, Noam Nisan, Rocco A. Servedio, Kunal Talwar, Avi Wigderson

    Abstract: A natural measure of smoothness of a Boolean function is its sensitivity (the largest number of Hamming neighbors of a point which differ from it in function value). The structure of smooth or equivalently low-sensitivity functions is still a mystery. A well-known conjecture states that every such Boolean function can be computed by a shallow decision tree. While this conjecture implies that smoot…

    Submitted 10 August, 2015; originally announced August 2015.

    MSC Class: 68Q15; 68Q17

  22. arXiv:1506.04350  [pdf, ps, other]

    cs.CC

    Pseudorandomness via the discrete Fourier transform

    Authors: Parikshit Gopalan, Daniel Kane, Raghu Meka

    Abstract: We present a new approach to constructing unconditional pseudorandom generators against classes of functions that involve computing a linear function of the inputs. We give an explicit construction of a pseudorandom generator that fools the discrete Fourier transforms of linear functions with seed-length that is nearly logarithmic (up to polyloglog factors) in the input size and the desired error…

    Submitted 18 November, 2015; v1 submitted 14 June, 2015; originally announced June 2015.

  23. arXiv:1504.07687  [pdf, ps, other]

    cs.GT cs.CC

    Public projects, Boolean functions and the borders of Border's theorem

    Authors: Parikshit Gopalan, Noam Nisan, Tim Roughgarden

    Abstract: Border's theorem gives an intuitive linear characterization of the feasible interim allocation rules of a Bayesian single-item environment, and it has several applications in economic and algorithmic mechanism design. All known generalizations of Border's theorem either restrict attention to relatively simple settings, or resort to approximation. This paper identifies a complexity-theoretic barrie…

    Submitted 28 April, 2015; originally announced April 2015.

    Comments: Accepted to ACM EC 2015

    MSC Class: 68Q17; 68Q25

  24. arXiv:1411.4584  [pdf, ps, other]

    cs.CC

    Pseudorandomness for concentration bounds and signed majorities

    Authors: Parikshit Gopalan, Daniel Kane, Raghu Meka

    Abstract: The problem of constructing pseudorandom generators that fool halfspaces has been studied intensively in recent times. For fooling halfspaces over the hypercube with polynomially small error, the best construction known requires seed-length O(log^2 n) (MekaZ13). Getting the seed-length down to O(log(n)) is a natural challenge in its own right, which needs to be overcome in order to derandomize RL.…

    Submitted 17 November, 2014; originally announced November 2014.

  25. arXiv:1402.3543  [pdf, ps, other]

    cs.CC

    Inequalities and tail bounds for elementary symmetric polynomial with applications

    Authors: Parikshit Gopalan, Amir Yehudayoff

    Abstract: We study the extent of independence needed to approximate the product of bounded random variables in expectation, a natural question that has applications in pseudorandomness and min-wise independent hashing. For random variables whose absolute value is bounded by $1$, we give an error bound of the form $σ^{Ω(k)}$ where $k$ is the amount of independence and $σ^2$ is the total variance of the sum…

    Submitted 10 August, 2015; v1 submitted 14 February, 2014; originally announced February 2014.

    MSC Class: 68Q87; 68W20

  26. arXiv:1311.1704  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Scalable Recommendation with Poisson Factorization

    Authors: Prem Gopalan, Jake M. Hofman, David M. Blei

    Abstract: We develop a Bayesian Poisson matrix factorization model for forming recommendations from sparse user behavior data. These data are large user/item matrices where each user has provided feedback on only a small subset of items, either explicitly (e.g., through star ratings) or implicitly (e.g., through views or purchases). In contrast to traditional matrix factorization approaches, Poisson factori…

    Submitted 20 May, 2014; v1 submitted 7 November, 2013; originally announced November 2013.

  27. arXiv:1308.5158  [pdf, ps, other]

    cs.CC

    Locally Testable Codes and Cayley Graphs

    Authors: Parikshit Gopalan, Salil Vadhan, Yuan Zhou

    Abstract: We give two new characterizations of ($\F_2$-linear) locally testable error-correcting codes in terms of Cayley graphs over $\F_2^h$: (1) A locally testable code is equivalent to a Cayley graph over $\F_2^h$ whose set of generators is significantly larger than $h$ and has no short linear dependencies, but yields a shortest-path metric that embeds into $\ell_1$ with constant d…

    Submitted 23 August, 2013; originally announced August 2013.

    Comments: 22 pages

  28. arXiv:1307.4150  [pdf, ps, other]

    cs.IT

    Explicit Maximally Recoverable Codes with Locality

    Authors: Parikshit Gopalan, Cheng Huang, Bob Jenkins, Sergey Yekhanin

    Abstract: Consider a systematic linear code where some (local) parity symbols depend on few prescribed symbols, while other (heavy) parity symbols may depend on all data symbols. Local parities allow to quickly recover any single symbol when it is erased, while heavy parities provide tolerance to a large number of simultaneous erasures. A code as above is maximally-recoverable if it corrects all erasure pat…

    Submitted 19 July, 2013; v1 submitted 15 July, 2013; originally announced July 2013.

    MSC Class: 94B05

  29. arXiv:1210.0049  [pdf, ps, other]

    cs.CC

    Better Pseudorandom Generators from Milder Pseudorandom Restrictions

    Authors: Parikshit Gopalan, Raghu Meka, Omer Reingold, Luca Trevisan, Salil Vadhan

    Abstract: We present an iterative approach to constructing pseudorandom generators, based on the repeated application of mild pseudorandom restrictions. We use this template to construct pseudorandom generators for combinatorial rectangles and read-once CNFs and a hitting set generator for width-3 branching programs, all of which achieve near-optimal seed-length even in the low-error regime: We get seed-len…

    Submitted 28 September, 2012; originally announced October 2012.

    Comments: To appear in FOCS 2012

    MSC Class: 68Q17

  30. arXiv:1111.0405  [pdf, ps, other]

    cs.CC

    Making the long code shorter, with applications to the Unique Games Conjecture

    Authors: Boaz Barak, Parikshit Gopalan, Johan Hastad, Raghu Meka, Prasad Raghavendra, David Steurer

    Abstract: The long code is a central tool in hardness of approximation, especially in questions related to the unique games conjecture. We construct a new code that is exponentially more efficient, but can still be used in many of these applications. Using the new code we obtain exponential improvements over several known results, including the following: 1. For any eps > 0, we show the existence of an n…

    Submitted 2 November, 2011; originally announced November 2011.

    Comments: 45 pages

    MSC Class: 68Q15

  31. arXiv:1106.3625  [pdf, ps, other]

    cs.IT cs.CC cs.DM

    On the Locality of Codeword Symbols

    Authors: Parikshit Gopalan, Cheng Huang, Huseyin Simitci, Sergey Yekhanin

    Abstract: Consider a linear [n,k,d]_q code C. We say that the i-th coordinate of C has locality r, if the value at this coordinate can be recovered from accessing some other r coordinates of C. Data storage applications require codes with small redundancy, low locality for information coordinates, large distance, and low locality for parity coordinates. In this paper we carry out an in-depth study of the r…

    Submitted 18 June, 2011; originally announced June 2011.

  32. arXiv:1008.3187  [pdf, ps, other]

    cs.DS cs.CC cs.LG

    Polynomial-Time Approximation Schemes for Knapsack and Related Counting Problems using Branching Programs

    Authors: Parikshit Gopalan, Adam Klivans, Raghu Meka

    Abstract: We give a deterministic, polynomial-time algorithm for approximately counting the number of {0,1}-solutions to any instance of the knapsack problem. On an instance of length n with total weight W and accuracy parameter eps, our algorithm produces a (1 + eps)-multiplicative approximation in time poly(n,log W,1/eps). We also give algorithms with identical guarantees for general integer knapsack, the…

    Submitted 18 August, 2010; originally announced August 2010.

  33. arXiv:1001.1593  [pdf, ps, other]

    cs.CC

    Fooling functions of halfspaces under product distributions

    Authors: P. Gopalan, R. O'Donnell, Y. Wu, D. Zuckerman

    Abstract: We construct pseudorandom generators that fool functions of halfspaces (threshold functions) under a very broad class of product distributions. This class includes not only familiar cases such as the uniform distribution on the discrete cube, the uniform distribution on the solid cube, and the multivariate Gaussian distribution, but also includes any product of discrete distributions with probab…

    Submitted 11 January, 2010; originally announced January 2010.

  34. arXiv:0902.3757  [pdf, ps, other]

    cs.CC

    Bounded Independence Fools Halfspaces

    Authors: Ilias Diakonikolas, Parikshit Gopalan, Ragesh Jaiswal, Rocco Servedio, Emanuele Viola

    Abstract: We show that any distribution on {-1,1}^n that is k-wise independent fools any halfspace h with error \eps for k = O(\log^2(1/\eps) /\eps^2). Up to logarithmic factors, our result matches a lower bound by Benjamini, Gurel-Gurevich, and Peled (2007) showing that k = Ω(1/(\eps^2 \cdot \log(1/\eps))). Using standard constructions of k-wise independent distributions, we obtain the first explicit pse…

    Submitted 21 February, 2009; originally announced February 2009.

  35. arXiv:0811.4395  [pdf, ps, other]

    cs.IT

    List Decoding Tensor Products and Interleaved Codes

    Authors: Parikshit Gopalan, Venkatesan Guruswami, Prasad Raghavendra

    Abstract: We design the first efficient algorithms and prove new combinatorial bounds for list decoding tensor products of codes and interleaved codes. We show that for {\em every} code, the ratio of its list decoding radius to its minimum distance stays unchanged under the tensor product operation (rather than squaring, as one might expect). This gives the first efficient list decoders and new combinator…

    Submitted 26 November, 2008; originally announced November 2008.

    Comments: 32 pages

    ACM Class: E.4; F.2.2

  36. arXiv:cs/0609072  [pdf, ps, other]

    cs.CC

    The Connectivity of Boolean Satisfiability: Computational and Structural Dichotomies

    Authors: Parikshit Gopalan, Phokion G. Kolaitis, Elitza Maneva, Christos H. Papadimitriou

    Abstract: Boolean satisfiability problems are an important benchmark for questions about complexity, algorithms, heuristics and threshold phenomena. Recent work on heuristics, and the satisfiability threshold has centered around the structure and connectivity of the solution space. Motivated by this work, we study structural and connectivity-related properties of the space of solutions of Boolean satisfia…

    Submitted 3 October, 2007; v1 submitted 13 September, 2006; originally announced September 2006.

    Journal ref: Extended abstract in Proceedings of ICALP 2006, pp 346-357

  37. arXiv:cs/0106055  [pdf, ps]

    cs.DB

    A Seamless Integration of Association Rule Mining with Database Systems

    Authors: Raj P. Gopalan, Tariq Nuruddin, Yudho Giri Sucahyo

    Abstract: The need for Knowledge and Data Discovery Management Systems (KDDMS) that support ad hoc data mining queries has been long recognized. A significant amount of research has gone into building tightly coupled systems that integrate association rule mining with database systems. In this paper, we describe a seamless integration scheme for database queries and association rule discovery using a comm…

    Submitted 28 June, 2001; originally announced June 2001.

    Comments: 15 pages

    ACM Class: H.2.8