Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Banh, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.06113  [pdf, other

    cs.LG cs.CR

    Generate synthetic samples from tabular data

    Authors: David Banh, Alan Huang

    Abstract: Generating new samples from data sets can mitigate extra expensive operations, increased invasive procedures, and mitigate privacy issues. These novel samples that are statistically robust can be used as a temporary and intermediate replacement when privacy is a concern. This method can enable better data sharing practices without problems relating to identification issues or biases that are flaws… ▽ More

    Submitted 22 December, 2022; v1 submitted 11 September, 2022; originally announced September 2022.

  2. arXiv:2201.08233  [pdf, other

    cs.LG

    Encoding large information structures in linear algebra and statistical models

    Authors: David Banh, Alan Huang

    Abstract: Large information sizes in samples and features can be encoded to speed up the learning of statistical models based on linear algebra and remove unwanted signals. Encoding information can reduce both sample and feature dimension to a smaller representational set. Here two examples are shown on linear mixed models and mixture models speeding up the run time for parameter estimation by a factor defi… ▽ More

    Submitted 22 June, 2022; v1 submitted 15 January, 2022; originally announced January 2022.