Zum Hauptinhalt springen

Showing 1–35 of 35 results for author: Farnia, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02961  [pdf, other

    cs.LG cs.AI

    Towards a Scalable Reference-Free Evaluation of Generative Models

    Authors: Azim Ospanov, Jingwei Zhang, Mohammad Jalali, Xuenan Cao, Andrej Bogdanov, Farzan Farnia

    Abstract: While standard evaluation scores for generative models are mostly reference-based, a reference-dependent assessment of generative models could be generally difficult due to the unavailability of applicable reference datasets. Recently, the reference-free entropy scores, VENDI and RKE, have been proposed to evaluate the diversity of generated data. However, estimating these scores from data leads t… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.07451  [pdf, other

    cs.LG

    An Optimism-based Approach to Online Evaluation of Generative Models

    Authors: Xiaoyan Hu, Ho-fung Leung, Farzan Farnia

    Abstract: Existing frameworks for evaluating and comparing generative models typically target an offline setting, where the evaluator has access to full batches of data produced by the models. However, in many practical scenarios, the goal is to identify the best model using the fewest generated samples to minimize the costs of querying data from the models. Such an online comparison is challenging with cur… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: arXiv version

  3. arXiv:2406.07017  [pdf, other

    cs.LG cs.CL

    MoreauPruner: Robust Pruning of Large Language Models against Weight Perturbations

    Authors: Zixiao Wang, Jingwei Zhang, Wenqian Zhao, Farzan Farnia, Bei Yu

    Abstract: Few-shot gradient methods have been extensively utilized in existing model pruning methods, where the model weights are regarded as static values and the effects of potential weight perturbations are not considered. However, the widely used large language models (LLMs) have several billion model parameters, which could increase the fragility of few-shot gradient pruning. In this work, we experimen… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.02017  [pdf, other

    cs.LG stat.ML

    On the Mode-Seeking Properties of Langevin Dynamics

    Authors: Xiwei Cheng, Kexin Fu, Farzan Farnia

    Abstract: The Langevin Dynamics framework, which aims to generate samples from the score function of a probability distribution, is widely used for analyzing and interpreting score-based generative modeling. While the convergence behavior of Langevin Dynamics under unimodal distributions has been extensively studied in the literature, in practice the data distribution could consist of multiple distinct mode… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2405.07489  [pdf, other

    cs.LG cs.CV

    Sparse Domain Transfer via Elastic Net Regularization

    Authors: Jingwei Zhang, Farzan Farnia

    Abstract: Transportation of samples across different domains is a central task in several machine learning problems. A sensible requirement for domain transfer tasks in computer vision and language domains is the sparsity of the transportation map, i.e., the transfer algorithm aims to modify the least number of input features while transporting samples across the source and target domains. In this work, we… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.02700  [pdf, other

    cs.LG cs.CV

    Identification of Novel Modes in Generative Models via Fourier-based Differential Clustering

    Authors: Jingwei Zhang, Mohammad Jalali, Cheuk Ting Li, Farzan Farnia

    Abstract: An interpretable comparison of generative models requires the identification of sample types produced more frequently by each of the involved models. While several quantitative scores have been proposed in the literature to rank different generative models, such score-based evaluations do not reveal the nuanced differences between the generative models in capturing various sample types. In this wo… ▽ More

    Submitted 4 July, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  7. arXiv:2404.08980  [pdf, other

    cs.LG stat.ML

    Stability and Generalization in Free Adversarial Training

    Authors: Xiwei Cheng, Kexin Fu, Farzan Farnia

    Abstract: While adversarial training methods have resulted in significant improvements in the deep neural nets' robustness against norm-bounded adversarial perturbations, their generalization performance from training samples to test data has been shown to be considerably worse than standard empirical risk minimization methods. Several recent studies seek to connect the generalization behavior of adversaria… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  8. arXiv:2404.04647  [pdf, other

    cs.CV

    Structured Gradient-based Interpretations via Norm-Regularized Adversarial Training

    Authors: Shizhan Gong, Qi Dou, Farzan Farnia

    Abstract: Gradient-based saliency maps have been widely used to explain the decisions of deep neural network classifiers. However, standard gradient-based interpretation maps, including the simple gradient and integrated gradient algorithms, often lack desired structures such as sparsity and connectedness in their application to real-world computer vision models. A frequently used approach to inducing spars… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  9. arXiv:2403.15434  [pdf, other

    cs.CL cs.AI

    ChatPattern: Layout Pattern Customization via Natural Language

    Authors: Zixiao Wang, Yunheng Shen, Xufeng Yao, Wenqian Zhao, Yang Bai, Farzan Farnia, Bei Yu

    Abstract: Existing works focus on fixed-size layout pattern generation, while the more practical free-size pattern generation receives limited attention. In this paper, we propose ChatPattern, a novel Large-Language-Model (LLM) powered framework for flexible pattern customization. ChatPattern utilizes a two-part system featuring an expert LLM agent and a highly controllable layout pattern generator. The LLM… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by DAC24

  10. arXiv:2402.18129  [pdf, other

    cs.LG cs.AI cs.IT

    On the Inductive Biases of Demographic Parity-based Fair Learning Algorithms

    Authors: Haoyu Lei, Amin Gohari, Farzan Farnia

    Abstract: Fair supervised learning algorithms assigning labels with little dependence on a sensitive attribute have attracted great attention in the machine learning community. While the demographic parity (DP) notion has been frequently used to measure a model's fairness in training fair classifiers, several studies in the literature suggest potential impacts of enforcing DP in fair learning algorithms. In… ▽ More

    Submitted 20 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  11. arXiv:2402.17287  [pdf, other

    cs.LG cs.CV stat.ML

    An Interpretable Evaluation of Entropy-based Novelty of Generative Models

    Authors: Jingwei Zhang, Cheuk Ting Li, Farzan Farnia

    Abstract: The massive developments of generative model frameworks require principled methods for the evaluation of a model's novelty compared to a reference dataset. While the literature has extensively studied the evaluation of the quality, diversity, and generalizability of generative models, the assessment of a model's novelty compared to a reference model has not been adequately explored in the machine… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  12. arXiv:2401.05015  [pdf, other

    cs.LG

    An Information Theoretic Approach to Interaction-Grounded Learning

    Authors: Xiaoyan Hu, Farzan Farnia, Ho-fung Leung

    Abstract: Reinforcement learning (RL) problems where the learner attempts to infer an unobserved reward from some feedback variables have been studied in several recent papers. The setting of Interaction-Grounded Learning (IGL) is an example of such feedback-based RL tasks where the learner optimizes the return by inferring latent binary rewards from the interaction with the environment. In the IGL setting,… ▽ More

    Submitted 2 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  13. arXiv:2311.11965  [pdf, other

    cs.LG stat.ML

    Provably Efficient CVaR RL in Low-rank MDPs

    Authors: Yulai Zhao, Wenhao Zhan, Xiaoyan Hu, Ho-fung Leung, Farzan Farnia, Wen Sun, Jason D. Lee

    Abstract: We study risk-sensitive Reinforcement Learning (RL), where we aim to maximize the Conditional Value at Risk (CVaR) with a fixed risk tolerance $τ$. Prior theoretical work studying risk-sensitive RL focuses on the tabular Markov Decision Processes (MDPs) setting. To extend CVaR RL to settings where state space is large, function approximation must be deployed. We study CVaR RL in low-rank MDPs with… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: The first three authors contribute equally and are ordered randomly

  14. arXiv:2310.11714  [pdf, other

    cs.LG

    On the Distributed Evaluation of Generative Models

    Authors: Zixiao Wang, Farzan Farnia, Zhenghao Lin, Yunheng Shen, Bei Yu

    Abstract: The evaluation of deep generative models has been extensively studied in the centralized setting, where the reference data are drawn from a single probability distribution. On the other hand, several applications of generative models concern distributed settings, e.g. the federated learning setting, where the reference data for conducting evaluation are provided by several clients in a network. In… ▽ More

    Submitted 11 June, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 26 pages, 22 figures

  15. arXiv:2303.13060  [pdf, other

    cs.CV cs.AR

    DiffPattern: Layout Pattern Generation via Discrete Diffusion

    Authors: Zixiao Wang, Yunheng Shen, Wenqian Zhao, Yang Bai, Guojin Chen, Farzan Farnia, Bei Yu

    Abstract: Deep generative models dominate the existing literature in layout pattern generation. However, leaving the guarantee of legality to an inexplicable neural network could be problematic in several applications. In this paper, we propose \tool{DiffPattern} to generate reliable layout patterns. \tool{DiffPattern} introduces a novel diverse topology generation method via a discrete diffusion model with… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: DAC2023 Accepted

  16. arXiv:2302.05294  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    MoreauGrad: Sparse and Robust Interpretation of Neural Networks via Moreau Envelope

    Authors: Jingwei Zhang, Farzan Farnia

    Abstract: Explaining the predictions of deep neural nets has been a topic of great interest in the computer vision literature. While several gradient-based interpretation schemes have been proposed to reveal the influential variables in a neural net's prediction, standard gradient-based interpretation frameworks have been commonly observed to lack robustness to input perturbations and flexibility for incorp… ▽ More

    Submitted 8 January, 2023; originally announced February 2023.

  17. arXiv:2212.03095  [pdf, other

    cs.CV cs.AI cs.CR cs.LG stat.ML

    Interpretation of Neural Networks is Susceptible to Universal Adversarial Perturbations

    Authors: Haniyeh Ehsani Oskouie, Farzan Farnia

    Abstract: Interpreting neural network classifiers using gradient-based saliency maps has been extensively studied in the deep learning literature. While the existing algorithms manage to achieve satisfactory performance in application to standard image recognition datasets, recent works demonstrate the vulnerability of widely-used gradient-based interpretation schemes to norm-bounded perturbations adversari… ▽ More

    Submitted 20 April, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

  18. arXiv:2210.15997  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Universal Adversarial Directions

    Authors: Ching Lam Choi, Farzan Farnia

    Abstract: Despite their great success in image recognition tasks, deep neural networks (DNNs) have been observed to be susceptible to universal adversarial perturbations (UAPs) which perturb all input samples with a single perturbation vector. However, UAPs often struggle in transferring across DNN architectures and lead to challenging optimization problems. In this work, we study the transferability of UAP… ▽ More

    Submitted 16 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

  19. arXiv:2207.00957  [pdf, other

    math.OC cs.LG stat.ML

    On Convergence of Gradient Descent Ascent: A Tight Local Analysis

    Authors: Haochuan Li, Farzan Farnia, Subhro Das, Ali Jadbabaie

    Abstract: Gradient Descent Ascent (GDA) methods are the mainstream algorithms for minimax optimization in generative adversarial networks (GANs). Convergence properties of GDA have drawn significant interest in the recent literature. Specifically, for $\min_{\mathbf{x}} \max_{\mathbf{y}} f(\mathbf{x};\mathbf{y})$ where $f$ is strongly-concave in $\mathbf{y}$ and possibly nonconvex in $\mathbf{x}$, (Lin et a… ▽ More

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: Accepted by ICML 2022

  20. arXiv:2206.09238  [pdf, other

    cs.LG stat.ML

    On the Role of Generalization in Transferability of Adversarial Examples

    Authors: Yilin Wang, Farzan Farnia

    Abstract: Black-box adversarial attacks designing adversarial examples for unseen neural networks (NNs) have received great attention over the past years. While several successful black-box attack schemes have been proposed in the literature, the underlying factors driving the transferability of black-box adversarial examples still lack a thorough understanding. In this paper, we aim to demonstrate the role… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

  21. arXiv:2206.02468  [pdf, ps, other

    cs.LG cs.AI stat.ML

    An Optimal Transport Approach to Personalized Federated Learning

    Authors: Farzan Farnia, Amirhossein Reisizadeh, Ramtin Pedarsani, Ali Jadbabaie

    Abstract: Federated learning is a distributed machine learning paradigm, which aims to train a model using the local data of many distributed clients. A key challenge in federated learning is that the data samples across the clients may not be identically distributed. To address this challenge, personalized federated learning with the goal of tailoring the learned model to the data distribution of every ind… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  22. arXiv:2106.10324  [pdf, other

    cs.LG stat.ML

    Group-Structured Adversarial Training

    Authors: Farzan Farnia, Amirali Aghazadeh, James Zou, David Tse

    Abstract: Robust training methods against perturbations to the input data have received great attention in the machine learning literature. A standard approach in this direction is adversarial training which learns a model using adversarially-perturbed training samples. However, adversarial training performs suboptimally against perturbations structured across samples such as universal and group-sparse shif… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  23. arXiv:2106.07537  [pdf, other

    stat.ML cs.LG math.OC

    A Wasserstein Minimax Framework for Mixed Linear Regression

    Authors: Theo Diamandis, Yonina C. Eldar, Alireza Fallah, Farzan Farnia, Asuman Ozdaglar

    Abstract: Multi-modal distributions are commonly used to model clustered data in statistical learning tasks. In this paper, we consider the Mixed Linear Regression (MLR) problem. We propose an optimal transport-based framework for MLR problems, Wasserstein Mixed Linear Regression (WMLR), which minimizes the Wasserstein distance between the learned and target mixture regression models. Through a model-based… ▽ More

    Submitted 16 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: To appear in 38th International Conference on Machine Learning (ICML 2021)

  24. arXiv:2010.12561  [pdf, other

    cs.LG math.OC stat.ML

    Train simultaneously, generalize better: Stability of gradient-based minimax learners

    Authors: Farzan Farnia, Asuman Ozdaglar

    Abstract: The success of minimax learning problems of generative adversarial networks (GANs) has been observed to depend on the minimax optimization algorithm used for their training. This dependence is commonly attributed to the convergence speed and robustness properties of the underlying optimization algorithm. In this paper, we show that the optimization algorithm also plays a key role in the generaliza… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  25. arXiv:2006.10293  [pdf, other

    cs.LG stat.ML

    GAT-GMM: Generative Adversarial Training for Gaussian Mixture Models

    Authors: Farzan Farnia, William Wang, Subhro Das, Ali Jadbabaie

    Abstract: Generative adversarial networks (GANs) learn the distribution of observed samples through a zero-sum game between two machine players, a generator and a discriminator. While GANs achieve great success in learning the complex distribution of image, sound, and text data, they perform suboptimally in learning multi-modal distribution-learning benchmarks including Gaussian mixture models (GMMs). In th… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  26. arXiv:2006.08907  [pdf, other

    cs.LG math.OC stat.ML

    Robust Federated Learning: The Case of Affine Distribution Shifts

    Authors: Amirhossein Reisizadeh, Farzan Farnia, Ramtin Pedarsani, Ali Jadbabaie

    Abstract: Federated learning is a distributed paradigm that aims at training models using samples distributed across multiple users in a network while keeping the samples on users' devices with the aim of efficiency and protecting users privacy. In such settings, the training data is often statistically heterogeneous and manifests various distribution shifts across users, which degrades the performance of t… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  27. arXiv:2002.09124  [pdf, other

    cs.LG cs.GT stat.ML

    GANs May Have No Nash Equilibria

    Authors: Farzan Farnia, Asuman Ozdaglar

    Abstract: Generative adversarial networks (GANs) represent a zero-sum game between two machine players, a generator and a discriminator, designed to learn the distribution of data. While GANs have achieved state-of-the-art performance in several benchmark learning tasks, GAN minimax optimization still poses great theoretical and empirical challenges. GANs trained using first-order optimization methods commo… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  28. arXiv:1811.07457  [pdf, other

    cs.LG stat.ML

    Generalizable Adversarial Training via Spectral Normalization

    Authors: Farzan Farnia, Jesse M. Zhang, David Tse

    Abstract: Deep neural networks (DNNs) have set benchmarks on a wide array of supervised learning tasks. Trained DNNs, however, often lack robustness to minor adversarial perturbations to the input, which undermines their true practicality. Recent works have increased the robustness of DNNs by fitting networks using adversarially-perturbed training samples, but the improved performance can still be far below… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

  29. arXiv:1810.11740  [pdf, other

    cs.LG stat.ML

    A Convex Duality Framework for GANs

    Authors: Farzan Farnia, David Tse

    Abstract: Generative adversarial network (GAN) is a minimax game between a generator mimicking the true model and a discriminator distinguishing the samples produced by the generator from the real training samples. Given an unconstrained discriminator able to approximate any function, this game reduces to finding the generative model minimizing a divergence measure, e.g. the Jensen-Shannon (JS) divergence,… ▽ More

    Submitted 27 October, 2018; originally announced October 2018.

  30. arXiv:1710.10793  [pdf, other

    stat.ML cs.IT cs.LG

    Understanding GANs: the LQG Setting

    Authors: Soheil Feizi, Farzan Farnia, Tony Ginart, David Tse

    Abstract: Generative Adversarial Networks (GANs) have become a popular method to learn a probability model from data. In this paper, we aim to provide an understanding of some of the basic issues surrounding GANs including their formulation, generalization and stability on a simple benchmark where the data has a high-dimensional Gaussian distribution. Even in this simple benchmark, the GAN problem has not b… ▽ More

    Submitted 22 October, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

  31. arXiv:1606.02206  [pdf, other

    stat.ML cs.IT cs.LG

    A Minimax Approach to Supervised Learning

    Authors: Farzan Farnia, David Tse

    Abstract: Given a task of predicting $Y$ from $X$, a loss function $L$, and a set of probability distributions $Γ$ on $(X,Y)$, what is the optimal decision rule minimizing the worst-case expected loss over $Γ$? In this paper, we address this question by introducing a generalization of the principle of maximum entropy. Applying this principle to sets of distributions with marginal on $X$ constrained to be th… ▽ More

    Submitted 3 July, 2017; v1 submitted 7 June, 2016; originally announced June 2016.

  32. arXiv:1511.01764  [pdf, ps, other

    cs.LG

    Discrete Rényi Classifiers

    Authors: Meisam Razaviyayn, Farzan Farnia, David Tse

    Abstract: Consider the binary classification problem of predicting a target variable $Y$ from a discrete feature vector $X = (X_1,...,X_d)$. When the probability distribution $\mathbb{P}(X,Y)$ is known, the optimal classifier, leading to the minimum misclassification rate, is given by the Maximum A-posteriori Probability decision rule. However, estimating the complete joint distribution $\mathbb{P}(X,Y)$ is… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

  33. arXiv:1504.06010  [pdf, ps, other

    cs.IT

    Minimum HGR Correlation Principle: From Marginals to Joint Distribution

    Authors: Farzan Farnia, Meisam Razaviyayn, Sreeram Kannan, David Tse

    Abstract: Given low order moment information over the random variables $\mathbf{X} = (X_1,X_2,\ldots,X_p)$ and $Y$, what distribution minimizes the Hirschfeld-Gebelein-Rényi (HGR) maximal correlation coefficient between $\mathbf{X}$ and $Y$, while remains faithful to the given moments? The answer to this question is important especially in order to fit models over $(\mathbf{X},Y)$ with minimum dependence am… ▽ More

    Submitted 22 April, 2015; originally announced April 2015.

  34. arXiv:1405.1156  [pdf, ps, other

    cs.IT

    Near Optimal Energy Control and Approximate Capacity of Energy Harvesting Communication

    Authors: Yishun Dong, Farzan Farnia, Ayfer Özgür

    Abstract: We consider an energy-harvesting communication system where a transmitter powered by an exogenous energy arrival process and equipped with a finite battery of size $B_{max}$ communicates over a discrete-time AWGN channel. We first concentrate on a simple Bernoulli energy arrival process where at each time step, either an energy packet of size $E$ is harvested with probability $p$, or no energy is… ▽ More

    Submitted 8 January, 2015; v1 submitted 6 May, 2014; originally announced May 2014.

    Comments: To appear in JSAC Special Issue on Wireless Communications Powered by Energy Harvesting and Wireless Energy Transfer. A shorter version presented at ISIT 2014

  35. arXiv:1304.7344  [pdf, ps, other

    cs.IT

    On feedback in Gaussian multi-hop networks

    Authors: Bobbie Chern, Farzan Farnia, Ayfer Özgür

    Abstract: The study of feedback has been mostly limited to single-hop communication settings. In this paper, we consider Gaussian networks where sources and destinations can communicate with the help of intermediate relays over multiple hops. We assume that links in the network can be bidirected providing opportunities for feedback. We ask the following question: can the information transfer in both directi… ▽ More

    Submitted 15 July, 2014; v1 submitted 27 April, 2013; originally announced April 2013.

    Comments: 16 pages; Submitted to Transactions on Information Theory in July 2014