Zum Hauptinhalt springen

Showing 1–42 of 42 results for author: Mirzasoleiman, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.19580  [pdf, other

    cs.LG cs.AI cs.CL

    Memory-efficient Training of LLMs with Larger Mini-batches

    Authors: Dang Nguyen, Wenhan Yang, Rathul Anand, Yu Yang, Baharan Mirzasoleiman

    Abstract: Training with larger mini-batches improves the performance and convergence rate of training machine learning models. However, training with large mini-batches becomes prohibitive for Large Language Models (LLMs) with billions of parameters, due to the large GPU memory requirement. To address this problem, we propose finding small mini-batches that simulate the dynamics of training with larger mini… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures, 4 tables

  2. arXiv:2404.17768  [pdf, other

    cs.LG cs.AI cs.CV

    Make the Most of Your Data: Changing the Training Data Distribution to Improve In-distribution Generalization Performance

    Authors: Dang Nguyen, Paymon Haddad, Eric Gan, Baharan Mirzasoleiman

    Abstract: Can we modify the training data distribution to encourage the underlying optimization method toward finding solutions with superior generalization performance on in-distribution data? In this work, we approach this question for the first time by comparing the inductive bias of gradient descent (GD) with that of sharpness-aware minimization (SAM). By studying a two-layer CNN, we prove that SAM lear… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 32 pages, 11 figures, 6 tables

  3. arXiv:2403.12267  [pdf, other

    cs.CV cs.LG

    Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity

    Authors: Siddharth Joshi, Arnav Jain, Ali Payani, Baharan Mirzasoleiman

    Abstract: Contrastive Language-Image Pre-training (CLIP) on large-scale image-caption datasets learns representations that can achieve remarkable zero-shot generalization. However, such models require a massive amount of pre-training data. Improving the quality of the pre-training data has been shown to be much more effective in improving CLIP's performance than increasing its volume. Nevertheless, finding… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: AISTATS 2024, Code: https://github.com/BigML-CS-UCLA/clipcov-data-efficient-clip

  4. arXiv:2403.11391  [pdf, other

    cs.LG cs.CV

    Investigating the Benefits of Projection Head for Representation Learning

    Authors: Yihao Xue, Eric Gan, Jiayi Ni, Siddharth Joshi, Baharan Mirzasoleiman

    Abstract: An effective technique for obtaining high-quality representations is adding a projection head on top of the encoder during training, then discarding it and using the pre-projection representations. Despite its proven practical effectiveness, the reason behind the success of this technique is poorly understood. The pre-projection representations are not directly optimized by the loss function, rais… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Journal ref: ICLR 2024

  5. arXiv:2403.07384  [pdf, other

    cs.CL cs.AI cs.LG

    SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

    Authors: Yu Yang, Siddhartha Mishra, Jeffrey N Chiang, Baharan Mirzasoleiman

    Abstract: Despite the effectiveness of data selection for large language models (LLMs) during pretraining and instruction fine-tuning phases, improving data efficiency in supervised fine-tuning (SFT) for specialized domains poses significant challenges due to the complexity of fine-tuning data. To bridge this gap, we introduce an effective and scalable data selection method for SFT, SmallToLarge (S2L), whic… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  6. arXiv:2403.03241  [pdf, other

    cs.NI

    NeWRF: A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction

    Authors: Haofan Lu, Christopher Vattheuer, Baharan Mirzasoleiman, Omid Abari

    Abstract: We present NeWRF, a deep learning framework for predicting wireless channels. Wireless channel prediction is a long-standing problem in the wireless community and is a key technology for improving the coverage of wireless network deployments. Today, a wireless deployment is evaluated by a site survey which is a cumbersome process requiring an experienced engineer to perform extensive channel measu… ▽ More

    Submitted 14 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2311.06839  [pdf, other

    cs.LG cs.CR

    Inference and Interference: The Role of Clipping, Pruning and Loss Landscapes in Differentially Private Stochastic Gradient Descent

    Authors: Lauren Watson, Eric Gan, Mohan Dantam, Baharan Mirzasoleiman, Rik Sarkar

    Abstract: Differentially private stochastic gradient descent (DP-SGD) is known to have poorer training and test performance on large neural networks, compared to ordinary stochastic gradient descent (SGD). In this paper, we perform a detailed study and comparison of the two processes and unveil several new insights. By comparing the behavior of the two processes separately in early and late epochs, we find… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  8. arXiv:2310.06982  [pdf, other

    cs.CV cs.LG

    Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality

    Authors: Xuxi Chen, Yu Yang, Zhangyang Wang, Baharan Mirzasoleiman

    Abstract: Dataset distillation aims to minimize the time and memory needed for training deep networks on large datasets, by creating a small set of synthetic images that has a similar generalization performance to that of the full dataset. However, current dataset distillation techniques fall short, showing a notable performance gap when compared to training on the original data. In this work, we are the fi… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Preprint

  9. arXiv:2310.05862  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks

    Authors: Wenhan Yang, Jingdong Gao, Baharan Mirzasoleiman

    Abstract: Contrastive Language-Image Pre-training (CLIP) on large image-caption datasets has achieved remarkable success in zero-shot classification and enabled transferability to new domains. However, CLIP is extremely more vulnerable to targeted data poisoning and backdoor attacks, compared to supervised learning. Perhaps surprisingly, poisoning 0.0001% of CLIP pre-training data is enough to make targeted… ▽ More

    Submitted 10 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  10. arXiv:2310.04971  [pdf, other

    cs.LG

    Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift

    Authors: Yihao Xue, Siddharth Joshi, Dang Nguyen, Baharan Mirzasoleiman

    Abstract: Recently, multimodal contrastive learning (MMCL) approaches, such as CLIP, have achieved a remarkable success in learning representations that are robust against distribution shift and generalize to new domains. Despite the empirical success, the mechanism behind learning such generalizable representations is not understood. In this work, we rigorously analyze this problem and uncover two mechanis… ▽ More

    Submitted 17 March, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  11. arXiv:2306.15848  [pdf, other

    cs.LG math.OC

    Ordering for Non-Replacement SGD

    Authors: Yuetong Xu, Baharan Mirzasoleiman

    Abstract: One approach for reducing run time and improving efficiency of machine learning is to reduce the convergence rate of the optimization algorithm used. Shuffling is an algorithm technique that is widely used in machine learning, but it only started to gain attention theoretically in recent years. With different convergence rates developed for random shuffling and incremental gradient descent, we see… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  12. arXiv:2306.11957  [pdf, other

    cs.LG

    Towards Mitigating Spurious Correlations in the Wild: A Benchmark and a more Realistic Dataset

    Authors: Siddharth Joshi, Yu Yang, Yihao Xue, Wenhan Yang, Baharan Mirzasoleiman

    Abstract: Deep neural networks often exploit non-predictive features that are spuriously correlated with class labels, leading to poor performance on groups of examples without such features. Despite the growing body of recent works on remedying spurious correlations, the lack of a standardized benchmark hinders reproducible evaluation and comparison of the proposed solutions. To address this, we present Sp… ▽ More

    Submitted 29 September, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Package: https://github.com/BigML-CS-UCLA/SpuCo

  13. arXiv:2306.04949  [pdf, other

    cs.LG

    Robust Learning with Progressive Data Expansion Against Spurious Correlation

    Authors: Yihe Deng, Yu Yang, Baharan Mirzasoleiman, Quanquan Gu

    Abstract: While deep learning models have shown remarkable performance in various tasks, they are susceptible to learning non-generalizable spurious features rather than the core features that are genuinely correlated to the true label. In this paper, beyond existing analyses of linear models, we theoretically examine the learning process of a two-layer nonlinear convolutional neural network in the presence… ▽ More

    Submitted 28 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 22 pages, 7 figures, 11 tables. In NeurIPS 2023

  14. arXiv:2306.01244  [pdf, other

    cs.LG

    Towards Sustainable Learning: Coresets for Data-efficient Deep Learning

    Authors: Yu Yang, Hao Kang, Baharan Mirzasoleiman

    Abstract: To improve the efficiency and sustainability of learning deep models, we propose CREST, the first scalable framework with rigorous theoretical guarantees to identify the most valuable examples for training non-convex models, particularly deep networks. To guarantee convergence to a stationary point of a non-convex function, CREST models the non-convex loss as a series of quadratic functions and ex… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  15. arXiv:2305.18761  [pdf, other

    cs.LG cs.CV

    Identifying Spurious Biases Early in Training through the Lens of Simplicity Bias

    Authors: Yu Yang, Eric Gan, Gintare Karolina Dziugaite, Baharan Mirzasoleiman

    Abstract: Neural networks trained with (stochastic) gradient descent have an inductive bias towards learning simpler solutions. This makes them highly prone to learning spurious correlations in the training data, that may not hold at test time. In this work, we provide the first theoretical analysis of the effect of simplicity bias on learning spurious correlations. Notably, we show that examples with spuri… ▽ More

    Submitted 6 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 26 pages, 10 figures

    Journal ref: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024, Valencia, Spain. PMLR: Volume 238

  16. arXiv:2305.16536  [pdf, ps, other

    cs.LG stat.ML

    Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression

    Authors: Yihao Xue, Siddharth Joshi, Eric Gan, Pin-Yu Chen, Baharan Mirzasoleiman

    Abstract: Contrastive learning (CL) has emerged as a powerful technique for representation learning, with or without label supervision. However, supervised CL is prone to collapsing representations of subclasses within a class by not capturing all their features, and unsupervised CL may suppress harder class-relevant features by focusing on learning easy class-irrelevant features; both significantly comprom… ▽ More

    Submitted 28 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: to appear at ICML 2023

  17. arXiv:2305.14521  [pdf, ps, other

    cs.LG cs.CL cs.CV

    Few-shot Adaptation to Distribution Shifts By Mixing Source and Target Embeddings

    Authors: Yihao Xue, Ali Payani, Yu Yang, Baharan Mirzasoleiman

    Abstract: Pretrained machine learning models need to be adapted to distribution shifts when deployed in new target environments. When obtaining labeled data from the target distribution is expensive, few-shot adaptation with only a few examples from the target distribution becomes essential. In this work, we propose MixPro, a lightweight and highly data-efficient approach for few-shot adaptation. MixPro fir… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  18. arXiv:2304.03916  [pdf, other

    cs.LG cs.AI

    Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning

    Authors: Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman

    Abstract: Spurious correlations that degrade model generalization or lead the model to be right for the wrong reasons are one of the main robustness concerns for real-world deployments. However, mitigating these correlations during pre-training for large-scale models can be costly and impractical, particularly for those without access to high-performance computing resources. This paper proposes a novel appr… ▽ More

    Submitted 30 May, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  19. arXiv:2303.14267  [pdf, other

    cs.LG

    A Self-supervised Framework for Improved Data-Driven Monitoring of Stress via Multi-modal Passive Sensing

    Authors: Shayan Fazeli, Lionel Levine, Mehrab Beikzadeh, Baharan Mirzasoleiman, Bita Zadeh, Tara Peris, Majid Sarrafzadeh

    Abstract: Recent advances in remote health monitoring systems have significantly benefited patients and played a crucial role in improving their quality of life. However, while physiological health-focused solutions have demonstrated increasing success and maturity, mental health-focused applications have seen comparatively limited success in spite of the fact that stress and anxiety disorders are among the… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  20. arXiv:2303.11937  [pdf, other

    cs.DS cs.LG math.OC

    High Probability Bounds for Stochastic Continuous Submodular Maximization

    Authors: Evan Becker, Jingdong Gao, Ted Zadouri, Baharan Mirzasoleiman

    Abstract: We consider maximization of stochastic monotone continuous submodular functions (CSF) with a diminishing return property. Existing algorithms only guarantee the performance \textit{in expectation}, and do not bound the probability of getting a bad solution. This implies that for a particular run of the algorithms, the solution may be much worse than the provided guarantee in expectation. In this p… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023

  21. arXiv:2303.06854  [pdf, other

    cs.CV cs.CL cs.CR cs.LG

    Robust Contrastive Language-Image Pre-training against Data Poisoning and Backdoor Attacks

    Authors: Wenhan Yang, Jingdong Gao, Baharan Mirzasoleiman

    Abstract: Contrastive vision-language representation learning has achieved state-of-the-art performance for zero-shot classification, by learning from millions of image-caption pairs crawled from the internet. However, the massive data that powers large multimodal models such as CLIP, makes them extremely vulnerable to various types of targeted data poisoning and backdoor attacks. Despite this vulnerability… ▽ More

    Submitted 19 December, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  22. arXiv:2303.06344  [pdf, other

    cs.LG

    Graph Contrastive Learning under Heterophily via Graph Filters

    Authors: Wenhan Yang, Baharan Mirzasoleiman

    Abstract: Graph contrastive learning (CL) methods learn node representations in a self-supervised manner by maximizing the similarity between the augmented node representations obtained via a GNN-based encoder. However, CL methods perform poorly on graphs with heterophily, where connected nodes tend to belong to different classes. In this work, we address this problem by proposing an effective graph CL meth… ▽ More

    Submitted 10 June, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

  23. arXiv:2302.09195  [pdf, other

    cs.LG cs.AI

    Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least

    Authors: Siddharth Joshi, Baharan Mirzasoleiman

    Abstract: Self-supervised learning (SSL) learns high-quality representations from large pools of unlabeled training data. As datasets grow larger, it becomes crucial to identify the examples that contribute the most to learning such representations. This enables efficient SSL by reducing the volume of data required. Nevertheless, quantifying the value of examples for SSL has remained an open question. In th… ▽ More

    Submitted 12 March, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted to ICML 2023, Code: https://github.com/BigML-CS-UCLA/sas-data-efficient-contrastive-learning

  24. arXiv:2302.00138  [pdf, other

    cs.LG

    Generating High Fidelity Synthetic Data via Coreset selection and Entropic Regularization

    Authors: Omead Pooladzandi, Pasha Khosravi, Erik Nijkamp, Baharan Mirzasoleiman

    Abstract: Generative models have the ability to synthesize data points drawn from the data distribution, however, not all generated samples are high quality. In this paper, we propose using a combination of coresets selection methods and ``entropic regularization'' to select the highest fidelity samples. We leverage an Energy-Based Model which resembles a variational auto-encoder with an inference and gener… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: NeurIPS 2022 Workshop on Synthetic Data for Empowering ML Research

  25. arXiv:2210.09671  [pdf, other

    cs.LG cs.CR

    Not All Poisons are Created Equal: Robust Training against Data Poisoning

    Authors: Yu Yang, Tian Yu Liu, Baharan Mirzasoleiman

    Abstract: Data poisoning causes misclassification of test time target examples by injecting maliciously crafted samples in the training data. Existing defenses are often effective only against a specific type of targeted attack, significantly degrade the generalization performance, or are prohibitive for standard deep learning pipelines. In this work, we propose an efficient defense mechanism that signifi… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:25154-25165, 2022

  26. arXiv:2210.08363  [pdf, other

    cs.LG

    Data-Efficient Augmentation for Training Neural Networks

    Authors: Tian Yu Liu, Baharan Mirzasoleiman

    Abstract: Data augmentation is essential to achieve state-of-the-art performance in many deep learning applications. However, the most effective augmentation techniques become computationally prohibitive for even medium-sized datasets. To address this, we propose a rigorous technique to select subsets of data points that when augmented, closely capture the training dynamics of full data augmentation. We fir… ▽ More

    Submitted 20 July, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: Code available at: https://github.com/tianyu139/data-efficient-augmentation

  27. arXiv:2208.10224  [pdf, other

    cs.CR cs.LG

    Friendly Noise against Adversarial Noise: A Powerful Defense against Data Poisoning Attacks

    Authors: Tian Yu Liu, Yu Yang, Baharan Mirzasoleiman

    Abstract: A powerful category of (invisible) data poisoning attacks modify a subset of training examples by small adversarial perturbations to change the prediction of certain test-time data. Existing defense mechanisms are not desirable to deploy in practice, as they often either drastically harm the generalization performance, or are attack-specific, and prohibitively slow to apply. Here, we propose a sim… ▽ More

    Submitted 20 July, 2023; v1 submitted 13 August, 2022; originally announced August 2022.

    Comments: Code available at: https://github.com/tianyu139/friendly-noise

  28. arXiv:2208.08003  [pdf, ps, other

    cs.LG stat.ML

    Investigating the Impact of Model Width and Density on Generalization in Presence of Label Noise

    Authors: Yihao Xue, Kyle Whitecross, Baharan Mirzasoleiman

    Abstract: Increasing the size of overparameterized neural networks has been a key in achieving state-of-the-art performance. This is captured by the double descent phenomenon, where the test loss follows a decreasing-increasing-decreasing pattern (or sometimes monotonically decreasing) as model width increases. However, the effect of label noise on the test loss curve has not been fully explored. In this wo… ▽ More

    Submitted 7 May, 2024; v1 submitted 16 August, 2022; originally announced August 2022.

  29. arXiv:2207.13887  [pdf, other

    cs.LG

    Adaptive Second Order Coresets for Data-efficient Machine Learning

    Authors: Omead Pooladzandi, David Davini, Baharan Mirzasoleiman

    Abstract: Training machine learning models on massive datasets incurs substantial computational costs. To alleviate such costs, there has been a sustained effort to develop data-efficient training methods that can carefully select subsets of the training examples that generalize on par with the full training data. However, existing methods are limited in providing theoretical guarantees for the quality of t… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Journal ref: International Conference on Machine Learning 2022

  30. arXiv:2201.12498  [pdf, ps, other

    cs.LG

    Investigating Why Contrastive Learning Benefits Robustness Against Label Noise

    Authors: Yihao Xue, Kyle Whitecross, Baharan Mirzasoleiman

    Abstract: Self-supervised Contrastive Learning (CL) has been recently shown to be very effective in preventing deep networks from overfitting noisy labels. Despite its empirical success, the theoretical understanding of the effect of contrastive learning on boosting robustness is very limited. In this work, we rigorously prove that the representation matrix learned by contrastive learning boosts robustness,… ▽ More

    Submitted 5 July, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

  31. arXiv:2112.14871  [pdf, other

    cs.SI

    Analytical Models for Motifs in Temporal Networks: Discovering Trends and Anomalies

    Authors: Alexandra Porter, Baharan Mirzasoleiman, Jure Leskovec

    Abstract: Dynamic evolving networks capture temporal relations in domains such as social networks, communication networks, and financial transaction networks. In such networks, temporal motifs, which are repeated sequences of time-stamped edges/transactions, offer valuable information about the networks' evolution and function. However, currently no analytical models for temporal graphs exist and there are… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

  32. arXiv:2105.02725  [pdf, other

    cs.LG cs.SI stat.ML

    CrossWalk: Fairness-enhanced Node Representation Learning

    Authors: Ahmad Khajehnejad, Moein Khajehnejad, Mahmoudreza Babaei, Krishna P. Gummadi, Adrian Weller, Baharan Mirzasoleiman

    Abstract: The potential for machine learning systems to amplify social inequities and unfairness is receiving increasing popular and academic attention. Much recent work has focused on developing algorithmic tools to assess and mitigate such unfairness. However, there is little work on enhancing fairness in graph algorithms. Here, we develop a simple, effective and general method, CrossWalk, that enhances f… ▽ More

    Submitted 25 March, 2022; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Association for the Advancement of Artificial Intelligence (AAAI) 2022

  33. arXiv:2011.07451  [pdf, ps, other

    cs.LG stat.ML

    Coresets for Robust Training of Neural Networks against Noisy Labels

    Authors: Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec

    Abstract: Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets. Although great progress has been made, existing techniques are limited in providing theoretical guarantees for the performance of the neural networks trained with noisy labels. Here we propose a novel approach with strong theoretical guarantees for robust training of deep networks trained with… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Journal ref: Advances in Neural Information Processing Systems 2020

  34. arXiv:1906.11829  [pdf, other

    cs.LG stat.ML

    Selection via Proxy: Efficient Data Selection for Deep Learning

    Authors: Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

    Abstract: Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature representations that need to be learned. In this work, we show that we can greatly improve the computational efficiency by using a small proxy model to perform data sele… ▽ More

    Submitted 26 October, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: ICLR 2020

  35. arXiv:1906.01827  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Coresets for Data-efficient Training of Machine Learning Models

    Authors: Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

    Abstract: Incremental gradient (IG) methods, such as stochastic gradient descent and its variants are commonly used for large scale optimization in machine learning. Despite the sustained effort to make IG methods more data-efficient, it remains an open question how to select a training data subset that can theoretically and practically perform on par with the full dataset. Here we develop CRAIG, a method t… ▽ More

    Submitted 16 November, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

    Journal ref: International Conference on Machine Learning 2020

  36. arXiv:1906.01021  [pdf, other

    cs.LG stat.ML

    Coresets for Estimating Means and Mean Square Error with Limited Greedy Samples

    Authors: Saeed Vahidian, Baharan Mirzasoleiman, Alexander Cloninger

    Abstract: In a number of situations, collecting a function value for every data point may be prohibitively expensive, and random sampling ignores any structure in the underlying data. We introduce a scalable optimization algorithm with no correction steps (in contrast to Frank-Wolfe and its variants), a variant of gradient ascent for coreset selection in graphs, that greedily selects a weighted subset of ve… ▽ More

    Submitted 22 June, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

  37. arXiv:1905.06618  [pdf, other

    cs.SI cs.CY

    On the Fairness of Time-Critical Influence Maximization in Social Networks

    Authors: Junaid Ali, Mahmoudreza Babaei, Abhijnan Chakraborty, Baharan Mirzasoleiman, Krishna P. Gummadi, Adish Singla

    Abstract: Influence maximization has found applications in a wide range of real-world problems, for instance, viral marketing of products in an online social network, and information propagation of valuable information such as job vacancy advertisements and health-related information. While existing algorithmic techniques usually aim at maximizing the total number of people influenced, the population often… ▽ More

    Submitted 3 November, 2021; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted at TKDE and Human-Centeric Machine learning (HCML), Workshop at NeurIPS 2019

  38. arXiv:1805.10616  [pdf, other

    cs.LG stat.ML

    Dynamic Network Model from Partial Observations

    Authors: Elahe Ghalebi, Baharan Mirzasoleiman, Radu Grosu, Jure Leskovec

    Abstract: Can evolving networks be inferred and modeled without directly observing their nodes and edges? In many applications, the edges of a dynamic network might not be observed, but one can observe the dynamics of stochastic cascading processes (e.g., information diffusion, virus propagation) occurring over the unobserved network. While there have been efforts to infer networks based on such data, provi… ▽ More

    Submitted 25 February, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

  39. arXiv:1706.03583  [pdf, other

    cs.DS cs.IR

    Streaming Non-monotone Submodular Maximization: Personalized Video Summarization on the Fly

    Authors: Baharan Mirzasoleiman, Stefanie Jegelka, Andreas Krause

    Abstract: The need for real time analysis of rapidly producing data streams (e.g., video and image streams) motivated the design of streaming algorithms that can efficiently extract and summarize useful information from massive data "on the fly". Such problems can often be reduced to maximizing a submodular set function subject to various constraints. While efficient streaming methods have been recently dev… ▽ More

    Submitted 26 December, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

  40. arXiv:1606.05615  [pdf, other

    cs.LG cs.DS

    Guaranteed Non-convex Optimization: Submodular Maximization over Continuous Domains

    Authors: Andrew An Bian, Baharan Mirzasoleiman, Joachim M. Buhmann, Andreas Krause

    Abstract: Submodular continuous functions are a category of (generally) non-convex/non-concave functions with a wide spectrum of applications. We characterize these functions and demonstrate that they can be maximized efficiently with approximation guarantees. Specifically, i) We introduce the weak DR property that gives a unified characterization of submodularity for all set, integer-lattice and continuous… ▽ More

    Submitted 6 May, 2019; v1 submitted 17 June, 2016; originally announced June 2016.

    Comments: Appears in the 20th International Conference on Artificial Intelligence and Statistics (AISTATS) 2017

  41. arXiv:1411.0541  [pdf, other

    cs.LG cs.AI cs.DC cs.IR

    Distributed Submodular Maximization

    Authors: Baharan Mirzasoleiman, Amin Karbasi, Rik Sarkar, Andreas Krause

    Abstract: Many large-scale machine learning problems--clustering, non-parametric learning, kernel machines, etc.--require selecting a small yet representative subset from a large dataset. Such problems can often be reduced to maximizing a submodular set function subject to various constraints. Classical approaches to submodular optimization require centralized access to the full dataset, which is impractica… ▽ More

    Submitted 27 June, 2016; v1 submitted 3 November, 2014; originally announced November 2014.

  42. arXiv:1409.7938  [pdf, ps, other

    cs.LG cs.DS cs.IR

    Lazier Than Lazy Greedy

    Authors: Baharan Mirzasoleiman, Ashwinkumar Badanidiyuru, Amin Karbasi, Jan Vondrak, Andreas Krause

    Abstract: Is it possible to maximize a monotone submodular function faster than the widely used lazy greedy algorithm (also known as accelerated greedy), both in theory and practice? In this paper, we develop the first linear-time algorithm for maximizing a general monotone submodular function subject to a cardinality constraint. We show that our randomized algorithm, STOCHASTIC-GREEDY, can achieve a… ▽ More

    Submitted 28 November, 2014; v1 submitted 28 September, 2014; originally announced September 2014.

    Comments: In Proc. Conference on Artificial Intelligence (AAAI), 2015