Zum Hauptinhalt springen

Showing 1–48 of 48 results for author: Farajtabar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02664  [pdf, other

    cs.RO

    The path towards contact-based physical human-robot interaction

    Authors: Mohammad Farajtabar, Marie Charbonneau

    Abstract: With the advancements in human-robot interaction (HRI), robots are now capable of operating in close proximity and engaging in physical interactions with humans (pHRI). Likewise, contact-based pHRI is becoming increasingly common as robots are equipped with a range of sensors to perceive human motions. Despite the presence of surveys exploring various aspects of HRI and pHRI, there is presently a… ▽ More

    Submitted 29 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2404.15653  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

    Authors: Sachin Mehta, Maxwell Horton, Fartash Faghri, Mohammad Hossein Sekhavat, Mahyar Najibi, Mehrdad Farajtabar, Oncel Tuzel, Mohammad Rastegari

    Abstract: Contrastive learning has emerged as a transformative method for learning effective visual representations through the alignment of image and text embeddings. However, pairwise similarity computation in contrastive loss between image and text pairs poses computational challenges. This paper presents a novel weakly supervised pre-training of vision models on web-scale image-text data. The proposed m… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  3. arXiv:2312.11514  [pdf, other

    cs.CL cs.AI cs.LG

    LLM in a flash: Efficient Large Language Model Inference with Limited Memory

    Authors: Keivan Alizadeh, Iman Mirzadeh, Dmitry Belenko, Karen Khatamifard, Minsik Cho, Carlo C Del Mundo, Mohammad Rastegari, Mehrdad Farajtabar

    Abstract: Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their substantial computational and memory requirements present challenges, especially for devices with limited DRAM capacity. This paper tackles the challenge of efficiently running LLMs that exceed the available DRAM capacity by storing the model parameter… ▽ More

    Submitted 30 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: ACL 2024

  4. arXiv:2312.09299  [pdf, other

    cs.LG cs.CL cs.CV

    Weight subcloning: direct initialization of transformers using larger pretrained ones

    Authors: Mohammad Samragh, Mehrdad Farajtabar, Sachin Mehta, Raviteja Vemulapalli, Fartash Faghri, Devang Naik, Oncel Tuzel, Mohammad Rastegari

    Abstract: Training large transformer models from scratch for a target task requires lots of data and is computationally demanding. The usual practice of transfer learning overcomes this challenge by initializing the model with weights of a pretrained model of the same size and specification to increase the convergence and training speed. However, what if no pretrained model of the required size is available… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  5. arXiv:2311.18237  [pdf, other

    cs.CV cs.LG

    Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models

    Authors: Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel

    Abstract: Vision Foundation Models (VFMs) pretrained on massive datasets exhibit impressive performance on various downstream tasks, especially with limited labeled target data. However, due to their high inference compute cost, these models cannot be deployed for many real-world applications. Motivated by this, we ask the following important question, "How can we leverage the knowledge from a large VFM to… ▽ More

    Submitted 1 July, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: International Conference on Machine Learning, 2024

  6. arXiv:2310.16226  [pdf, other

    cs.CV cs.CL cs.LG

    TiC-CLIP: Continual Training of CLIP Models

    Authors: Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri

    Abstract: Keeping large foundation models up to date on latest data is inherently expensive. To avoid the prohibitive costs of constantly retraining, it is imperative to continually train these models. This problem is exacerbated by the lack of any large scale continual learning benchmarks or baselines. We introduce the first set of web-scale Time-Continual (TiC) benchmarks for training vision-language mode… ▽ More

    Submitted 21 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  7. arXiv:2310.15308  [pdf, other

    cs.CV cs.LG

    SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

    Authors: Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari

    Abstract: The landscape of publicly available vision foundation models (VFMs), such as CLIP and Segment Anything Model (SAM), is expanding rapidly. VFMs are endowed with distinct capabilities stemming from their pre-training objectives. For instance, CLIP excels in semantic understanding, while SAM specializes in spatial understanding for segmentation. In this work, we introduce a simple recipe to efficient… ▽ More

    Submitted 10 June, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  8. arXiv:2310.14108  [pdf, other

    cs.LG cs.AI cs.CV

    CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement

    Authors: Mohammadreza Salehi, Mehrdad Farajtabar, Maxwell Horton, Fartash Faghri, Hadi Pouransari, Raviteja Vemulapalli, Oncel Tuzel, Ali Farhadi, Mohammad Rastegari, Sachin Mehta

    Abstract: Contrastive language image pretraining (CLIP) is a standard method for training vision-language models. While CLIP is scalable, promptable, and robust to distribution shifts on image classification tasks, it lacks object localization capabilities. This paper studies the following question: Can we augment CLIP training with task-specific vision models from model zoos to improve its visual represent… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  9. arXiv:2310.04564  [pdf, other

    cs.LG cs.AI

    ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models

    Authors: Iman Mirzadeh, Keivan Alizadeh, Sachin Mehta, Carlo C Del Mundo, Oncel Tuzel, Golnoosh Samei, Mohammad Rastegari, Mehrdad Farajtabar

    Abstract: Large Language Models (LLMs) with billions of parameters have drastically transformed AI applications. However, their demanding computation during inference has raised significant challenges for deployment on resource-constrained devices. Despite recent trends favoring alternative activation functions such as GELU or SiLU, known for increased computation, this study strongly advocates for reinstat… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: preprint

  10. arXiv:2309.04502  [pdf, other

    cs.CV

    On the Efficacy of Multi-scale Data Samplers for Vision Applications

    Authors: Elvis Nunez, Thomas Merth, Anish Prabhu, Mehrdad Farajtabar, Mohammad Rastegari, Sachin Mehta, Maxwell Horton

    Abstract: Multi-scale resolution training has seen an increased adoption across multiple vision tasks, including classification and detection. Training with smaller resolutions enables faster training at the expense of a drop in accuracy. Conversely, training with larger resolutions has been shown to improve performance, but memory constraints often make this infeasible. In this paper, we empirically study… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  11. arXiv:2303.08983  [pdf, other

    cs.CV cs.AI cs.LG

    Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement

    Authors: Fartash Faghri, Hadi Pouransari, Sachin Mehta, Mehrdad Farajtabar, Ali Farhadi, Mohammad Rastegari, Oncel Tuzel

    Abstract: We propose Dataset Reinforcement, a strategy to improve a dataset once such that the accuracy of any model architecture trained on the reinforced dataset is improved at no additional training cost for users. We propose a Dataset Reinforcement strategy based on data augmentation and knowledge distillation. Our generic strategy is designed based on extensive analysis across CNN- and transformer-base… ▽ More

    Submitted 22 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted at International Conference on Computer Vision (ICCV) 2023. v2: Camera-ready version with new Tables 9 and 10. v3: Correction to Table 7-Avg. column

  12. arXiv:2207.02099  [pdf, other

    cs.LG

    An Empirical Study of Implicit Regularization in Deep Offline RL

    Authors: Caglar Gulcehre, Srivatsan Srinivasan, Jakub Sygnowski, Georg Ostrovski, Mehrdad Farajtabar, Matt Hoffman, Razvan Pascanu, Arnaud Doucet

    Abstract: Deep neural networks are the most commonly used function approximators in offline reinforcement learning. Prior works have shown that neural nets trained with TD-learning and gradient descent can exhibit implicit regularization that can be characterized by under-parameterization of these networks. Specifically, the rank of the penultimate feature layer, also called \textit{effective rank}, has bee… ▽ More

    Submitted 7 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: 40 pages, 37 figures, 2 tables

  13. arXiv:2202.09826  [pdf, other

    cs.LG cs.AI

    Continual Learning Beyond a Single Model

    Authors: Thang Doan, Seyed Iman Mirzadeh, Mehrdad Farajtabar

    Abstract: A growing body of research in continual learning focuses on the catastrophic forgetting problem. While many attempts have been made to alleviate this problem, the majority of the methods assume a single model in the continual learning setup. In this work, we question this assumption and show that employing ensemble models can be a simple yet effective method to improve continual performance. Howev… ▽ More

    Submitted 3 July, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted to 2nd Conference on Lifelong Learning Agents (CoLLAs 2023); Keywords: continual learning, neural network subspaces, ensemble models, computationally efficient training

  14. arXiv:2202.00275  [pdf, other

    cs.LG cs.AI

    Architecture Matters in Continual Learning

    Authors: Seyed Iman Mirzadeh, Arslan Chaudhry, Dong Yin, Timothy Nguyen, Razvan Pascanu, Dilan Gorur, Mehrdad Farajtabar

    Abstract: A large body of research in continual learning is devoted to overcoming the catastrophic forgetting of neural networks by designing new algorithms that are robust to the distribution shifts. However, the majority of these works are strictly focused on the "algorithmic" part of continual learning for a "fixed neural network architecture", and the implications of using different architectures are mo… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: preprint

  15. arXiv:2111.01950  [pdf

    cs.LG eess.IV physics.med-ph

    Machine-Learning Identification of Hemodynamics in Coronary Arteries in the Presence of Stenosis

    Authors: Mohammad Farajtabar, Mohit Biglarian, Morteza Miansari

    Abstract: Prediction of the blood flow characteristics is of utmost importance for understanding the behavior of the blood arterial network, especially in the presence of vascular diseases such as stenosis. Computational fluid dynamics (CFD) has provided a powerful and efficient tool to determine these characteristics including the pressure and velocity fields within the network. Despite numerous studies in… ▽ More

    Submitted 10 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

  16. arXiv:2110.11526  [pdf, other

    cs.LG cs.AI cs.CV

    Wide Neural Networks Forget Less Catastrophically

    Authors: Seyed Iman Mirzadeh, Arslan Chaudhry, Dong Yin, Huiyi Hu, Razvan Pascanu, Dilan Gorur, Mehrdad Farajtabar

    Abstract: A primary focus area in continual learning research is alleviating the "catastrophic forgetting" problem in neural networks by designing new algorithms that are more robust to the distribution shifts. While the recent progress in continual learning literature is encouraging, our understanding of what properties of neural networks contribute to catastrophic forgetting is still limited. To address t… ▽ More

    Submitted 14 July, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: ICML 2022

  17. arXiv:2106.12772  [pdf, other

    cs.LG stat.ML

    Task-agnostic Continual Learning with Hybrid Probabilistic Models

    Authors: Polina Kirichenko, Mehrdad Farajtabar, Dushyant Rao, Balaji Lakshminarayanan, Nir Levine, Ang Li, Huiyi Hu, Andrew Gordon Wilson, Razvan Pascanu

    Abstract: Learning new tasks continuously without forgetting on a constantly changing data distribution is essential for real-world problems but extremely challenging for modern deep learning. In this work we propose HCL, a Hybrid generative-discriminative approach to Continual Learning for classification. We model the distribution of each task and each class with a normalizing flow. The flow is used to lea… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  18. arXiv:2011.11199  [pdf, other

    cs.LG stat.ML

    Balance Regularized Neural Network Models for Causal Effect Estimation

    Authors: Mehrdad Farajtabar, Andrew Lee, Yuanjian Feng, Vishal Gupta, Peter Dolan, Harish Chandran, Martin Szummer

    Abstract: Estimating individual and average treatment effects from observational data is an important problem in many domains such as healthcare and e-commerce. In this paper, we advocate balance regularization of multi-head neural network architectures. Our work is motivated by representation learning techniques to reduce differences between treated and untreated distributions that potentially arise due to… ▽ More

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: Causal Discovery & Causality-Inspired Machine Learning Workshop at Neural Information Processing Systems, 2020

  19. arXiv:2010.04495  [pdf, other

    cs.LG cs.AI cs.CV

    Linear Mode Connectivity in Multitask and Continual Learning

    Authors: Seyed Iman Mirzadeh, Mehrdad Farajtabar, Dilan Gorur, Razvan Pascanu, Hassan Ghasemzadeh

    Abstract: Continual (sequential) training and multitask (simultaneous) training are often attempting to solve the same overall objective: to find a solution that performs well on all considered tasks. The main difference is in the training regimes, where continual learning can only have access to one task at a time, which for neural networks typically leads to catastrophic forgetting. That is, the solution… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  20. arXiv:2010.02418  [pdf, other

    cs.LG cs.AI cs.CV

    The Effectiveness of Memory Replay in Large Scale Continual Learning

    Authors: Yogesh Balaji, Mehrdad Farajtabar, Dong Yin, Alex Mott, Ang Li

    Abstract: We study continual learning in the large scale setting where tasks in the input sequence are not limited to classification, and the outputs can be of high dimension. Among multiple state-of-the-art methods, we found vanilla experience replay (ER) still very competitive in terms of both performance and scalability, despite its simplicity. However, a degraded performance is observed for ER with smal… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 15 pages

  21. arXiv:2006.12620  [pdf, other

    cs.LG cs.AI

    A maximum-entropy approach to off-policy evaluation in average-reward MDPs

    Authors: Nevena Lazic, Dong Yin, Mehrdad Farajtabar, Nir Levine, Dilan Gorur, Chris Harris, Dale Schuurmans

    Abstract: This work focuses on off-policy evaluation (OPE) with function approximation in infinite-horizon undiscounted Markov decision processes (MDPs). For MDPs that are ergodic and linear (i.e. where rewards and dynamics are linear in some known features), we provide the first finite-sample OPE error bound, extending existing results beyond the episodic and discounted cases. In a more general setting, wh… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  22. arXiv:2006.10974  [pdf, ps, other

    cs.LG stat.ML

    Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

    Authors: Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

    Abstract: Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tend to drop significantly. This problem is often referred to as catastrophic forgetting, a key challenge in continual learning of neural networks. The regularization-based approach is one of the primary… ▽ More

    Submitted 8 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Preliminary version with a different title presented at ICML Workshop on Continual Learning, 2020 (spotlight)

  23. arXiv:2006.06958  [pdf, other

    cs.LG cs.NE stat.ML

    Understanding the Role of Training Regimes in Continual Learning

    Authors: Seyed Iman Mirzadeh, Mehrdad Farajtabar, Razvan Pascanu, Hassan Ghasemzadeh

    Abstract: Catastrophic forgetting affects the training of neural networks, limiting their ability to learn multiple tasks sequentially. From the perspective of the well established plasticity-stability dilemma, neural networks tend to be overly plastic, lacking the stability necessary to prevent the forgetting of previous knowledge, which means that as learning progresses, networks tend to forget previously… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  24. arXiv:2006.06051  [pdf, other

    cs.LG cs.GT cs.MA stat.ML

    Learning to Incentivize Other Learning Agents

    Authors: Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha

    Abstract: The challenge of developing powerful and general Reinforcement Learning (RL) agents has received increasing attention in recent years. Much of this effort has focused on the single-agent setting, in which an agent maximizes a predefined extrinsic reward function. However, a long-term question inevitably arises: how will such independent agents cooperate when they are continually learning and actin… ▽ More

    Submitted 19 October, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 20 pages, 11 figures. To appear in 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  25. arXiv:2004.11545  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Dropout as an Implicit Gating Mechanism For Continual Learning

    Authors: Seyed-Iman Mirzadeh, Mehrdad Farajtabar, Hassan Ghasemzadeh

    Abstract: In recent years, neural networks have demonstrated an outstanding ability to achieve complex learning tasks across various domains. However, they suffer from the "catastrophic forgetting" problem when they face a sequence of learning tasks, where they forget the old ones as they learn new tasks. This problem is also highly related to the "stability-plasticity dilemma". The more plastic the network… ▽ More

    Submitted 24 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Workshops

  26. arXiv:2002.05715  [pdf, other

    cs.LG stat.ML

    Self-Distillation Amplifies Regularization in Hilbert Space

    Authors: Hossein Mobahi, Mehrdad Farajtabar, Peter L. Bartlett

    Abstract: Knowledge distillation introduced in the deep learning context is a method to transfer knowledge from one architecture to another. In particular, when the architectures are identical, this is called self-distillation. The idea is to feed in predictions of the trained model as new target values for retraining (and iterate this loop possibly a few times). It has been empirically observed that the se… ▽ More

    Submitted 26 October, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

  27. arXiv:1910.07104  [pdf, other

    cs.LG stat.ML

    Orthogonal Gradient Descent for Continual Learning

    Authors: Mehrdad Farajtabar, Navid Azizan, Alex Mott, Ang Li

    Abstract: Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks suffer from the problem of catastrophic forgetting; they forget how to solve previous tasks after being trained on a new task, despite having the essential capacity… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  28. arXiv:1906.05930  [pdf, other

    cs.CV cs.LG cs.RO

    Cross-View Policy Learning for Street Navigation

    Authors: Ang Li, Huiyi Hu, Piotr Mirowski, Mehrdad Farajtabar

    Abstract: The ability to navigate from visual observations in unfamiliar environments is a core component of intelligent agents and an ongoing challenge for Deep Reinforcement Learning (RL). Street View can be a sensible testbed for such RL agents, because it provides real-world photographic imagery at ground level, with diverse street appearances; it has been made into an interactive environment called Str… ▽ More

    Submitted 22 September, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: 8 pages

  29. arXiv:1902.03393  [pdf, other

    cs.LG cs.AI stat.ML

    Improved Knowledge Distillation via Teacher Assistant

    Authors: Seyed-Iman Mirzadeh, Mehrdad Farajtabar, Ang Li, Nir Levine, Akihiro Matsukawa, Hassan Ghasemzadeh

    Abstract: Despite the fact that deep neural networks are powerful models and achieve appealing results on many tasks, they are too large to be deployed on edge devices like smartphones or embedded sensor nodes. There have been efforts to compress these networks, and a popular method is knowledge distillation, where a large (teacher) pre-trained network is used to train a smaller (student) network. However,… ▽ More

    Submitted 16 December, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: AAAI 2020

  30. arXiv:1812.02224  [pdf, other

    stat.ML cs.LG

    Adapting Auxiliary Losses Using Gradient Similarity

    Authors: Yunshu Du, Wojciech M. Czarnecki, Siddhant M. Jayakumar, Mehrdad Farajtabar, Razvan Pascanu, Balaji Lakshminarayanan

    Abstract: One approach to deal with the statistical inefficiency of neural networks is to rely on auxiliary losses that help to build useful representations. However, it is not always trivial to know if an auxiliary task will be helpful for the main task and when it could start hurting. We propose to use the cosine similarity between gradients of tasks as an adaptive weight to detect when an auxiliary loss… ▽ More

    Submitted 25 November, 2020; v1 submitted 5 December, 2018; originally announced December 2018.

  31. arXiv:1803.04051  [pdf, other

    cs.LG stat.ML

    Representation Learning over Dynamic Graphs

    Authors: Rakshit Trivedi, Mehrdad Farajtabar, Prasenjeet Biswal, Hongyuan Zha

    Abstract: How can we effectively encode evolving information over dynamic graphs into low-dimensional representations? In this paper, we propose DyRep, an inductive deep representation learning framework that learns a set of functions to efficiently produce low-dimensional node embeddings that evolves over time. The learned embeddings drive the dynamics of two key processes namely, communication and associa… ▽ More

    Submitted 16 March, 2018; v1 submitted 11 March, 2018; originally announced March 2018.

  32. arXiv:1802.03493  [pdf, other

    cs.AI

    More Robust Doubly Robust Off-policy Evaluation

    Authors: Mehrdad Farajtabar, Yinlam Chow, Mohammad Ghavamzadeh

    Abstract: We study the problem of off-policy evaluation (OPE) in reinforcement learning (RL), where the goal is to estimate the performance of a policy from the data generated by another policy(ies). In particular, we focus on the doubly robust (DR) estimators that consist of an importance sampling (IS) component and a performance model, and utilize the low (or zero) bias of IS and low variance of the model… ▽ More

    Submitted 23 May, 2018; v1 submitted 9 February, 2018; originally announced February 2018.

  33. arXiv:1712.04386  [pdf, other

    q-bio.PE cs.AI cs.CE cs.SI physics.soc-ph

    Hawkes Processes for Invasive Species Modeling and Management

    Authors: Amrita Gupta, Mehrdad Farajtabar, Bistra Dilkina, Hongyuan Zha

    Abstract: The spread of invasive species to new areas threatens the stability of ecosystems and causes major economic losses in agriculture and forestry. We propose a novel approach to minimizing the spread of an invasive species given a limited intervention budget. We first model invasive species propagation using Hawkes processes, and then derive closed-form expressions for characterizing the effect of an… ▽ More

    Submitted 12 December, 2017; originally announced December 2017.

  34. arXiv:1705.08051  [pdf, other

    cs.LG stat.ML

    Wasserstein Learning of Deep Generative Point Process Models

    Authors: Shuai Xiao, Mehrdad Farajtabar, Xiaojing Ye, Junchi Yan, Le Song, Hongyuan Zha

    Abstract: Point processes are becoming very popular in modeling asynchronous sequential data due to their sound mathematical foundation and strength in modeling a variety of real-world phenomena. Currently, they are often characterized via intensity function which limits model's expressiveness due to unrealistic assumptions on its parametric form used in practice. Furthermore, they are learned via maximum l… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

  35. arXiv:1703.08524  [pdf, other

    cs.LG

    Joint Modeling of Event Sequence and Time Series with Attentional Twin Recurrent Neural Networks

    Authors: Shuai Xiao, Junchi Yan, Mehrdad Farajtabar, Le Song, Xiaokang Yang, Hongyuan Zha

    Abstract: A variety of real-world processes (over networks) produce sequences of data whose complex temporal dynamics need to be studied. More especially, the event timestamps can carry important information about the underlying network dynamics, which otherwise are not available from the time-series evenly sampled from continuous signals. Moreover, in most complex processes, event sequences and evenly-samp… ▽ More

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: 14 pages

  36. arXiv:1703.07823  [pdf, other

    cs.LG cs.SI

    Fake News Mitigation via Point Process Based Intervention

    Authors: Mehrdad Farajtabar, Jiachen Yang, Xiaojing Ye, Huan Xu, Rakshit Trivedi, Elias Khalil, Shuang Li, Le Song, Hongyuan Zha

    Abstract: We propose the first multistage intervention framework that tackles fake news in social networks by combining reinforcement learning with a point process network activity model. The spread of fake news and mitigation events within the network is modeled by a multivariate Hawkes process with additional exogenous control terms. By choosing a feature representation of states, defining mitigation acti… ▽ More

    Submitted 19 June, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Point Process, Hawkes Process, Social Networks, Intervention and Control, Reinforcement Learning, ICML 2017

  37. arXiv:1703.01442  [pdf, other

    cs.SI cs.LG stat.ML

    Recurrent Poisson Factorization for Temporal Recommendation

    Authors: Seyed Abbas Hosseini, Keivan Alizadeh, Ali Khodadadi, Ali Arabzadeh, Mehrdad Farajtabar, Hongyuan Zha, Hamid R. Rabiee

    Abstract: Poisson factorization is a probabilistic model of users and items for recommendation systems, where the so-called implicit consumer data is modeled by a factorized Poisson distribution. There are many variants of Poisson factorization methods who show state-of-the-art performance on real-world recommendation tasks. However, most of them do not explicitly take into account the temporal behavior and… ▽ More

    Submitted 4 March, 2017; originally announced March 2017.

    Comments: Submitted to KDD 2017 | Halifax, Nova Scotia - Canada - sigkdd, Codes are available at https://github.com/AHosseini/RPF

  38. Distilling Information Reliability and Source Trustworthiness from Digital Traces

    Authors: Behzad Tabibian, Isabel Valera, Mehrdad Farajtabar, Le Song, Bernhard Schölkopf, Manuel Gomez-Rodriguez

    Abstract: Online knowledge repositories typically rely on their users or dedicated editors to evaluate the reliability of their content. These evaluations can be viewed as noisy measurements of both information reliability and information source trustworthiness. Can we leverage these noisy evaluations, often biased, to distill a robust, unbiased and interpretable measure of both notions? In this paper, we… ▽ More

    Submitted 2 April, 2017; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: Accepted at 26th World Wide Web conference (WWW-17)

  39. arXiv:1606.03816  [pdf, other

    cs.SI physics.soc-ph

    Multistage Campaigning in Social Networks

    Authors: Mehrdad Farajtabar, Xiaojing Ye, Sahar Harati, Le Song, Hongyuan Zha

    Abstract: We consider the problem of how to optimize multi-stage campaigning over social networks. The dynamic programming framework is employed to balance the high present reward and large penalty on low future outcome in the presence of extensive uncertainties. In particular, we establish theoretical foundations of optimal campaigning over social networks where the user activities are modeled as a multiva… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

  40. arXiv:1605.06855  [pdf, other

    cs.SI cs.LG stat.ML

    Smart broadcasting: Do you want to be seen?

    Authors: Mohammad Reza Karimi, Erfan Tavakoli, Mehrdad Farajtabar, Le Song, Manuel Gomez-Rodriguez

    Abstract: Many users in online social networks are constantly trying to gain attention from their followers by broadcasting posts to them. These broadcasters are likely to gain greater attention if their posts can remain visible for a longer period of time among their followers' most recent feeds. Then when to post? In this paper, we study the problem of smart broadcasting using the framework of temporal po… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    Comments: To appear in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (CA, USA), 2016

  41. arXiv:1603.08981  [pdf, other

    cs.LG stat.ML

    Detecting weak changes in dynamic events over networks

    Authors: Shuang Li, Yao Xie, Mehrdad Farajtabar, Apurv Verma, Le Song

    Abstract: Large volume of networked streaming event data are becoming increasingly available in a wide variety of applications, such as social network analysis, Internet traffic monitoring and healthcare analytics. Streaming event data are discrete observation occurred in continuous time, and the precise time interval between two events carries a great deal of information about the dynamics of the underlyin… ▽ More

    Submitted 16 September, 2016; v1 submitted 29 March, 2016; originally announced March 2016.

  42. arXiv:1602.04511  [pdf, other

    cs.LG stat.ML

    Learning Granger Causality for Hawkes Processes

    Authors: Hongteng Xu, Mehrdad Farajtabar, Hongyuan Zha

    Abstract: Learning Granger causality for general point processes is a very challenging task. In this paper, we propose an effective method, learning Granger causality, for a special but significant type of point processes --- Hawkes process. We reveal the relationship between Hawkes process's impact function and its Granger causality graph. Specifically, our model represents impact functions using a series… ▽ More

    Submitted 11 June, 2016; v1 submitted 14 February, 2016; originally announced February 2016.

    Comments: International Conference on Machine Learning, 2016

  43. arXiv:1511.04145  [pdf, other

    cs.SI cs.LG

    A Continuous-time Mutually-Exciting Point Process Framework for Prioritizing Events in Social Media

    Authors: Mehrdad Farajtabar, Safoora Yousefi, Long Q. Tran, Le Song, Hongyuan Zha

    Abstract: The overwhelming amount and rate of information update in online social media is making it increasingly difficult for users to allocate their attention to their topics of interest, thus there is a strong need for prioritizing news feeds. The attractiveness of a post to a user depends on many complex contextual and temporal features of the post. For instance, the contents of the post, the responsiv… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

  44. On The Network You Keep: Analyzing Persons of Interest using Cliqster

    Authors: Saber Shokat Fadaee, Mehrdad Farajtabar, Ravi Sundaram, Javed A. Aslam, Nikos Passas

    Abstract: Our goal is to determine the structural differences between different categories of networks and to use these differences to predict the network category. Existing work on this topic has looked at social networks such as Facebook, Twitter, co-author networks etc. We, instead, focus on a novel data set that we have assembled from a variety of sources, including law-enforcement agencies, financial i… ▽ More

    Submitted 5 October, 2015; originally announced October 2015.

    Comments: The final publication is available at Springer via http://dx.doi.org/10.1007/s13278-015-0302-0 A preliminary version of this paper appeared in Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

  45. arXiv:1510.00936  [pdf, other

    cs.SI

    Correlated Cascades: Compete or Cooperate

    Authors: Ali Zarezade, Ali Khodadadi, Mehrdad Farajtabar, Hamid R. Rabiee, Hongyuan Zha

    Abstract: In real world social networks, there are multiple cascades which are rarely independent. They usually compete or cooperate with each other. Motivated by the reinforcement theory in sociology we leverage the fact that adoption of a user to any behavior is modeled by the aggregation of behaviors of its neighbors. We use a multidimensional marked Hawkes process to model users product adoption and con… ▽ More

    Submitted 22 November, 2016; v1 submitted 4 October, 2015; originally announced October 2015.

  46. arXiv:1507.02293  [pdf, other

    cs.SI cs.LG physics.soc-ph stat.ML

    COEVOLVE: A Joint Point Process Model for Information Diffusion and Network Co-evolution

    Authors: Mehrdad Farajtabar, Yichen Wang, Manuel Gomez Rodriguez, Shuang Li, Hongyuan Zha, Le Song

    Abstract: Information diffusion in online social networks is affected by the underlying network topology, but it also has the power to change it. Online users are constantly creating new links when exposed to new information sources, and in turn these links are alternating the way information spreads. However, these two highly intertwined stochastic processes, information diffusion and network evolution, ha… ▽ More

    Submitted 1 April, 2016; v1 submitted 8 July, 2015; originally announced July 2015.

  47. arXiv:1501.06582  [pdf, other

    cs.SI physics.soc-ph

    Back to the Past: Source Identification in Diffusion Networks from Partially Observed Cascades

    Authors: Mehrdad Farajtabar, Manuel Gomez-Rodriguez, Nan Du, Mohammad Zamani, Hongyuan Zha, Le Song

    Abstract: When a piece of malicious information becomes rampant in an information diffusion network, can we identify the source node that originally introduced the piece into the network and infer the time when it initiated this? Being able to do so is critical for curtailing the spread of malicious information, and reducing the potential losses incurred. This is a very challenging problem since typically o… ▽ More

    Submitted 26 January, 2015; originally announced January 2015.

    Comments: To appear in the 18th International Conference on Artificial Intelligence and Statistics (AISTATS), 2015

  48. arXiv:1408.0406  [pdf, other

    cs.SI

    Shaping Social Activity by Incentivizing Users

    Authors: Mehrdad Farajtabar, Nan Du, Manuel Gomez Rodriguez, Isabel Valera, Hongyuan Zha, Le Song

    Abstract: Events in an online social network can be categorized roughly into endogenous events, where users just respond to the actions of their neighbors within the network, or exogenous events, where users take actions due to drives external to the network. How much external drive should be provided to each user, such that the network activity can be steered towards a target state? In this paper, we model… ▽ More

    Submitted 19 August, 2014; v1 submitted 2 August, 2014; originally announced August 2014.