Skip to main content

Showing 1–17 of 17 results for author: Dogan, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.03814  [pdf, other

    cs.LG cs.AI

    Pearl: A Production-ready Reinforcement Learning Agent

    Authors: Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

    Abstract: Reinforcement Learning (RL) offers a versatile framework for achieving long-term goals. Its generality allows us to formalize a wide range of problems that real-world intelligent systems encounter, such as dealing with delayed rewards, handling partial observability, addressing the exploration and exploitation dilemma, utilizing offline data to improve online performance, and ensuring safety const… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  2. arXiv:2306.00867  [pdf, other

    cs.LG cs.AI

    IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control

    Authors: Rohan Chitnis, Yingchen Xu, Bobak Hashemi, Lucas Lehnert, Urun Dogan, Zheqing Zhu, Olivier Delalleau

    Abstract: Model-based reinforcement learning (RL) has shown great promise due to its sample efficiency, but still struggles with long-horizon sparse-reward tasks, especially in offline settings where the agent learns from a fixed dataset. We hypothesize that model-based RL agents struggle in these environments due to a lack of long-term planning capabilities, and that planning in a temporally abstract model… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Journal ref: Short version published at ICRA 2024 (https://tinyurl.com/icra24-iqltdmpc)

  3. arXiv:2110.03165  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Offline RL With Resource Constrained Online Deployment

    Authors: Jayanth Reddy Regatti, Aniket Anand Deshmukh, Frank Cheng, Young Hun Jung, Abhishek Gupta, Urun Dogan

    Abstract: Offline reinforcement learning is used to train policies in scenarios where real-time access to the environment is expensive or impossible. As a natural consequence of these harsh conditions, an agent may lack the resources to fully observe the online environment before taking an action. We dub this situation the resource-constrained setting. This leads to situations where the offline dataset (ava… ▽ More

    Submitted 7 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Added experiments on discrete control and real world datasets along with more analyses on continuous control tasks

  4. arXiv:2105.01289  [pdf, other

    cs.CV cs.LG

    Representation Learning for Clustering via Building Consensus

    Authors: Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Urun Dogan

    Abstract: In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must be close in the representation space (exemplar consistency), and/or similar images must have similar cluster assignment… ▽ More

    Submitted 25 April, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Paper is accepted at Springer Machine Learning Journal 2022. The code and the trained models are available at https://github.com/JayanthRR/ConCURL_NCE

  5. arXiv:2010.01245  [pdf, other

    cs.CV cs.LG

    Consensus Clustering With Unsupervised Representation Learning

    Authors: Jayanth Reddy Regatti, Aniket Anand Deshmukh, Eren Manavoglu, Urun Dogan

    Abstract: Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or have a similar cluster assignment. Bootstrap Your Own Latent (BYOL) is one such representation learning algorithm that has achieved state-of-the-art results in… ▽ More

    Submitted 8 July, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted by the 2021 International Joint Conference on Neural Networks (IJCNN 2021)

  6. arXiv:2008.07443  [pdf, other

    cs.CV cs.LG

    Zero Shot Domain Generalization

    Authors: Udit Maniyar, Joseph K J, Aniket Anand Deshmukh, Urun Dogan, Vineeth N Balasubramanian

    Abstract: Standard supervised learning setting assumes that training data and test data come from the same distribution (domain). Domain generalization (DG) methods try to learn a model that when trained on data from multiple domains, would generalize to a new unseen domain. We extend DG to an even more challenging setting, where the label space of the unseen domain could also change. We introduce this prob… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted to BMVC 2020

  7. arXiv:2003.08485  [pdf, other

    cs.CV cs.LG stat.ML

    Self-Supervised Contextual Bandits in Computer Vision

    Authors: Aniket Anand Deshmukh, Abhimanu Kumar, Levi Boyles, Denis Charles, Eren Manavoglu, Urun Dogan

    Abstract: Contextual bandits are a common problem faced by machine learning practitioners in domains as diverse as hypothesis testing to product recommendations. There have been a lot of approaches in exploiting rich data representations for contextual bandit problems with varying degree of success. Self-supervised learning is a promising approach to find rich data representations without explicit labels. I… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  8. arXiv:2002.07384  [pdf, other

    stat.ML cs.LG math.ST

    Data Transformation Insights in Self-supervision with Clustering Tasks

    Authors: Abhimanu Kumar, Aniket Anand Deshmukh, Urun Dogan, Denis Charles, Eren Manavoglu

    Abstract: Self-supervision is key to extending use of deep learning for label scarce domains. For most of self-supervised approaches data transformations play an important role. However, up until now the impact of transformations have not been studied. Furthermore, different transformations may have different impact on the system. We provide novel insights into the use of data transformation in self-supervi… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

  9. arXiv:1911.06902  [pdf, other

    cs.CV cs.LG

    Label-similarity Curriculum Learning

    Authors: Urun Dogan, Aniket Anand Deshmukh, Marcin Machura, Christian Igel

    Abstract: Curriculum learning can improve neural network training by guiding the optimization to desirable optima. We propose a novel curriculum learning approach for image classification that adapts the loss function by changing the label representation. The idea is to use a probability distribution over classes as target label, where the class probabilities reflect the similarity to the true class. Gradua… ▽ More

    Submitted 22 July, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Accepted as a conference paper at ECCV 2020

  10. arXiv:1905.10392  [pdf, other

    stat.ML cs.LG

    A Generalization Error Bound for Multi-class Domain Generalization

    Authors: Aniket Anand Deshmukh, Yunwen Lei, Srinagesh Sharma, Urun Dogan, James W. Cutler, Clayton Scott

    Abstract: Domain generalization is the problem of assigning labels to an unlabeled data set, given several similar data sets for which labels have been provided. Despite considerable interest in this problem over the last decade, there has been no theoretical analysis in the setting of multi-class classification. In this work, we study a kernel-based learning algorithm and establish a generalization error b… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

  11. arXiv:1706.09814  [pdf, other

    cs.LG

    Data-dependent Generalization Bounds for Multi-class Classification

    Authors: Yunwen Lei, Urun Dogan, Ding-Xuan Zhou, Marius Kloft

    Abstract: In this paper, we study data-dependent generalization error bounds exhibiting a mild dependency on the number of classes, making them suitable for multi-class learning with a large number of label classes. The bounds generally hold for empirical multi-class risk minimization algorithms using an arbitrary norm as regularizer. Key to our analysis are new structural results for multi-class Gaussian c… ▽ More

    Submitted 29 December, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

  12. arXiv:1705.08618  [pdf, other

    stat.ML cs.LG

    Multi-Task Learning for Contextual Bandits

    Authors: Aniket Anand Deshmukh, Urun Dogan, Clayton Scott

    Abstract: Contextual bandits are a form of multi-armed bandit in which the agent has access to predictive side information (known as the context) for each arm at each time step, and have been used to model personalized news recommendation, ad placement, and other applications. In this work, we propose a multi-task learning framework for contextual bandit problems. Like multi-task learning in the batch setti… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.

  13. Distributed Optimization of Multi-Class SVMs

    Authors: Maximilian Alber, Julian Zimmert, Urun Dogan, Marius Kloft

    Abstract: Training of one-vs.-rest SVMs can be parallelized over the number of classes in a straight forward way. Given enough computational resources, one-vs.-rest SVMs can thus be trained on data involving a large number of classes. The same cannot be stated, however, for the so-called all-in-one SVMs, which require solving a quadratic program of size quadratically in the number of classes. We develop dis… ▽ More

    Submitted 8 December, 2016; v1 submitted 25 November, 2016; originally announced November 2016.

  14. arXiv:1506.04364  [pdf, other

    cs.LG

    Localized Multiple Kernel Learning---A Convex Approach

    Authors: Yunwen Lei, Alexander Binder, Ürün Dogan, Marius Kloft

    Abstract: We propose a localized approach to multiple kernel learning that can be formulated as a convex optimization problem over a given cluster structure. For which we obtain generalization error guarantees and derive an optimization algorithm based on the Fenchel dual representation. Experiments on real-world datasets from the application domains of computational biology and computer vision show that co… ▽ More

    Submitted 12 October, 2016; v1 submitted 14 June, 2015; originally announced June 2015.

    Comments: to appear in ACML 2016

  15. arXiv:1506.04359  [pdf, ps, other

    cs.LG

    Multi-class SVMs: From Tighter Data-Dependent Generalization Bounds to Novel Algorithms

    Authors: Yunwen Lei, Ürün Dogan, Alexander Binder, Marius Kloft

    Abstract: This paper studies the generalization performance of multi-class classification algorithms, for which we obtain, for the first time, a data-dependent generalization error bound with a logarithmic dependence on the class size, substantially improving the state-of-the-art linear dependence in the existing data-dependent generalization analysis. The theoretical analysis motivates us to introduce a ne… ▽ More

    Submitted 14 June, 2015; originally announced June 2015.

  16. arXiv:1401.3737  [pdf, ps, other

    stat.ML cs.LG

    Coordinate Descent with Online Adaptation of Coordinate Frequencies

    Authors: Tobias Glasmachers, Ürün Dogan

    Abstract: Coordinate descent (CD) algorithms have become the method of choice for solving a number of optimization problems in machine learning. They are particularly popular for training linear models, including linear support vector machine classification, LASSO regression, and logistic regression. We consider general CD with non-uniform selection of coordinates. Instead of fixing selection frequencies… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

  17. arXiv:1302.5608  [pdf, ps, other

    stat.ML cs.LG

    Accelerated Linear SVM Training with Adaptive Variable Selection Frequencies

    Authors: Tobias Glasmachers, Ürün Dogan

    Abstract: Support vector machine (SVM) training is an active research area since the dawn of the method. In recent years there has been increasing interest in specialized solvers for the important case of linear models. The algorithm presented by Hsieh et al., probably best known under the name of the "liblinear" implementation, marks a major breakthrough. The method is analog to established dual decomposit… ▽ More

    Submitted 22 February, 2013; originally announced February 2013.