Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Malekzadeh, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12446  [pdf, other

    q-fin.RM cs.LG q-fin.ST

    EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning

    Authors: Parvin Malekzadeh, Zissis Poulos, Jacky Chen, Zeyu Wang, Konstantinos N. Plataniotis

    Abstract: Recent advancements in Distributional Reinforcement Learning (DRL) for modeling loss distributions have shown promise in developing hedging strategies in derivatives markets. A common approach in DRL involves learning the quantiles of loss distributions at specified levels using Quantile Regression (QR). This method is particularly effective in option hedging due to its direct quantile-based risk… ▽ More

    Submitted 27 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: 14 pages

  2. A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty

    Authors: Parvin Malekzadeh, Ming Hou, Konstantinos N. Plataniotis

    Abstract: Exploration is a significant challenge in practical reinforcement learning (RL), and uncertainty-aware exploration that incorporates the quantification of epistemic and aleatory uncertainty has been recognized as an effective exploration strategy. However, capturing the combined effect of aleatory and epistemic uncertainty for decision-making is difficult. Existing works estimate aleatory and epis… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP2023

  3. arXiv:2401.02325  [pdf, other

    cs.LG stat.ML

    A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning

    Authors: Parvin Malekzadeh, Konstantinos N. Plataniotis, Zissis Poulos, Zeyu Wang

    Abstract: Distributional Reinforcement Learning (RL) estimates return distribution mainly by learning quantile values via minimizing the quantile Huber loss function, entailing a threshold parameter often selected heuristically or via hyperparameter search, which may not generalize well and can be suboptimal. This paper introduces a generalized quantile Huber loss function derived from Wasserstein distance… ▽ More

    Submitted 7 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: 6 pages, 1 figure, to be published in ICASSP 2024

  4. Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning

    Authors: Parvin Malekzadeh, Ming Hou, Konstantinos N. Plataniotis

    Abstract: Sample efficiency is central to developing practical reinforcement learning (RL) for complex and large-scale decision-making problems. The ability to transfer and generalize knowledge gained from previous experiences to downstream tasks can significantly improve sample efficiency. Recent research indicates that successor feature (SF) RL algorithms enable knowledge generalization between tasks with… ▽ More

    Submitted 22 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 40 pages

    Journal ref: Neurocomputing 530 (2023): 165-187

  5. arXiv:2212.07946  [pdf, other

    cs.LG cs.AI

    Active Inference and Reinforcement Learning: A unified inference on continuous state and action spaces under partial observability

    Authors: Parvin Malekzadeh, Konstantinos N. Plataniotis

    Abstract: Reinforcement learning (RL) has garnered significant attention for developing decision-making agents that aim to maximize rewards, specified by an external supervisor, within fully observable environments. However, many real-world problems involve partial observations, formulated as partially observable Markov decision processes (POMDPs). Previous studies have tackled RL in POMDPs by either incorp… ▽ More

    Submitted 31 May, 2024; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 90 pages including appendices

  6. AKF-SR: Adaptive Kalman Filtering-based Successor Representation

    Authors: Parvin Malekzadeh, Mohammad Salimibeni, Ming Hou, Arash Mohammadi, Konstantinos N. Plataniotis

    Abstract: Recent studies in neuroscience suggest that Successor Representation (SR)-based models provide adaptation to changes in the goal locations or reward function faster than model-free algorithms, together with lower computational cost compared to that of model-based algorithms. However, it is not known how such representation might help animals to manage uncertainty in their decision-making. Existing… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

    Journal ref: Neurocomputing 467 (2022), pp.476-490

  7. arXiv:2112.15156  [pdf, other

    cs.LG cs.MA eess.SP

    Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation

    Authors: Mohammad Salimibeni, Arash Mohammadi, Parvin Malekzadeh, Konstantinos N. Plataniotis

    Abstract: Distributed Multi-Agent Reinforcement Learning (MARL) algorithms has attracted a surge of interest lately mainly due to the recent advancements of Deep Neural Networks (DNNs). Conventional Model-Based (MB) or Model-Free (MF) RL algorithms are not directly applicable to the MARL problems due to utilization of a fixed reward model for learning the underlying value function. While DNN-based solutions… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

  8. arXiv:2006.00195  [pdf, other

    cs.LG cs.AI eess.SP stat.ML

    MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning

    Authors: Parvin Malekzadeh, Mohammad Salimibeni, Arash Mohammadi, Akbar Assa, Konstantinos N. Plataniotis

    Abstract: There has been an increasing surge of interest on development of advanced Reinforcement Learning (RL) systems as intelligent approaches to learn optimal control policies directly from smart agents' interactions with the environment. Objectives: In a model-free RL method with continuous state-space, typically, the value function of the states needs to be approximated. In this regard, Deep Neural Ne… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.