Zum Hauptinhalt springen

Showing 1–50 of 69 results for author: Rezaei, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07492  [pdf, other

    cs.CL

    Paraphrasing in Affirmative Terms Improves Negation Understanding

    Authors: MohammadHossein Rezaei, Eduardo Blanco

    Abstract: Negation is a common linguistic phenomenon. Yet language models face challenges with negation in many natural language understanding tasks such as question answering and natural language inference. In this paper, we experiment with seamless strategies that incorporate affirmative interpretations (i.e., paraphrases without negation) to make models more robust against negation. Crucially, our affirm… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  2. arXiv:2405.18218  [pdf, other

    cs.LG

    FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models

    Authors: Yang Zhang, Yawei Li, Xinpeng Wang, Qianli Shen, Barbara Plank, Bernd Bischl, Mina Rezaei, Kenji Kawaguchi

    Abstract: Overparametrized transformer networks are the state-of-the-art architecture for Large Language Models (LLMs). However, such models contain billions of parameters making large compute a necessity, while raising environmental concerns. To address these issues, we propose FinerCut, a new form of fine-grained layer pruning, which in contrast to prior work at the transformer block level, considers all… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages

  3. arXiv:2405.11848  [pdf, other

    stat.ML cs.AI cs.LG cs.NE physics.ao-ph q-bio.NC

    Alternators For Sequence Modeling

    Authors: Mohammad Reza Rezaei, Adji Bousso Dieng

    Abstract: This paper introduces alternators, a novel family of non-Markovian dynamical models for sequences. An alternator features two neural networks: the observation trajectory network (OTN) and the feature trajectory network (FTN). The OTN and the FTN work in conjunction, alternating between outputting samples in the observation space and some feature space, respectively, over a cycle. The parameters of… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: A new versatile family of sequence models that can be used for both generative modeling and supervised learning. The codebase will be made available upon publication. This paper is dedicated to Thomas Sankara

  4. arXiv:2402.12810  [pdf, other

    cs.CV cs.AI cs.NE eess.IV stat.ML

    PIP-Net: Pedestrian Intention Prediction in the Wild

    Authors: Mohsen Azarmi, Mahdi Rezaei, He Wang, Sebastien Glaser

    Abstract: Accurate pedestrian intention prediction (PIP) by Autonomous Vehicles (AVs) is one of the current research challenges in this field. In this article, we introduce PIP-Net, a novel framework designed to predict pedestrian crossing intentions by AVs in real-world urban scenarios. We offer two variants of PIP-Net designed for different camera mounts and setups. Leveraging both kinematic data and spat… ▽ More

    Submitted 1 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2401.11284  [pdf, other

    cs.CV cs.AI cs.NE

    Evaluating Driver Readiness in Conditionally Automated Vehicles from Eye-Tracking Data and Head Pose

    Authors: Mostafa Kazemi, Mahdi Rezaei, Mohsen Azarmi

    Abstract: As automated driving technology advances, the role of the driver to resume control of the vehicle in conditionally automated vehicles becomes increasingly critical. In the SAE Level 3 or partly automated vehicles, the driver needs to be available and ready to intervene when necessary. This makes it essential to evaluate their readiness accurately. This article presents a comprehensive analysis of… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  6. arXiv:2312.04427  [pdf, other

    cs.IT q-bio.CB

    Spheroidal Molecular Communication via Diffusion: Signaling Between Homogeneous Cell Aggregates

    Authors: Mitra Rezaei, Hamidreza Arjmandi, Mohammad Zoofaghari, Kajsa Kanebratt, Liisa Vilen, David Janzen, Peter Gennemark, Adam Noel

    Abstract: Recent molecular communication (MC) research has integrated more detailed computational models to capture the dynamics of practical biophysical systems. This research focuses on developing realistic models for MC transceivers inspired by spheroids - three-dimensional cell aggregates commonly used in organ-on-chip experimental systems. Potential applications that can be used or modeled with spheroi… ▽ More

    Submitted 9 February, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 14 pages; 10 figures; accepted to appear in IEEE Transactions on Molecular, Biological, and Multi-Scale Communications. This work was presented in part at the 2023 IEEE International Conference on Communication arXiv:2302.09265

  7. arXiv:2311.18645  [pdf, other

    cs.CV cs.AI

    Stochastic Vision Transformers with Wasserstein Distance-Aware Attention

    Authors: Franciskus Xaverius Erick, Mina Rezaei, Johanna Paula Müller, Bernhard Kainz

    Abstract: Self-supervised learning is one of the most promising approaches to acquiring knowledge from limited labeled data. Despite the substantial advancements made in recent years, self-supervised models have posed a challenge to practitioners, as they do not readily provide insight into the model's confidence and uncertainty. Tackling this issue is no simple feat, primarily due to the complexity involve… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2310.13290  [pdf, other

    cs.CL

    Interpreting Indirect Answers to Yes-No Questions in Multiple Languages

    Authors: Zijie Wang, Md Mosharaf Hossain, Shivam Mathur, Terry Cruz Melo, Kadir Bulut Ozler, Keun Hee Park, Jacob Quintero, MohammadHossein Rezaei, Shreya Nupur Shakya, Md Nayem Uddin, Eduardo Blanco

    Abstract: Yes-no questions expect a yes or no for an answer, but people often skip polar keywords. Instead, they answer with long explanations that must be interpreted. In this paper, we focus on this challenging problem and release new benchmarks in eight languages. We present a distant supervision approach to collect training data. We also demonstrate that direct answers (i.e., with polar keywords) are us… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  9. arXiv:2310.06514  [pdf, other

    cs.LG

    AttributionLab: Faithfulness of Feature Attribution Under Controllable Environments

    Authors: Yang Zhang, Yawei Li, Hannah Brown, Mina Rezaei, Bernd Bischl, Philip Torr, Ashkan Khakzar, Kenji Kawaguchi

    Abstract: Feature attribution explains neural network outputs by identifying relevant input features. The attribution has to be faithful, meaning that the attributed features must mirror the input features that influence the output. One recent trend to test faithfulness is to fit a model on designed data with known relevant features and then compare attributions with ground truth input features.This idea as… ▽ More

    Submitted 14 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Appear at NeurIPS 2023 Workshop XAIA

  10. arXiv:2310.03174  [pdf, other

    cs.LG cs.SE

    Test Case Recommendations with Distributed Representation of Code Syntactic Features

    Authors: Mosab Rezaei, Hamed Alhoori, Mona Rahimi

    Abstract: Frequent modifications of unit test cases are inevitable due to software's continuous underlying changes in source code, design, and requirements. Since manually maintaining software test suites is tedious, timely, and costly, automating the process of generation and maintenance of test units will significantly impact the effectiveness and efficiency of software testing processes. To this end, w… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures, 14th Workshop on Automating Test Case Design, Selection and Evaluation (A-TEST 2023) co-located with 38th IEEE/ACM International Conference on ASE 2023 conference

  11. arXiv:2309.02048  [pdf, other

    cs.LG stat.ML

    Probabilistic Self-supervised Learning via Scoring Rules Minimization

    Authors: Amirhossein Vahidi, Simon Schoßer, Lisa Wimmer, Yawei Li, Bernd Bischl, Eyke Hüllermeier, Mina Rezaei

    Abstract: In this paper, we propose a novel probabilistic self-supervised learning via Scoring Rule Minimization (ProSMIN), which leverages the power of probabilistic models to enhance representation quality and mitigate collapsing representations. Our proposed approach involves two neural networks; the online network and the target network, which collaborate and learn the diverse distribution of representa… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  12. arXiv:2308.14705  [pdf, other

    stat.ML cs.LG

    Diversified Ensemble of Independent Sub-Networks for Robust Self-Supervised Representation Learning

    Authors: Amirhossein Vahidi, Lisa Wimmer, Hüseyin Anil Gündüz, Bernd Bischl, Eyke Hüllermeier, Mina Rezaei

    Abstract: Ensembling a neural network is a widely recognized approach to enhance model performance, estimate uncertainty, and improve robustness in deep supervised learning. However, deep ensembles often come with high computational costs and memory demands. In addition, the efficiency of a deep ensemble is related to diversity among the ensemble members which is challenging for large, over-parameterized de… ▽ More

    Submitted 1 September, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  13. arXiv:2308.10033  [pdf

    q-bio.TO cs.CV eess.IV

    CRC-ICM: Colorectal Cancer Immune Cell Markers Pattern Dataset

    Authors: Zahra Mokhtari, Elham Amjadi, Hamidreza Bolhasani, Zahra Faghih, AmirReza Dehghanian, Marzieh Rezaei

    Abstract: Colorectal Cancer (CRC) is the second most common cause of cancer death in the world, ad can be identified by the location of the primary tumor in the large intestine: right and left colon, and rectum. Based on the location, CRC shows differences in chromosomal and molecular characteristics, microbiomes incidence, pathogenesis, and outcome. It has been shown that tumors on left and right sides als… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

  14. arXiv:2308.08949  [pdf, other

    cs.LG cs.AI

    A Dual-Perspective Approach to Evaluating Feature Attribution Methods

    Authors: Yawei Li, Yang Zhang, Kenji Kawaguchi, Ashkan Khakzar, Bernd Bischl, Mina Rezaei

    Abstract: Feature attribution methods attempt to explain neural network predictions by identifying relevant features. However, establishing a cohesive framework for assessing feature attribution remains a challenge. There are several views through which we can evaluate attributions. One principal lens is to observe the effect of perturbing attributed features on the model's behavior (i.e., faithfulness). Wh… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 16 pages, 14 figures

  15. arXiv:2306.07077   

    cs.LG stat.ML

    Latent Dynamical Implicit Diffusion Processes

    Authors: Mohammad R. Rezaei

    Abstract: Latent dynamical models are commonly used to learn the distribution of a latent dynamical process that represents a sequence of noisy data samples. However, producing samples from such models with high fidelity is challenging due to the complexity and variability of latent and observation dynamics. Recent advances in diffusion-based generative models, such as DDPM and NCSN, have shown promising al… ▽ More

    Submitted 16 August, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: I request a withdrawal because there are no experiments with real-world datasets and also the method section requires major changes to look mathematically sounds

  16. arXiv:2305.16031  [pdf, other

    cs.CL

    Efficient Document Embeddings via Self-Contrastive Bregman Divergence Learning

    Authors: Daniel Saggau, Mina Rezaei, Bernd Bischl, Ilias Chalkidis

    Abstract: Learning quality document embeddings is a fundamental problem in natural language processing (NLP), information retrieval (IR), recommendation systems, and search engines. Despite recent advances in the development of transformer-based models that produce sentence embeddings with self-contrastive learning, the encoding of long documents (Ks of words) is still challenging with respect to both effic… ▽ More

    Submitted 26 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 5 pages, short paper at Findings of ACL 2023

  17. arXiv:2305.15421  [pdf

    eess.IV cs.CV cs.LG

    Generative Adversarial Networks for Brain Images Synthesis: A Review

    Authors: Firoozeh Shomal Zadeh, Sevda Molani, Maysam Orouskhani, Marziyeh Rezaei, Mehrzad Shafiei, Hossein Abbasi

    Abstract: In medical imaging, image synthesis is the estimation process of one image (sequence, modality) from another image (sequence, modality). Since images with different modalities provide diverse biomarkers and capture various features, multi-modality imaging is crucial in medicine. While multi-screening is expensive, costly, and time-consuming to report by radiologists, image synthesis methods are ca… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 9 pages, 3 tabels, 4 figures

    MSC Class: 68T07 ACM Class: I.2.m

  18. arXiv:2305.10421  [pdf

    eess.IV cs.LG cs.NE

    Evolving Tsukamoto Neuro Fuzzy Model for Multiclass Covid 19 Classification with Chest X Ray Images

    Authors: Marziyeh Rezaei, Sevda Molani, Negar Firoozeh, Hossein Abbasi, Farzan Vahedifard, Maysam Orouskhani

    Abstract: Du e to rapid population growth and the need to use artificial intelligence to make quick decisions, developing a machine learning-based disease detection model and abnormality identification system has greatly improved the level of medical diagnosis Since COVID-19 has become one of the most severe diseases in the world, developing an automatic COVID-19 detection framework helps medical doctors in… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 14 pages, 5 figures, 3 tables

    MSC Class: 68W50 ACM Class: I.5.0

  19. arXiv:2305.01111  [pdf, other

    cs.CV cs.AI cs.LG

    Local and Global Contextual Features Fusion for Pedestrian Intention Prediction

    Authors: Mohsen Azarmi, Mahdi Rezaei, Tanveer Hussain, Chenghao Qian

    Abstract: Autonomous vehicles (AVs) are becoming an indispensable part of future transportation. However, safety challenges and lack of reliability limit their real-world deployment. Towards boosting the appearance of AVs on the roads, the interaction of AVs with pedestrians including "prediction of the pedestrian crossing intention" deserves extensive research. This is a highly challenging task as involves… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  20. arXiv:2305.01096  [pdf

    cs.RO cs.AI cs.LG

    A Novel Model for Driver Lane Change Prediction in Cooperative Adaptive Cruise Control Systems

    Authors: Armin Nejadhossein Qasemabadi, Saeed Mozaffari, Mahdi Rezaei, Majid Ahmadi, Shahpour Alirezaee

    Abstract: Accurate lane change prediction can reduce potential accidents and contribute to higher road safety. Adaptive cruise control (ACC), lane departure avoidance (LDA), and lane keeping assistance (LKA) are some conventional modules in advanced driver assistance systems (ADAS). Thanks to vehicle-to-vehicle communication (V2V), vehicles can share traffic information with surrounding vehicles, enabling c… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  21. arXiv:2305.01095  [pdf

    cs.RO cs.AI cs.LG

    LSTM-based Preceding Vehicle Behaviour Prediction during Aggressive Lane Change for ACC Application

    Authors: Rajmeet Singh, Saeed Mozaffari, Mahdi Rezaei, Shahpour Alirezaee

    Abstract: The development of Adaptive Cruise Control (ACC) systems aims to enhance the safety and comfort of vehicles by automatically regulating the speed of the vehicle to ensure a safe gap from the preceding vehicle. However, conventional ACC systems are unable to adapt themselves to changing driving conditions and drivers' behavior. To address this limitation, we propose a Long Short-Term Memory (LSTM)… ▽ More

    Submitted 5 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  22. arXiv:2304.08500  [pdf

    cs.LG

    A comparison between Recurrent Neural Networks and classical machine learning approaches In Laser induced breakdown spectroscopy

    Authors: Fatemeh Rezaei, Pouriya Khaliliyan, Mohsen Rezaei, Parvin Karimi, Behnam Ashrafkhani

    Abstract: Recurrent Neural Networks are classes of Artificial Neural Networks that establish connections between different nodes form a directed or undirected graph for temporal dynamical analysis. In this research, the laser induced breakdown spectroscopy (LIBS) technique is used for quantitative analysis of aluminum alloys by different Recurrent Neural Network (RNN) architecture. The fundamental harmonic… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  23. arXiv:2303.15147  [pdf, other

    cs.CV cs.AI

    Pushing the Envelope for Depth-Based Semi-Supervised 3D Hand Pose Estimation with Consistency Training

    Authors: Mohammad Rezaei, Farnaz Farahanipad, Alex Dillhoff, Vassilis Athitsos

    Abstract: Despite the significant progress that depth-based 3D hand pose estimation methods have made in recent years, they still require a large amount of labeled training data to achieve high accuracy. However, collecting such data is both costly and time-consuming. To tackle this issue, we propose a semi-supervised method to significantly reduce the dependence on labeled training data. The proposed metho… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  24. arXiv:2303.07541  [pdf, ps, other

    cs.HC

    Young Humans Make Change, Young Users Click: Creating Youth-Centered Networked Social Movements

    Authors: Mina Rezaei, Patsy Eubanks Owens

    Abstract: From the urbanists' perspective, the everyday experience of young people, as an underrepresented group in the design of public spaces, includes tactics they use to challenge the strategies which rule over urban spaces. In this regard, youth led social movements are a set of collective tactics which groups of young people use to resist power structures. Social informational streams have revolutioni… ▽ More

    Submitted 23 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Journal ref: CHI 2023 Workshop titled "Supporting Social Movements Through HCI and Design"

  25. arXiv:2302.09265  [pdf, other

    cs.ET physics.bio-ph

    Diffusive Molecular Communication with a Spheroidal Receiver for Organ-on-Chip Systems

    Authors: Hamidreza Arjmandi, Mohamad Zoofaghari, Mitra Rezaei, Kajsa Kanebratt, Liisa Vilen, David Janzen, Peter Gennemark, Adam Noel

    Abstract: Realistic models of the components and processes are required for molecular communication (MC) systems. In this paper, a spheroidal receiver structure is proposed for MC that is inspired by the 3D cell cultures known as spheroids being widely used in organ-on-chip systems. A simple diffusive MC system is considered where the spheroidal receiver and a point source transmitter are in an unbounded fl… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

  26. arXiv:2212.08568  [pdf, other

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  27. arXiv:2212.07560  [pdf, other

    cs.CV cs.NE cs.RO eess.IV

    Multi-level and multi-modal feature fusion for accurate 3D object detection in Connected and Automated Vehicles

    Authors: Yiming Hou, Mahdi Rezaei, Richard Romano

    Abstract: Aiming at highly accurate object detection for connected and automated vehicles (CAVs), this paper presents a Deep Neural Network based 3D object detection model that leverages a three-stage feature extractor by developing a novel LIDAR-Camera fusion scheme. The proposed feature extractor extracts high-level features from two input sensory modalities and recovers the important features discarded d… ▽ More

    Submitted 19 December, 2022; v1 submitted 14 December, 2022; originally announced December 2022.

  28. arXiv:2211.09979  [pdf

    cs.CV eess.IV

    Comparison between EM and FCM algorithms in skin tone extraction

    Authors: Elham Ravanbakhsh, Mosab Rezaei, Ehsan Namjoo, Padideh Choobdar

    Abstract: This study aims to investigate implementing EM and FCM algorithms for skin color extraction. The capabilities of three well-known color spaces, namely, RGB, HSV, and YCbCr for skin-tone extraction are assessed by using statistical modeling of skin tones using EM and FCM algorithms. The results show that utilizing a Gaussian mixture model for parametric modeling of skin tones using EM algorithm wor… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 2016 1st International Conference on New Research Achievements in Electrical and Computer Engineering (ICNRAECE)

  29. arXiv:2210.15674  [pdf, other

    cs.LG cs.AI stat.ML

    Reverse Survival Model (RSM): A Pipeline for Explaining Predictions of Deep Survival Models

    Authors: Mohammad R. Rezaei, Reza Saadati Fard, Ebrahim Pourjafari, Navid Ziaei, Amir Sameizadeh, Mohammad Shafiee, Mohammad Alavinia, Mansour Abolghasemian, Nick Sajadi

    Abstract: The aim of survival analysis in healthcare is to estimate the probability of occurrence of an event, such as a patient's death in an intensive care unit (ICU). Recent developments in deep neural networks (DNNs) for survival analysis show the superiority of these models in comparison with other well-known models in survival analysis applications. Ensuring the reliability and explainability of deep… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  30. arXiv:2209.06941  [pdf, other

    cs.CV cs.LG

    Joint Debiased Representation and Image Clustering Learning with Self-Supervision

    Authors: Shunjie-Fabian Zheng, JaeEun Nam, Emilio Dorigatti, Bernd Bischl, Shekoofeh Azizi, Mina Rezaei

    Abstract: Contrastive learning is among the most successful methods for visual representation learning, and its performance can be further improved by jointly performing clustering on the learned representations. However, existing methods for joint clustering and contrastive learning do not perform well on long-tailed data distributions, as majority classes overwhelm and distort the loss of minority classes… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  31. arXiv:2209.05734  [pdf

    cs.HC

    Evaluating User Experience in Literary and Film Geography-based Apps with a Cartographical User-Centered Design Lens

    Authors: Mina Rezaei, Patsy Eubanks Owens, Darnel Degand

    Abstract: Geography scholarship currently includes interdisciplinary approaches and theories and reflects shifts in research methodologies. Since the spatial turn in geographical thought and the emergence of geo-web technologies, geography scholarship has leaned more toward interdisciplinarity. In recent years geographical research methods have relied on various disciplines ranging from data science to arts… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  32. arXiv:2209.02459  [pdf, other

    cs.LG

    Robust and Efficient Imbalanced Positive-Unlabeled Learning with Self-supervision

    Authors: Emilio Dorigatti, Jonas Schweisthal, Bernd Bischl, Mina Rezaei

    Abstract: Learning from positive and unlabeled (PU) data is a setting where the learner only has access to positive and unlabeled samples while having no information on negative examples. Such PU setting is of great importance in various tasks such as medical diagnosis, social network analysis, financial markets analysis, and knowledge base completion, which also tend to be intrinsically imbalanced, i.e., w… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  33. arXiv:2208.01371  [pdf

    cs.CL

    Multi-Module G2P Converter for Persian Focusing on Relations between Words

    Authors: Mahdi Rezaei, Negar Nayeri, Saeed Farzi, Hossein Sameti

    Abstract: In this paper, we investigate the application of end-to-end and multi-module frameworks for G2P conversion for the Persian language. The results demonstrate that our proposed multi-module G2P system outperforms our end-to-end systems in terms of accuracy and speed. The system consists of a pronunciation dictionary as our look-up table, along with separate models to handle homographs, OOVs and ezaf… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 10 pages, 4 figures

    ACM Class: I.2.7

  34. arXiv:2206.07117  [pdf, other

    cs.CV

    TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose Estimation

    Authors: Mohammad Rezaei, Razieh Rastgoo, Vassilis Athitsos

    Abstract: 3D hand pose estimation methods have made significant progress recently. However, the estimation accuracy is often far from sufficient for specific real-world applications, and thus there is significant room for improvement. This paper proposes TriHorn-Net, a novel model that uses specific innovations to improve hand pose estimation accuracy on depth images. The first innovation is the decompositi… ▽ More

    Submitted 26 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

  35. arXiv:2206.00050  [pdf, other

    cs.LG

    FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

    Authors: Mehmet Ozgur Turkoglu, Alexander Becker, Hüseyin Anil Gündüz, Mina Rezaei, Bernd Bischl, Rodrigo Caye Daudt, Stefano D'Aronco, Jan Dirk Wegner, Konrad Schindler

    Abstract: The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computatio… ▽ More

    Submitted 19 December, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: accepted at NeurIPS 2022

  36. arXiv:2205.10947  [pdf, other

    cs.LG stat.ML

    Deep Direct Discriminative Decoders for High-dimensional Time-series Data Analysis

    Authors: Mohammad R. Rezaei, Milos R. Popovic, Milad Lankarany, Ali Yousefi

    Abstract: The state-space models (SSMs) are widely utilized in the analysis of time-series data. SSMs rely on an explicit definition of the state and observation processes. Characterizing these processes is not always easy and becomes a modeling challenge when the dimension of observed data grows or the observed data distribution deviates from the normal distribution. Here, we propose a new formulation of S… ▽ More

    Submitted 3 July, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

  37. arXiv:2204.04542  [pdf, other

    cs.LG

    Survival Seq2Seq: A Survival Model based on Sequence to Sequence Architecture

    Authors: Ebrahim Pourjafari, Navid Ziaei, Mohammad R. Rezaei, Amir Sameizadeh, Mohammad Shafiee, Mohammad Alavinia, Mansour Abolghasemian, Nick Sajadi

    Abstract: This paper introduces a novel non-parametric deep model for estimating time-to-event (survival analysis) in presence of censored data and competing risks. The model is designed based on the sequence-to-sequence (Seq2Seq) architecture, therefore we name it Survival Seq2Seq. The first recurrent neural network (RNN) layer of the encoder of our model is made up of Gated Recurrent Unit with Decay (GRU-… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

  38. arXiv:2204.01729  [pdf, other

    eess.IV cs.CV cs.LG

    Analyzing the Effects of Handling Data Imbalance on Learned Features from Medical Images by Looking Into the Models

    Authors: Ashkan Khakzar, Yawei Li, Yang Zhang, Mirac Sanisoglu, Seong Tae Kim, Mina Rezaei, Bernd Bischl, Nassir Navab

    Abstract: One challenging property lurking in medical datasets is the imbalanced data distribution, where the frequency of the samples between the different classes is not balanced. Training a model on an imbalanced dataset can introduce unique challenges to the learning problem where a model is biased towards the highly frequent class. Many methods are proposed to tackle the distributional differences and… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  39. arXiv:2201.13192  [pdf, other

    stat.ML cs.LG

    Uncertainty-aware Pseudo-label Selection for Positive-Unlabeled Learning

    Authors: Emilio Dorigatti, Jann Goschenhofer, Benjamin Schubert, Mina Rezaei, Bernd Bischl

    Abstract: Positive-unlabeled learning (PUL) aims at learning a binary classifier from only positive and unlabeled training data. Even though real-world applications often involve imbalanced datasets where the majority of examples belong to one class, most contemporary approaches to PUL do not investigate performance in this setting, thus severely limiting their applicability in practice. In this work, we th… ▽ More

    Submitted 10 March, 2024; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 25 pages, 4 figures

  40. arXiv:2109.10777  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Variational Clustering Framework for Self-labeling of Large-scale Medical Images

    Authors: Farzin Soleymani, Mohammad Eslami, Tobias Elze, Bernd Bischl, Mina Rezaei

    Abstract: We propose a Deep Variational Clustering (DVC) framework for unsupervised representation learning and clustering of large-scale medical images. DVC simultaneously learns the multivariate Gaussian posterior through the probabilistic convolutional encoder and the likelihood distribution with the probabilistic convolutional decoder; and optimizes cluster labels assignment. Here, the learned multivari… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:2109.05232

  41. arXiv:2109.09435  [pdf

    cs.LG cs.AI eess.SP

    Incremental Learning Techniques for Online Human Activity Recognition

    Authors: Meysam Vakili, Masoumeh Rezaei

    Abstract: Unobtrusive and smart recognition of human activities using smartphones inertial sensors is an interesting topic in the field of artificial intelligence acquired tremendous popularity among researchers, especially in recent years. A considerable challenge that needs more attention is the real-time detection of physical activities, since for many real-world applications such as health monitoring an… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures, 7 tables

  42. arXiv:2109.09165  [pdf, other

    cs.CV cs.AI cs.LG

    Traffic-Net: 3D Traffic Monitoring Using a Single Camera

    Authors: Mahdi Rezaei, Mohsen Azarmi, Farzam Mohammad Pour Mir

    Abstract: Computer Vision has played a major role in Intelligent Transportation Systems (ITS) and traffic surveillance. Along with the rapidly growing automated vehicles and crowded cities, the automated and advanced traffic management systems (ATMS) using video surveillance infrastructures have been evolved by the implementation of Deep Neural Networks. In this research, we provide a practical platform for… ▽ More

    Submitted 2 July, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

  43. arXiv:2109.07455  [pdf, other

    cs.CV cs.AI cs.LG

    Deep Bregman Divergence for Contrastive Learning of Visual Representations

    Authors: Mina Rezaei, Farzin Soleymani, Bernd Bischl, Shekoofeh Azizi

    Abstract: Deep Bregman divergence measures divergence of data points using neural networks which is beyond Euclidean distance and capable of capturing divergence over distributions. In this paper, we propose deep Bregman divergences for contrastive learning of visual representation where we aim to enhance contrastive loss used in self-supervised learning by training additional networks based on functional B… ▽ More

    Submitted 22 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

  44. arXiv:2109.05232  [pdf, other

    cs.CV

    Joint Debiased Representation Learning and Imbalanced Data Clustering

    Authors: Mina Rezaei, Emilio Dorigatti, David Ruegamer, Bernd Bischl

    Abstract: One of the most promising approaches for unsupervised learning is combining deep representation learning and deep clustering. Some recent works propose to simultaneously learn representation using deep neural networks and perform clustering by defining a clustering loss on top of embedded features. However, these approaches are sensitive to imbalanced data and out-of-distribution samples. As a con… ▽ More

    Submitted 6 September, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

  45. arXiv:2105.14383  [pdf, other

    cs.NE

    Gradient-Free Neural Network Training via Synaptic-Level Reinforcement Learning

    Authors: Aman Bhargava, Mohammad R. Rezaei, Milad Lankarany

    Abstract: An ongoing challenge in neural information processing is: how do neurons adjust their connectivity to improve task performance over time (i.e., actualize learning)? It is widely believed that there is a consistent, synaptic-level learning mechanism in specific brain regions that actualizes learning. However, the exact nature of this mechanism remains unclear. Here we propose an algorithm based on… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

    Comments: 10 pages, 3 figures, submitted to NeurIPS 2021

    MSC Class: 68T07 ACM Class: I.2.6

  46. Applications of Deep Learning Techniques for Automated Multiple Sclerosis Detection Using Magnetic Resonance Imaging: A Review

    Authors: Afshin Shoeibi, Marjane Khodatars, Mahboobeh Jafari, Parisa Moridian, Mitra Rezaei, Roohallah Alizadehsani, Fahime Khozeimeh, Juan Manuel Gorriz, Jónathan Heras, Maryam Panahiazar, Saeid Nahavandi, U. Rajendra Acharya

    Abstract: Multiple Sclerosis (MS) is a type of brain disease which causes visual, sensory, and motor problems for people with a detrimental effect on the functioning of the nervous system. In order to diagnose MS, multiple screening methods have been proposed so far; among them, magnetic resonance imaging (MRI) has received considerable attention among physicians. MRI modalities provide physicians with fund… ▽ More

    Submitted 9 August, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Journal ref: Computers in Biology and Medicine,Volume 136,2021,104697

  47. arXiv:2105.00499  [pdf, other

    cs.RO cs.AI cs.LG cs.MA

    Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning

    Authors: Saeed Tafazzol, Erfan Fathi, Mahdi Rezaei, Ehsan Asali

    Abstract: Reward engineering and designing an incentive reward function are non-trivial tasks to train agents in complex environments. Furthermore, an inaccurate reward function may lead to a biased behaviour which is far from an efficient and optimised behaviour. In this paper, we focus on training a single agent to score goals with binary success/failure reward function in Half Field Offense domain. As th… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

  48. arXiv:2102.04238  [pdf

    cs.IR cs.LG

    Amazon Product Recommender System

    Authors: Mohammad R. Rezaei

    Abstract: The number of reviews on Amazon has grown significantly over the years. Customers who made purchases on Amazon provide reviews by rating the product from 1 to 5 stars and sharing a text summary of their experience and opinion of the product. The ratings of a product are averaged to provide an overall product rating. We analyzed what ratings score customers give to a specific product (a music track… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

  49. arXiv:2011.13851  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.IV

    Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning

    Authors: Soheil Khatibi, Meisam Teimouri, Mahdi Rezaei

    Abstract: In this paper, we present an active vision method using a deep reinforcement learning approach for a humanoid soccer-playing robot. The proposed method adaptively optimises the viewpoint of the robot to acquire the most useful landmarks for self-localisation while keeping the ball into its viewpoint. Active vision is critical for humanoid decision-maker robots with a limited field of view. To deal… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

    Comments: The paper has been accepted in ICAART 2021

  50. arXiv:2008.11672  [pdf, other

    cs.CV cs.LG eess.IV physics.med-ph

    DeepSOCIAL: Social Distancing Monitoring and Infection Risk Assessment in COVID-19 Pandemic

    Authors: Mahdi Rezaei, Mohsen Azarmi

    Abstract: Social distancing is a recommended solution by the World Health Organisation (WHO) to minimise the spread of COVID-19 in public places. The majority of governments and national health authorities have set the 2-meter physical distancing as a mandatory safety measure in shopping centres, schools and other covered areas. In this research, we develop a hybrid Computer Vision and YOLOv4-based Deep Neu… ▽ More

    Submitted 28 November, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Journal ref: Applied Sciences. 2020, 10, 7514