Showing 1–50 of 78 results for author: Rahimi, A

Searching in archive cs.
  1. arXiv:2407.04590  [pdf, other]

    cs.CV

    SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry

    Authors: Hafiz Mughees Ahmad, Afshin Rahimi

    Abstract: Workplace accidents continue to pose significant risks for human safety, particularly in industries such as construction and manufacturing, and the necessity for effective Personal Protective Equipment (PPE) compliance has become increasingly paramount. Our research focuses on the development of non-invasive techniques based on the Object Detection (OD) and Convolutional Neural Network (CNN) to de…

    Submitted 5 July, 2024; originally announced July 2024.

    ACM Class: I.2.10; I.4.8; I.4.9; I.5.1; I.5.4

  2. arXiv:2407.02060  [pdf, other]

    cs.LG cs.AI cs.SC

    Terminating Differentiable Tree Experts

    Authors: Jonathan Thomm, Michael Hersche, Giacomo Camposampiero, Aleksandar Terzić, Bernhard Schölkopf, Abbas Rahimi

    Abstract: We advance the recently proposed neuro-symbolic Differentiable Tree Machine, which learns tree operations using a combination of transformers and Tensor Product Representations. We investigate the architecture and propose two key components. We first remove a series of different transformer layers that are used in every step by introducing a mixture of experts. This results in a Differentiable Tre…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024

  3. arXiv:2406.19121  [pdf, other]

    cs.LG cs.AI cs.SC

    Towards Learning Abductive Reasoning using VSA Distributed Representations

    Authors: Giacomo Camposampiero, Michael Hersche, Aleksandar Terzić, Roger Wattenhofer, Abu Sebastian, Abbas Rahimi

    Abstract: We introduce the Abductive Rule Learner with Context-awareness (ARLC), a model that solves abstract reasoning tasks based on Learn-VRF. ARLC features a novel and more broadly applicable training objective for abductive reasoning, resulting in better interpretability and higher accuracy when solving Raven's progressive matrices (RPM). ARLC allows both programming domain knowledge and learning the r…

    Submitted 30 August, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy) 2024 [Spotlight]

  4. arXiv:2403.10569  [pdf, other]

    cs.LG cs.AI cs.CV

    Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment

    Authors: Atah Nuh Mih, Alireza Rahimi, Asfia Kawnine, Francis Palma, Monica Wachowicz, Rickey Dubay, Hung Cao

    Abstract: This paper proposes an optimization of an existing Deep Neural Network (DNN) that improves its hardware utilization and facilitates on-device training for resource-constrained edge environments. We implement efficient parameter reduction strategies on Xception that shrink the model size without sacrificing accuracy, thus decreasing memory utilization during training. We evaluate our model in two e…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.05355

  5. arXiv:2403.07851  [pdf, other]

    cs.LG cs.CV

    12 mJ per Class On-Device Online Few-Shot Class-Incremental Learning

    Authors: Yoga Esa Wibowo, Cristian Cioflan, Thorir Mar Ingolfsson, Michael Hersche, Leo Zhao, Abbas Rahimi, Luca Benini

    Abstract: Few-Shot Class-Incremental Learning (FSCIL) enables machine learning systems to expand their inference capabilities to new classes using only a few labeled examples, without forgetting the previously learned classes. Classical backpropagation-based learning and its variants are often unsuitable for battery-powered, memory-constrained systems at the extreme edge. In this work, we introduce Online F…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 tables, 3 figures. Accepted at IEEE DATE 2024

  6. arXiv:2402.05785  [pdf, other]

    cs.LG cs.AI cs.CL

    Limits of Transformer Language Models on Learning to Compose Algorithms

    Authors: Jonathan Thomm, Aleksandar Terzic, Giacomo Camposampiero, Michael Hersche, Bernhard Schölkopf, Abbas Rahimi

    Abstract: We analyze the capabilities of Transformer language models in learning compositional discrete tasks. To this end, we evaluate training LLaMA models and prompting GPT-4 and Gemini on four tasks demanding to learn a composition of several discrete sub-tasks. On both training LLaMA models from scratch and prompting on GPT-4 and Gemini, we measure how well these models can reuse primitives observable…

    Submitted 25 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  7. arXiv:2402.00243  [pdf, other]

    cs.CV

    Capacity Constraint Analysis Using Object Detection for Smart Manufacturing

    Authors: Hafiz Mughees Ahmad, Afshin Rahimi, Khizer Hayat

    Abstract: The increasing popularity of Deep Learning (DL) based Object Detection (OD) methods and their real-world applications have opened new venues in smart manufacturing. Traditional industries struck by capacity constraints after Coronavirus Disease (COVID-19) require non-invasive methods for in-depth operations' analysis to optimize and increase their revenue. In this study, we have initially develope…

    Submitted 31 January, 2024; originally announced February 2024.

    ACM Class: I.2.10; I.4.8; I.4.9; I.5.1; I.5.4

  8. arXiv:2401.16876  [pdf, other]

    cs.CV cs.LG

    Zero-shot Classification using Hyperdimensional Computing

    Authors: Samuele Ruffino, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Classification based on Zero-shot Learning (ZSL) is the ability of a model to classify inputs into novel classes on which the model has not previously seen any training examples. Providing an auxiliary descriptor in the form of a set of attributes describing the new classes involved in the ZSL-based classification is one of the favored approaches to solving this challenging task. In this work, ins…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: This is the extended version of a paper accepted in the Design, Automation, and Test in Europe Conference (DATE), 2024

  9. arXiv:2401.16024  [pdf, other]

    cs.LG cs.AI

    Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architectures

    Authors: Michael Hersche, Francesco di Stefano, Thomas Hofmann, Abu Sebastian, Abbas Rahimi

    Abstract: Abstract reasoning is a cornerstone of human intelligence, and replicating it with artificial intelligence (AI) presents an ongoing challenge. This study focuses on efficiently solving Raven's progressive matrices (RPM), a visual test for assessing abstract reasoning abilities, by using distributed computation and operators provided by vector-symbolic architectures (VSA). Instead of hard-coding th…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted in NeurIPS 2023 Workshop on MATH-AI

  10. arXiv:2312.05605  [pdf, other]

    cs.LG cs.CV

    TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing

    Authors: Aleksandar Terzic, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: MEGA is a recent transformer-based architecture, which utilizes a linear recurrent operator whose parallel computation, based on the FFT, scales as $O(L \log L)$, with $L$ being the sequence length. We build upon their approach by replacing the linear recurrence with a special temporal convolutional network which permits larger receptive field size with shallower networks, and reduces the computation…

    Submitted 9 December, 2023; originally announced December 2023.

  11. arXiv:2312.04540  [pdf, other]

    cs.LG cs.AI cs.CV cs.MA cs.RO

    Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations

    Authors: Yuejiang Liu, Ahmad Rahimi, Po-Chien Luan, Frano Rajič, Alexandre Alahi

    Abstract: Modeling spatial-temporal interactions among neighboring agents is at the heart of multi-agent problems such as motion forecasting and crowd navigation. Despite notable progress, it remains unclear to which extent modern representations can capture the causal relationships behind agent interactions. In this work, we take an in-depth look at the causal awareness of these representations, from compu…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Preprint

  12. arXiv:2312.02829  [pdf, other]

    cs.LG cs.AI stat.ML

    MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

    Authors: Nicolas Menet, Michael Hersche, Geethan Karunaratne, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: With the advent of deep learning, progressively larger neural networks have been designed to solve complex tasks. We take advantage of these capacity-rich models to lower the cost of inference by exploiting computation in superposition. To reduce the computational burden per input, we propose Multiple-Input-Multiple-Output Neural Networks (MIMONets) capable of handling many inputs at once. MIMONet…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: accepted in NeurIPS 2023

  13. arXiv:2309.08798  [pdf, other]

    cs.AI cs.CV

    D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

    Authors: Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix

    Abstract: Systematic generalization is a crucial aspect of intelligence, which refers to the ability to generalize to novel tasks by combining known subtasks and concepts. One critical factor that has been shown to influence systematic generalization is the diversity of training data. However, diversity can be defined in various ways, as data have many factors of variation. A more granular understanding of…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Under review, 15 pages

  14. arXiv:2308.08496  [pdf, other]

    cs.IR cs.AI

    Understanding User Intent Modeling for Conversational Recommender Systems: A Systematic Literature Review

    Authors: Siamak Farshidi, Kiyan Rezaee, Sara Mazaheri, Amir Hossein Rahimi, Ali Dadashzadeh, Morteza Ziabakhsh, Sadegh Eskandari, Slinger Jansen

    Abstract: Context: User intent modeling is a crucial process in Natural Language Processing that aims to identify the underlying purpose behind a user's request, enabling personalized responses. With a vast array of approaches introduced in the literature (over 13,000 papers in the last decade), understanding the related concepts and commonly used models in AI-based systems is essential. Method: We conducte…

    Submitted 5 August, 2023; originally announced August 2023.

  15. arXiv:2307.04599  [pdf, other]

    cs.SE cs.AI

    Bridging MDE and AI: A Systematic Review of Domain-Specific Languages and Model-Driven Practices in AI Software Systems Engineering

    Authors: Simon Raedler, Luca Berardinelli, Karolin Winter, Abbas Rahimi, Stefanie Rinderle-Ma

    Abstract: Background: Technical systems are growing in complexity with more components and functions across various disciplines. Model-Driven Engineering (MDE) helps manage this complexity by using models as key artifacts. Domain-Specific Languages (DSL) supported by MDE facilitate modeling. As data generation in product development increases, there's a growing demand for AI algorithms, which can be challeng…

    Submitted 6 May, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 57 pages, 2 figures, 8 tables

    ACM Class: A.1; H.1.0; I.2.4

  16. arXiv:2304.04687  [pdf, other]

    cs.CV cs.HC

    Learning to Detect Touches on Cluttered Tables

    Authors: Norberto Adrian Goussies, Kenji Hata, Shruthi Prabhakara, Abhishek Amit, Tony Aube, Carl Cepress, Diana Chang, Li-Te Cheng, Horia Stefan Ciurdar, Mike Cleron, Chelsey Fleming, Ashwin Ganti, Divyansh Garg, Niloofar Gheissari, Petra Luna Grutzik, David Hendon, Daniel Iglesia, Jin Kim, Stuart Kyle, Chris LaRosa, Roman Lewkow, Peter F McDermott, Chris Melancon, Paru Nackeeran, Neal Norwitz, et al. (6 additional authors not shown)

    Abstract: We present a novel self-contained camera-projector tabletop system with a lamp form-factor that brings digital intelligence to our tables. We propose a real-time, on-device, learning-based touch detection algorithm that makes any tabletop interactive. The top-down configuration and learning-based algorithm makes our method robust to the presence of clutter, a main limitation of existing camera-pro…

    Submitted 10 April, 2023; originally announced April 2023.

  17. arXiv:2303.13957  [pdf, other]

    cs.CV cs.LG cs.NE

    Factorizers for Distributed Sparse Block Codes

    Authors: Michael Hersche, Aleksandar Terzic, Geethan Karunaratne, Jovin Langenegger, Angéline Pouget, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Distributed sparse block codes (SBCs) exhibit compact representations for encoding and manipulating symbolic data structures using fixed-width vectors. One major challenge however is to disentangle, or factorize, the distributed representation of data structures into their constituent elements without having to search through all possible combinations. This factorization becomes more challenging w…

    Submitted 28 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted at Neurosymbolic Artificial Intelligence

  18. WHYPE: A Scale-Out Architecture with Wireless Over-the-Air Majority for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using long random vectors known as hypervectors. Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) has shown promise as it is very efficient in performing matrix-vector multiplications, which are common in the HDC algebra. Although H…

    Submitted 4 February, 2023; originally announced March 2023.

    Comments: Accepted at IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS). arXiv admin note: text overlap with arXiv:2205.10889

  19. arXiv:2301.01682  [pdf, other]

    cs.CE q-bio.QM

    DOT: A flexible multi-objective optimization framework for transferring features across single-cell and spatial omics

    Authors: Arezou Rahimi, Luis A. Vale-Silva, Maria Faelth Savitski, Jovan Tanevski, Julio Saez-Rodriguez

    Abstract: Single-cell RNA sequencing (scRNA-seq) and spatially-resolved imaging/sequencing technologies have revolutionized biomedical research. On one hand, scRNA-seq provides information about a large portion of the transcriptome for individual cells, but lacks the spatial context. On the other hand, spatially-resolved measurements come with a trade-off between resolution and gene coverage. Combining scRN…

    Submitted 21 July, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 36 pages, 6 figures

  20. arXiv:2301.01221  [pdf, other]

    cs.CR

    Unlocking Metaverse-as-a-Service The three pillars to watch: Privacy and Security, Edge Computing, and Blockchain

    Authors: Vesal Ahsani, Ali Rahimi, Mehdi Letafati, Babak Hossein Khalaj

    Abstract: In this article, the authors provide a comprehensive overview on three core pillars of metaverse-as-a-service (MaaS) platforms; privacy and security, edge computing, and blockchain technology. The article starts by investigating security aspects for the wireless access to the metaverse. Then it goes through the privacy and security issues inside the metaverse from data-centric, learning-centric, a…

    Submitted 11 January, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 21 pages, 4 figures, added references for section 3-A

  21. arXiv:2211.15351  [pdf, other]

    cs.CL cs.AI cs.LG

    Testing the effectiveness of saliency-based explainability in NLP using randomized survey-based experiments

    Authors: Adel Rahimi, Shaurya Jain

    Abstract: As the applications of Natural Language Processing (NLP) in sensitive areas like Political Profiling, Review of Essays in Education, etc. proliferate, there is a great need for increasing transparency in NLP models to build trust with stakeholders and identify biases. A lot of work in Explainable AI has aimed to devise explanation methods that give humans insights into the workings and predictions…

    Submitted 25 November, 2022; originally announced November 2022.

  22. arXiv:2211.05052  [pdf, other]

    cs.ET cs.CV cs.LG cs.NE

    In-memory factorization of holographic perceptual representations

    Authors: Jovin Langenegger, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Disentanglement of constituent factors of a sensory signal is central to perception and cognition and hence is a critical task for future artificial intelligence systems. In this paper, we present a compute engine capable of efficiently factorizing holographic perceptual representations by exploiting the computation-in-superposition capability of brain-inspired hyperdimensional computing and the i…

    Submitted 16 February, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 23 pages, 4 figures, 1 extended data figure, 3 supplementary notes, 2 supplementary figures and 3 supplementary tables

  23. arXiv:2207.06810  [pdf, other]

    cs.LG

    In-memory Realization of In-situ Few-shot Continual Learning with a Dynamically Evolving Explicit Memory

    Authors: Geethan Karunaratne, Michael Hersche, Jovin Langenegger, Giovanni Cherubini, Manuel Le Gallo-Bourdeau, Urs Egger, Kevin Brew, Sam Choi, INJO OK, Mary Claire Silvestre, Ning Li, Nicole Saulnier, Victor Chan, Ishtiaq Ahsan, Vijay Narayanan, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from a few training examples without forgetting previous old classes demands a flexible architecture with an inevitably growing portion of storage, in which new examples and classes can be incrementally stored and efficiently retrieved. One viable architectural solution is to tightly couple a stationary deep neural network to a dynamically evolving explicit memory…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at the European Solid-state Devices and Circuits Conference (ESSDERC), September 2022

  24. arXiv:2205.10889  [pdf, other]

    cs.AR

    Wireless On-Chip Communications for Scalable In-memory Hyperdimensional Computing

    Authors: Robert Guirado, Abbas Rahimi, Geethan Karunaratne, Eduard Alarcón, Abu Sebastian, Sergi Abadal

    Abstract: Hyperdimensional computing (HDC) is an emerging computing paradigm that represents, manipulates, and communicates data using very long random vectors (aka hypervectors). Among different hardware platforms capable of executing HDC algorithms, in-memory computing (IMC) systems have been recently proved to be one of the most energy-efficient options, due to hypervector manipulations in the memory its…

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: This paper has been accepted at 2022 IEEE International Joint Conference on Neural Networks (IJCNN)

  25. arXiv:2203.16588  [pdf, other]

    cs.CV cs.LG

    Constrained Few-shot Class-incremental Learning

    Authors: Michael Hersche, Geethan Karunaratne, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Continually learning new classes from fresh data without forgetting previous knowledge of old classes is a very challenging research problem. Moreover, it is imperative that such learning must respect certain memory and computational constraints such as (i) training samples are limited to only a few per class, (ii) the computational cost of learning a novel class remains constant, and (iii) the me…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 camera-ready version

  26. Generalized Key-Value Memory to Flexibly Adjust Redundancy in Memory-Augmented Networks

    Authors: Denis Kleyko, Geethan Karunaratne, Jan M. Rabaey, Abu Sebastian, Abbas Rahimi

    Abstract: Memory-augmented neural networks enhance a neural network with an external key-value memory whose complexity is typically dominated by the number of support vectors in the key memory. We propose a generalized key-value memory that decouples its dimension from the number of support vectors by introducing a free parameter that can arbitrarily add or remove redundancy to the key memory representation…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 8 pages, 7 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  27. arXiv:2203.04571  [pdf, other]

    cs.LG cs.AI cs.CV

    A Neuro-vector-symbolic Architecture for Solving Raven's Progressive Matrices

    Authors: Michael Hersche, Mustafa Zeqiri, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Neither deep neural networks nor symbolic AI alone has approached the kind of intelligence expressed in humans. This is mainly because neural networks are not able to decompose joint representations to obtain distinct objects (the so-called binding problem), while symbolic AI suffers from exhaustive rule searches, among other problems. These two problems are still pronounced in neuro-symbolic AI w…

    Submitted 3 March, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Updated version with additional NVSA end-to-end training, generalization experiments, and PGM experiments

  28. A Survey on Hyperdimensional Computing aka Vector Symbolic Architectures, Part II: Applications, Cognitive Models, and Challenges

    Authors: Denis Kleyko, Dmitri A. Rachkovskij, Evgeny Osipov, Abbas Rahimi

    Abstract: This is Part II of the two-part comprehensive survey devoted to a computing framework most commonly known under the names Hyperdimensional Computing and Vector Symbolic Architectures (HDC/VSA). Both names refer to a family of computational models that use high-dimensional distributed representations and rely on the algebraic properties of their key operations to incorporate the advantages of struc…

    Submitted 1 August, 2023; v1 submitted 12 November, 2021; originally announced December 2021.

    Comments: 37 pages

    Journal ref: ACM Computing Surveys (2023), vol. 55, no. 9

  29. arXiv:2112.03909  [pdf, other]

    cs.CV

    Vehicle trajectory prediction works, but not everywhere

    Authors: Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi, Mohammad Shaverdikondori, Amir-Hossein Shahidzadeh, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi

    Abstract: Vehicle trajectory prediction is nowadays a fundamental pillar of self-driving cars. Both the industry and research communities have acknowledged the need for such a pillar by providing public benchmarks. While state-of-the-art methods are impressive, i.e., they have no off-road prediction, their generalization to cities outside of the benchmark remains unexplored. In this work, we show that those…

    Submitted 29 March, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: CVPR 2022

  30. arXiv:2111.15199  [pdf, other]

    cs.CV

    Semi-Supervised 3D Hand Shape and Pose Estimation with Label Propagation

    Authors: Samira Kaviani, Amir Rahimi, Richard Hartley

    Abstract: To obtain 3D annotations, we are restricted to controlled environments or synthetic datasets, leading us to 3D datasets with less generalizability to real-world scenarios. To tackle this issue in the context of semi-supervised 3D hand shape and pose estimation, we propose the Pose Alignment network to propagate 3D annotations from labelled frames to nearby unlabelled frames in sparsely annotated v…

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: DICTA 2021

  31. arXiv:2111.06077  [pdf, other]

    cs.AI cs.LG

    A Survey on Hyperdimensional Computing aka Vector Symbolic Architectures, Part I: Models and Data Transformations

    Authors: Denis Kleyko, Dmitri A. Rachkovskij, Evgeny Osipov, Abbas Rahimi

    Abstract: This two-part comprehensive survey is devoted to a computing framework most commonly known under the names Hyperdimensional Computing and Vector Symbolic Architectures (HDC/VSA). Both names refer to a family of computational models that use high-dimensional distributed representations and rely on the algebraic properties of their key operations to incorporate the advantages of structured symbolic…

    Submitted 31 July, 2023; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 31 pages

    Journal ref: ACM Computing Surveys (2022), vol. 55, no. 6

  32. arXiv:2109.10444  [pdf, other]

    cs.CL

    Fairness-aware Class Imbalanced Learning

    Authors: Shivashankar Subramanian, Afshin Rahimi, Timothy Baldwin, Trevor Cohn, Lea Frermann

    Abstract: Class imbalance is a common challenge in many NLP tasks, and has clear connections to bias, in that bias in training data often leads to higher accuracy for majority groups at the expense of minority groups. However there has traditionally been a disconnect between research on class-imbalanced learning and mitigating bias, and only recently have the two been looked at through a common lens. In thi…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: To appear in EMNLP 2021

  33. arXiv:2107.10456  [pdf, other]

    cs.CV

    CogSense: A Cognitively Inspired Framework for Perception Adaptation

    Authors: Hyukseong Kwon, Amir Rahimi, Kevin G. Lee, Amit Agarwal, Rajan Bhattacharyya

    Abstract: This paper proposes the CogSense system, which is inspired by sense-making cognition and perception in the mammalian brain to perform perception error detection and perception parameter adaptation using probabilistic signal temporal logic. As a specific application, a contrast-based perception adaption method is presented and validated. The proposed method evaluates perception errors using heterog…

    Submitted 22 July, 2021; originally announced July 2021.

  34. Energy Efficient In-memory Hyperdimensional Encoding for Spatio-temporal Signal Processing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Michael Hersche, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: The emerging brain-inspired computing paradigm known as hyperdimensional computing (HDC) has been proven to provide a lightweight learning framework for various cognitive tasks compared to the widely used deep learning-based approaches. Spatio-temporal (ST) signal processing, which encompasses biosignals such as electromyography (EMG) and electroencephalography (EEG), is one family of applications…

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 68, no. 5, pp. 1725-1729, May 2021

  35. Vector Symbolic Architectures as a Computing Framework for Emerging Hardware

    Authors: Denis Kleyko, Mike Davies, E. Paxon Frady, Pentti Kanerva, Spencer J. Kent, Bruno A. Olshausen, Evgeny Osipov, Jan M. Rabaey, Dmitri A. Rachkovskij, Abbas Rahimi, Friedrich T. Sommer

    Abstract: This article reviews recent progress in the development of the computing framework vector symbolic architectures (VSA) (also known as hyperdimensional computing). This framework is well suited for implementation in stochastic, emerging hardware, and it naturally expresses the types of cognitive operations required for artificial intelligence (AI). We demonstrate in this article that the field-like…

    Submitted 20 July, 2023; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 31 pages, 15 figures, 4 Tables

    Journal ref: Proceedings of the IEEE (2022), vol. 110, no. 10

  36. Saliency-Guided Deep Learning Network for Automatic Tumor Bed Volume Delineation in Post-operative Breast Irradiation

    Authors: Mahdieh Kazemimoghadam, Weicheng Chi, Asal Rahimi, Nathan Kim, Prasanna Alluri, Chika Nwachukwu, Weiguo Lu, Xuejun Gu

    Abstract: Efficient, reliable and reproducible target volume delineation is a key step in the effective planning of breast radiotherapy. However, post-operative breast target delineation is challenging as the contrast between the tumor bed volume (TBV) and normal breast tissue is relatively low in CT images. In this study, we propose to mimic the marker-guidance procedure in manual target delineation. We de…

    Submitted 26 July, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: https://iopscience.iop.org/article/10.1088/1361-6560/ac176d

    Journal ref: Physics in Medicine & Biology 2021

  37. arXiv:2104.11949  [pdf, other]

    eess.IV cs.CV cs.LG

    Automatic Diagnosis of COVID-19 from CT Images using CycleGAN and Transfer Learning

    Authors: Navid Ghassemi, Afshin Shoeibi, Marjane Khodatars, Jonathan Heras, Alireza Rahimi, Assef Zare, Ram Bilas Pachori, J. Manuel Gorriz

    Abstract: The outbreak of the corona virus disease (COVID-19) has changed the lives of most people on Earth. Given the high prevalence of this disease, its correct diagnosis in order to quarantine patients is of the utmost importance in steps of fighting this pandemic. Among the various modalities used for diagnosis, medical imaging, especially computed tomography (CT) imaging, has been the focus of many pr…

    Submitted 24 April, 2021; originally announced April 2021.

  38. arXiv:2103.14162  [pdf, other]

    cs.CV

    Few-shot Weakly-Supervised Object Detection via Directional Statistics

    Authors: Amirreza Shaban, Amir Rahimi, Thalaiyasingam Ajanthan, Byron Boots, Richard Hartley

    Abstract: Detecting novel objects from few examples has become an emerging topic in computer vision recently. However, these methods need fully annotated training images to learn new object categories which limits their applicability in real world scenarios such as field robotics. In this work, we propose a probabilistic multiple instance learning approach for few-shot Common Object Localization (COL) and f…

    Submitted 25 March, 2021; originally announced March 2021.

  39. arXiv:2103.01344  [pdf, ps, other]

    cs.CR cs.IT

    Multi-Party Proof Generation in QAP-based zk-SNARKs

    Authors: Ali Rahimi, Mohammad Ali Maddah-Ali

    Abstract: Zero-knowledge succinct non-interactive argument of knowledge (zkSNARK) allows a party, known as the prover, to convince another party, known as the verifier, that he knows a private value $v$, without revealing it, such that $F(u,v)=y$ for some function $F$ and public values $u$ and $y$. There are various versions of zk-SNARK, among them, Quadratic Arithmetic Program (QAP)-based zk-SNARK has been…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: 31 pages, 2 figures

  40. arXiv:2102.09107  [pdf]

    econ.GN cs.CY

    A Core of E-Commerce Customer Experience based on Conversational Data using Network Text Methodology

    Authors: Andry Alamsyah, Nurlisa Laksmiani, Lies Anisa Rahimi

    Abstract: E-commerce provides an efficient and effective way to exchange goods between sellers and customers. E-commerce has been a popular method for doing business, because of its simplicity of having commerce activity transparently available, including customer voice and opinion about their own experience. Those experiences can be a great benefit to understand customer experience comprehensively, both fo…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 9 pages, 1 figure, 4 tables

    MSC Class: 00-XX ACM Class: K.4

    Journal ref: International Journal of Business, 2018, 23(3)

  41. arXiv:2102.02758  [pdf, other]

    eess.SP cs.AI eess.SY

    A 5 μW Standard Cell Memory-based Configurable Hyperdimensional Computing Accelerator for Always-on Smart Sensing

    Authors: Manuel Eggimann, Abbas Rahimi, Luca Benini

    Abstract: Hyperdimensional computing (HDC) is a brain-inspired computing paradigm based on high-dimensional holistic representations of vectors. It recently gained attention for embedded smart sensing due to its inherent error-resiliency and suitability to highly parallel hardware implementations. In this work, we propose a programmable all-digital CMOS implementation of a fully autonomous HDC accelerator f…

    Submitted 4 February, 2021; originally announced February 2021.

  42. arXiv:2011.13115  [pdf, ps, other]

    cs.CL

    Learning Causal Bayesian Networks from Text

    Authors: Farhad Moghimifar, Afshin Rahimi, Mahsa Baktashmotlagh, Xue Li

    Abstract: Causal relationships form the basis for reasoning and decision-making in Artificial Intelligence systems. To exploit the large volume of textual data available today, the automatic discovery of causal relationships from text has emerged as a significant challenge in recent years. Existing approaches in this realm are limited to the extraction of low-level relations among individual events. To over…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: ALTA2020

  43. arXiv:2011.00677  [pdf, other]

    cs.CL

    IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP

    Authors: Fajri Koto, Afshin Rahimi, Jey Han Lau, Timothy Baldwin

    Abstract: Although the Indonesian language is spoken by almost 200 million people and is the 10th most spoken language in the world, it is under-represented in NLP research. Previous work on Indonesian has been hampered by a lack of annotated datasets, a sparsity of language resources, and a lack of resource standardization. In this work, we release the IndoLEM dataset comprising seven tasks for the Indonesian…

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted at COLING 2020 - The 28th International Conference on Computational Linguistics

  44. arXiv:2010.11273  [pdf, other]

    cs.LG cs.AI

    The Need for Standardized Explainability

    Authors: Othman Benchekroun, Adel Rahimi, Qini Zhang, Tetiana Kodliuk

    Abstract: Explainable AI (XAI) is paramount in industry-grade AI; however, existing methods fail to address this necessity, in part due to a lack of standardisation of explainability methods. The purpose of this paper is to offer a perspective on the current state of the area of explainability, and to provide novel definitions for Explainability and Interpretability to begin standardising this area of resear…

    Submitted 22 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted in 2nd ICML 2020 Workshop on Human in the Loop Learning

  45. arXiv:2010.08232  [pdf, other]

    cs.CL

    WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets

    Authors: Dat Quoc Nguyen, Thanh Vu, Afshin Rahimi, Mai Hoang Dao, Linh The Nguyen, Long Doan

    Abstract: In this paper, we provide an overview of the WNUT-2020 shared task on the identification of informative COVID-19 English Tweets. We describe how we construct a corpus of 10K Tweets and organize the development and evaluation phases for this task. In addition, we also present a brief summary of results obtained from the final system evaluation submissions of 55 teams, finding that (i) many systems…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 6th Workshop on Noisy User-generated Text

  46. arXiv:2010.07004  [pdf, other]

    eess.SP cs.LG

    Binarization Methods for Motor-Imagery Brain-Computer Interface Classification

    Authors: Michael Hersche, Luca Benini, Abbas Rahimi

    Abstract: Successful motor-imagery brain-computer interface (MI-BCI) algorithms either extract a large number of handcrafted features and train a classifier, or combine feature extraction and classification within deep convolutional neural networks (CNNs). Both approaches typically result in a set of real-valued weights that pose challenges when targeting real-time execution on tightly resource-constrained…

    Submitted 14 October, 2020; originally announced October 2020.

  47. Robust High-dimensional Memory-augmented Neural Networks

    Authors: Geethan Karunaratne, Manuel Schmuck, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Traditional neural networks require enormous amounts of data to build their complex mappings during a slow training procedure that hinders their abilities for relearning and adapting to new data. Memory-augmented neural networks enhance neural networks with an explicit memory to overcome these issues. Access to this explicit memory, however, occurs via soft read and write operations involving ever…

    Submitted 19 March, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: This is a pre-print of an article accepted for publication in Nature Communications

    Journal ref: Nature Communications volume 12, Article number: 2468 (2021)

  48. arXiv:2010.00287  [pdf, ps, other]

    cs.CL

    Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

    Authors: Ehsan Doostmohammadi, Minoo Nassajian, Adel Rahimi

    Abstract: Words are properly segmented in the Persian writing system; in practice, however, these writing rules are often neglected, resulting in single words being written disjointedly and multiple words written without any white spaces between them. This paper addresses the problems of word segmentation and zero-width non-joiner (ZWNJ) recognition in Persian, which we approach jointly as a sequence labeli…

    Submitted 28 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

  49. arXiv:2009.09474  [pdf, other]

    cs.CL cs.LG

    Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

    Authors: Ehsan Doostmohammadi, Minoo Nassajian, Adel Rahimi

    Abstract: Ezafe is a grammatical particle in some Iranian languages that links two words together. Regardless of the important information it conveys, it is almost always not indicated in Persian script, resulting in mistakes in reading complex sentences and errors in natural language processing tasks. In this paper, we experiment with different machine learning methods to achieve state-of-the-art results i…

    Submitted 4 October, 2020; v1 submitted 20 September, 2020; originally announced September 2020.

  50. arXiv:2006.12807  [pdf, other]

    cs.LG cs.CV stat.ML

    Post-hoc Calibration of Neural Networks by g-Layers

    Authors: Amir Rahimi, Thomas Mensink, Kartik Gupta, Thalaiyasingam Ajanthan, Cristian Sminchisescu, Richard Hartley

    Abstract: Calibration of neural networks is a critical aspect to consider when incorporating machine learning models in real-world decision-making systems where the confidence of decisions is as important as the decisions themselves. In recent years, there has been a surge of research on neural network calibration and the majority of the works can be categorized into post-hoc calibration methods, defined as…

    Submitted 21 February, 2022; v1 submitted 23 June, 2020; originally announced June 2020.