Showing 1–12 of 12 results for author: Rohanian, O

  1. arXiv:2407.10086  [pdf, other]

    cs.CL cs.AI

    Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Olena Seminog, Rodrigo Furst, Thomas Mendy, Shanthi Levanita, Zaharat Kadri-Alabi, Nusrat Jabin, Daniela Toale, Georgina Humphreys, Emilia Antonio, Adrian Bucher, Alice Norton, David A. Clifton

    Abstract: This paper introduces the Pandemic PACT Advanced Categorisation Engine (PPACE) along with its associated dataset. PPACE is a fine-tuned model developed to automatically classify research abstracts from funded biomedical projects according to WHO-aligned research priorities. This task is crucial for monitoring research trends and identifying gaps in global health preparedness and response. Our appr…

    Submitted 19 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  2. arXiv:2405.00716  [pdf, other]

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll…

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  3. arXiv:2402.10597  [pdf, other]

    cs.CL cs.AI

    Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

    Authors: Niall Taylor, Upamanyu Ghose, Omid Rohanian, Mohammadmahdi Nouriborji, Andrey Kormilitzin, David Clifton, Alejo Nevado-Holgado

    Abstract: The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models without the need for complete fine-tuning, using Parameter Efficient Fine-tuning (PEFT) methods. We present an investigation into the suitability of different PEFT…

    Submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2401.00579  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, David A. Clifton

    Abstract: Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evo…

    Submitted 31 December, 2023; originally announced January 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  5. arXiv:2302.04725  [pdf, other]

    cs.CL cs.AI cs.LG

    Lightweight Transformers for Clinical Natural Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton

    Abstract: Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT and BioClinicalBERT are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create…

    Submitted 9 February, 2023; originally announced February 2023.

    MSC Class: 68T50; ACM Class: I.2.7

  6. arXiv:2210.09440  [pdf, other]

    cs.CL cs.AI

    Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints

    Authors: Omid Rohanian, Hannah Jauncey, Mohammadmahdi Nouriborji, Vinod Kumar Chauhan, Bronner P. Gonçalves, Christiana Kartsonaki, ISARIC Clinical Characterisation Group, Laura Merson, David Clifton

    Abstract: Processing information locked within clinical health records is a challenging task that remains an active area of research in biomedical NLP. In this work, we evaluate a broad set of machine learning techniques ranging from simple RNNs to specialised transformers such as BioBERT on a dataset containing clinical notes along with a set of annotations indicating whether a sample is cancer-related or…

    Submitted 7 June, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    MSC Class: 68T50; ACM Class: I.2.7

  7. arXiv:2210.06425  [pdf, other]

    cs.CL cs.LG

    MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, Samaneh Kouchaki, David A. Clifton

    Abstract: Pre-trained Language Models (LMs) have become an integral part of Natural Language Processing (NLP) in recent years, due to their superior performance in downstream applications. In spite of this resounding success, the usability of LMs is constrained by computational and time complexity, along with their increasing size; an issue that has been referred to as 'overparameterisation'. Different stra…

    Submitted 30 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    MSC Class: 68T50; ACM Class: I.2.7

  8. arXiv:2209.03182  [pdf, ps, other]

    cs.CL cs.LG

    On the Effectiveness of Compact Biomedical Transformers

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Samaneh Kouchaki, David A. Clifton

    Abstract: Language models pre-trained on biomedical corpora, such as BioBERT, have recently shown promising results on downstream biomedical tasks. Many existing pre-trained models, on the other hand, are resource-intensive and computationally heavy owing to factors such as embedding size, hidden dimension, and number of layers. The natural language processing (NLP) community has developed numerous strategi…

    Submitted 7 September, 2022; originally announced September 2022.

    MSC Class: 68T50

  9. arXiv:2204.00556  [pdf, other]

    cs.CL cs.AI

    Nowruz at SemEval-2022 Task 7: Tackling Cloze Tests with Transformers and Ordinal Regression

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, David Clifton

    Abstract: This paper outlines the system using which team Nowruz participated in SemEval 2022 Task 7 Identifying Plausible Clarifications of Implicit and Underspecified Phrases for both subtasks A and B. Using a pre-trained transformer as a backbone, the model targeted the task of multi-task classification and ranking in the context of finding the best fillers for a cloze task related to instructional texts…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: SemEval 2022

    MSC Class: 68T50; ACM Class: I.2.7

  10. arXiv:2201.03004  [pdf, other]

    cs.LG cs.AI cs.CR

    Privacy-aware Early Detection of COVID-19 through Adversarial Training

    Authors: Omid Rohanian, Samaneh Kouchaki, Andrew Soltan, Jenny Yang, Morteza Rohanian, Yang Yang, David Clifton

    Abstract: Early detection of COVID-19 is an ongoing area of research that can help with triage, monitoring and general health assessment of potential patients and may reduce operational strain on hospitals that cope with the coronavirus pandemic. Different machine learning techniques have been used in the literature to detect coronavirus using routine clinical data (blood tests, and vital signs). Data breac…

    Submitted 9 January, 2022; originally announced January 2022.

    ACM Class: J.3

  11. arXiv:1902.10667  [pdf, other]

    cs.CL cs.AI

    Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions

    Authors: Omid Rohanian, Shiva Taslimipoor, Samaneh Kouchaki, Le An Ha, Ruslan Mitkov

    Abstract: We introduce a new method to tag Multiword Expressions (MWEs) using a linguistically interpretable language-independent deep learning architecture. We specifically target discontinuity, an under-explored aspect that poses a significant challenge to computational treatment of MWEs. Two neural architectures are explored: Graph Convolutional Network (GCN) and multi-head self-attention. GCN leverages…

    Submitted 25 April, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: Accepted at NAACL-HLT 2019

  12. arXiv:1809.03056  [pdf, other]

    cs.CL

    SHOMA at Parseme Shared Task on Automatic Identification of VMWEs: Neural Multiword Expression Tagging with High Generalisation

    Authors: Shiva Taslimipoor, Omid Rohanian

    Abstract: This paper presents a language-independent deep learning architecture adapted to the task of multiword expression (MWE) identification. We employ a neural architecture comprising convolutional and recurrent layers with the addition of an optional CRF layer at the top. This system participated in the open track of the Parseme shared task on automatic identification of verbal MWEs due to the use…

    Submitted 9 September, 2018; originally announced September 2018.