Showing 1–29 of 29 results for author: Barnes, J

Searching in archive cs.
  1. arXiv:2404.06996

    cs.CL cs.AI

    XNLIeu: a dataset for cross-lingual NLI in Basque

    Authors: Maite Heredia, Julen Etxaniz, Muitze Zulaika, Xabier Saralegi, Jeremy Barnes, Aitor Soroa

    Abstract: XNLI is a popular Natural Language Inference (NLI) benchmark widely used to evaluate cross-lingual Natural Language Understanding (NLU) capabilities across languages. In this paper, we expand XNLI to include Basque, a low-resource language that can greatly benefit from transfer-learning approaches. The new dataset, dubbed XNLIeu, has been developed by first machine-translating the English XNLI cor…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024

  2. English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts

    Authors: Patrick Bareiß, Roman Klinger, Jeremy Barnes

    Abstract: Emotion classification in text is a challenging task due to the processes involved when interpreting a textual description of a potential emotion stimulus. In addition, the set of emotion categories is highly domain-specific. For instance, literature analysis might require the use of aesthetic emotions (e.g., finding something beautiful), and social media analysis could benefit from fine-grained s…

    Submitted 7 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: published at the PromptEng workshop at TheWebConf

  3. arXiv:2308.01982

    eess.IV cs.CV q-bio.QM

    Predicting Ki67, ER, PR, and HER2 Statuses from H&E-stained Breast Cancer Images

    Authors: Amir Akbarnejad, Nilanjan Ray, Penny J. Barnes, Gilbert Bigras

    Abstract: Despite the advances in machine learning and digital pathology, it is not yet clear if machine learning methods can accurately predict molecular information merely from histomorphology. In a quest to answer this question, we built a large-scale dataset (185538 images) with reliable measurements for Ki67, ER, PR, and HER2 statuses. The dataset is composed of mirrored images of H&E and correspondin…

    Submitted 3 August, 2023; originally announced August 2023.

  4. arXiv:2301.12872

    astro-ph.EP astro-ph.IM astro-ph.SR cs.LG

    A Machine Learning approach for correcting radial velocities using physical observables

    Authors: M. Perger, G. Anglada-Escudé, D. Baroch, M. Lafarga, I. Ribas, J. C. Morales, E. Herrero, P. J. Amado, J. R. Barnes, J. A. Caballero, S. V. Jeffers, A. Quirrenbach, A. Reiners

    Abstract: Precision radial velocity (RV) measurements continue to be a key tool to detect and characterise extrasolar planets. While instrumental precision keeps improving, stellar activity remains a barrier to obtain reliable measurements below 1-2 m/s accuracy. Using simulations and real data, we investigate the capabilities of a Deep Neural Network approach to produce activity free Doppler measurements o…

    Submitted 30 January, 2023; originally announced January 2023.

    Journal ref: A&A 672, A118 (2023)

  5. arXiv:2210.06150

    cs.CL

    Annotating Norwegian Language Varieties on Twitter for Part-of-Speech

    Authors: Petter Mæhlum, Andre Kåsen, Samia Touileb, Jeremy Barnes

    Abstract: Norwegian Twitter data poses an interesting challenge for Natural Language Processing (NLP) tasks. These texts are difficult for models trained on standardized text in one of the two Norwegian written forms (Bokmål and Nynorsk), as they contain both the typical variation of social media text, as well as a large amount of dialectal variety. In this paper we present a novel Norwegian Twitter dataset…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects (Vardial2022). Collocated with COLING2022

  6. arXiv:2203.13209

    cs.CL

    Direct parsing to sentiment graphs

    Authors: David Samuel, Jeremy Barnes, Robin Kurtz, Stephan Oepen, Lilja Øvrelid, Erik Velldal

    Abstract: This paper demonstrates how a graph-based semantic parser can be applied to the task of structured sentiment analysis, directly predicting sentiment graphs from text. We advance the state of the art on 4 out of 5 standard benchmark sets. We release the source code, models and predictions.

    Submitted 26 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  7. arXiv:2105.14504

    cs.CL

    Structured Sentiment Analysis as Dependency Graph Parsing

    Authors: Jeremy Barnes, Robin Kurtz, Stephan Oepen, Lilja Øvrelid, Erik Velldal

    Abstract: Structured sentiment analysis attempts to extract full opinion tuples from a text, but over time this task has been subdivided into smaller and smaller sub-tasks, e.g., target extraction or targeted polarity classification. We argue that this division has become counterproductive and propose a new unified framework to remedy the situation. We cast the structured sentiment problem as dependency gra…

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: Accepted at ACL-IJCNLP 2021

  8. arXiv:2105.07400

    cs.CL

    The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus

    Authors: Samia Touileb, Jeremy Barnes

    Abstract: Recent years have seen a rise in interest for cross-lingual transfer between languages with similar typology, and between languages of various scripts. However, the interplay between language similarity and difference in script on cross-lingual transfer is a less studied problem. We explore this interplay on cross-lingual transfer for two supervised tasks, namely part-of-speech tagging and sentime…

    Submitted 31 May, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: Accepted at Findings of ACL: ACL2021

  9. skweak: Weak Supervision Made Easy for NLP

    Authors: Pierre Lison, Jeremy Barnes, Aliaksandr Hubin

    Abstract: We present skweak, a versatile, Python-based software toolkit enabling NLP developers to apply weak supervision to a wide range of NLP tasks. Weak supervision is an emerging machine learning paradigm based on a simple idea: instead of labelling data points by hand, we use labelling functions derived from domain knowledge to automatically obtain annotations for a given dataset. The resulting labels…

    Submitted 19 April, 2021; originally announced April 2021.

  10. arXiv:2104.08281

    physics.ao-ph cs.LG

    Controlled abstention neural networks for identifying skillful predictions for classification problems

    Authors: Elizabeth A. Barnes, Randal J. Barnes

    Abstract: The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed "forecasts of opportunity." When these opportunities are not present, scientists need prediction systems that a…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: submitted to the Journal of Advances in Earth System Modeling. arXiv admin note: substantial text overlap with arXiv:2104.08236

  11. arXiv:2104.08236

    cs.LG physics.ao-ph

    Controlled abstention neural networks for identifying skillful predictions for regression problems

    Authors: Elizabeth A. Barnes, Randal J. Barnes

    Abstract: The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed "forecasts of opportunity". When these opportunities are not present, scientists need prediction systems that a…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: submitted to the Journal of Advances of Earth System Modeling

  12. arXiv:2104.06546

    cs.CL

    Large-Scale Contextualised Language Modelling for Norwegian

    Authors: Andrey Kutuzov, Jeremy Barnes, Erik Velldal, Lilja Øvrelid, Stephan Oepen

    Abstract: We present the ongoing NorLM initiative to support the creation and use of very large contextualised language models for Norwegian (and in principle other Nordic languages), including a ready-to-use software environment, as well as an experience report for data preparation and training. This paper introduces the first large-scale monolingual language models for Norwegian, based on both the ELMo an…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted to NoDaLiDa'2021

  13. arXiv:2104.04989

    cs.CL

    NorDial: A Preliminary Corpus of Written Norwegian Dialect Use

    Authors: Jeremy Barnes, Petter Mæhlum, Samia Touileb

    Abstract: Norway has a large amount of dialectal variation, as well as a general tolerance to its use in the public sphere. There are, however, few available resources to study this variation and its change over time and in more informal areas, e.g., on social media. In this paper, we propose a first step to creating a corpus of dialectal variation of written Norwegian. We collect a small corpus of tweets and…

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted to NoDaLiDa 2021

  14. arXiv:2102.00299

    cs.CL

    If you've got it, flaunt it: Making the most of fine-grained sentiment annotations

    Authors: Jeremy Barnes, Lilja Øvrelid, Erik Velldal

    Abstract: Fine-grained sentiment analysis attempts to extract sentiment holders, targets and polar expressions and resolve the relationship between them, but progress has been hampered by the difficulty of annotation. Targeted sentiment analysis, on the other hand, is a more narrow task, focusing on extracting sentiment targets and classifying their polarity. In this paper, we explore whether incorporating h…

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: To appear in EACL 2021

  15. arXiv:2010.08318

    cs.CL

    Multi-task Learning of Negation and Speculation for Targeted Sentiment Classification

    Authors: Andrew Moore, Jeremy Barnes

    Abstract: The majority of work in targeted sentiment analysis has concentrated on finding better methods to improve the overall results. Within this paper we show that these models are not robust to linguistic phenomena, specifically negation and speculation. In this paper, we propose a multi-task learning method to incorporate information from syntactic and semantic auxiliary tasks, including negation and…

    Submitted 31 March, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: To appear at NAACL 2021 (long)

  16. arXiv:2004.14723

    cs.CL cs.LG stat.ML

    Named Entity Recognition without Labelled Data: A Weak Supervision Approach

    Authors: Pierre Lison, Aliaksandr Hubin, Jeremy Barnes, Samia Touileb

    Abstract: Named Entity Recognition (NER) performance often degrades rapidly when applied to target domains that differ from the texts observed during training. When in-domain labelled data is available, transfer learning techniques can be used to adapt existing NER models to the target domain. But what should one do when there is no hand-labelled data for the target domain? This paper presents a simple but…

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted to ACL 2020 (long paper)

  17. arXiv:2004.04103

    cs.CL

    Cross-lingual Emotion Intensity Prediction

    Authors: Irean Navas Alejo, Toni Badia, Jeremy Barnes

    Abstract: Emotion intensity prediction determines the degree or intensity of an emotion that the author expresses in a text, extending previous categorical approaches to emotion detection. While most previous work on this topic has concentrated on English texts, other languages would also benefit from fine-grained emotion classification, preferably without having to recreate the amount of annotated data ava…

    Submitted 24 November, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: Accepted in PEOPLES 2020 Workshop

  18. arXiv:2002.08131

    cs.CL cs.LG

    A Systematic Comparison of Architectures for Document-Level Sentiment Classification

    Authors: Jeremy Barnes, Vinit Ravishankar, Lilja Øvrelid, Erik Velldal

    Abstract: Documents are composed of smaller pieces - paragraphs, sentences, and tokens - that have complex relationships between one another. Sentiment classification models that take into account the structure inherent in these documents have a theoretical advantage over those that do not. At the same time, transfer learning models based on language model pretraining have shown promise for document classif…

    Submitted 2 February, 2022; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 5 pages, 2 figures

  19. arXiv:1911.12722

    cs.CL

    A Fine-Grained Sentiment Dataset for Norwegian

    Authors: Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, Erik Velldal

    Abstract: We introduce NoReC_fine, a dataset for fine-grained sentiment analysis in Norwegian, annotated with respect to polar expressions, targets and holders of opinion. The underlying texts are taken from a corpus of professionally authored reviews from multiple news-sources and across a wide variety of domains, including literature, games, music, products, movies and more. We here present a detailed des…

    Submitted 6 April, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted for LREC 2020

  20. arXiv:1909.10576

    cs.NI

    The Potential Short- and Long-Term Disruptions and Transformative Impacts of 5G and Beyond Wireless Networks: Lessons Learnt from the Development of a 5G Testbed Environment

    Authors: Mohmammad N. Patwary, Syed Junaid Nawaz, Md. Abdur Rahman, Shree Krishna Sharma, Md Mamunur Rashid, Stuart J. Barnes

    Abstract: The anticipated deployment cost of 5G communication networks in the UK is predicted to be between £30bn and £50bn, whereas the current annual capital expenditure of the mobile network operators (MNOs) is £2.5bn. This prospect has vastly impacted and has become one of the major delaying factors for building the 5G physical infrastructure, whereas other areas of 5G developments are progressing at th…

    Submitted 31 May, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 22 pages, 9 figures, 11 tables

  21. arXiv:1906.10519

    cs.CL

    Embedding Projection for Targeted Cross-Lingual Sentiment: Model Comparisons and a Real-World Study

    Authors: Jeremy Barnes, Roman Klinger

    Abstract: Sentiment analysis benefits from large, hand-annotated resources in order to train and test machine learning models, which are often data hungry. While some languages, e.g., English, have a vast array of these resources, most under-resourced languages do not, especially for fine-grained sentiment tasks, such as aspect-level or targeted sentiment analysis. To improve this situation, we propose a cr…

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Submitted to Journal of Artificial Intelligence Research (41 pages, 51 with references). arXiv admin note: text overlap with arXiv:1805.09016

  22. Improving Sentiment Analysis with Multi-task Learning of Negation

    Authors: Jeremy Barnes, Erik Velldal, Lilja Øvrelid

    Abstract: Sentiment analysis is directly affected by compositional phenomena in language that act on the prior polarity of the words and phrases found in the text. Negation is the most prevalent of these phenomena and in order to correctly predict sentiment, a classifier must be able to identify negation and disentangle the effect that its scope has on the final polarity of a text. This paper proposes a mul…

    Submitted 1 October, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Under submission for Journal of Natural Language Engineering special issue on Negation. 30 pages with references

    Journal ref: Nat. Lang. Eng. 27 (2021) 249-269

  23. arXiv:1906.07599

    cs.CL

    LTG-Oslo Hierarchical Multi-task Network: The importance of negation for document-level sentiment in Spanish

    Authors: Jeremy Barnes

    Abstract: This paper details LTG-Oslo team's participation in the sentiment track of the NEGES 2019 evaluation campaign. We participated in the task with a hierarchical multi-task network, which used shared lower-layers in a deep BiLSTM to predict negation, while the higher layers were dedicated to predicting document-level sentiment. The multi-task component shows promise as a way to incorporate informatio…

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: Accepted in NEGES (Negation in Spanish) workshop at SEPLN 2019

  24. arXiv:1906.05889

    cs.CL

    On the Effect of Word Order on Cross-lingual Sentiment Analysis

    Authors: Àlex R. Atrio, Toni Badia, Jeremy Barnes

    Abstract: Current state-of-the-art models for sentiment analysis make use of word order either explicitly by pre-training on a language modeling objective or implicitly by using recurrent neural networks (RNNs) or convolutional networks (CNNs). This is a problem for cross-lingual models that use bilingual embeddings as features, as the difference in word order between source and target languages is not reso…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Accepted to SEPLN 2019

  25. arXiv:1906.05887

    cs.CL

    Sentiment analysis is not solved! Assessing and probing sentiment classification

    Authors: Jeremy Barnes, Lilja Øvrelid, Erik Velldal

    Abstract: Neural methods for SA have led to quantitative improvements over previous approaches, but these advances are not always accompanied with a thorough analysis of the qualitative differences. Therefore, it is not clear what outstanding conceptual challenges for sentiment analysis remain. In this work, we attempt to discover what challenges still prove a problem for sentiment classifiers for English a…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Accepted to BlackBoxNLP Workshop at ACL 2019

  26. arXiv:1806.04381

    cs.CL

    Projecting Embeddings for Domain Adaptation: Joint Modeling of Sentiment Analysis in Diverse Domains

    Authors: Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde

    Abstract: Domain adaptation for sentiment analysis is challenging due to the fact that supervised classifiers are very sensitive to changes in domain. The two most prominent approaches to this problem are structural correspondence learning and autoencoders. However, they either require long training times or suffer greatly on highly divergent domains. Inspired by recent advances in cross-lingual sentiment a…

    Submitted 13 June, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: Accepted to COLING 2018

  27. arXiv:1805.09016

    cs.CL

    Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages

    Authors: Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde

    Abstract: Sentiment analysis in low-resource languages suffers from a lack of annotated corpora to estimate high-performing models. Machine translation and bilingual word embeddings provide some relief through cross-lingual sentiment approaches. However, they either require large amounts of parallel data or do not sufficiently capture sentiment information. We introduce Bilingual Sentiment Embeddings (BLSE)…

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: Accepted to ACL 2018 (Long Papers)

  28. arXiv:1803.08614

    cs.CL

    MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification

    Authors: Jeremy Barnes, Patrik Lambert, Toni Badia

    Abstract: While sentiment analysis has become an established field in the NLP community, research into languages other than English has been hindered by the lack of resources. Although much research in multi-lingual and cross-lingual sentiment analysis has focused on unsupervised or semi-supervised approaches, these still require a large number of resources and do not reach the performance of supervised app…

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: Accepted at LREC 2018

  29. arXiv:1709.04219

    cs.CL cs.AI

    Assessing State-of-the-Art Sentiment Models on State-of-the-Art Sentiment Datasets

    Authors: Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde

    Abstract: There has been a good amount of progress in sentiment analysis over the past 10 years, including the proposal of new methods and the creation of benchmark datasets. In some papers, however, there is a tendency to compare models only on one or two datasets, either because of time restraints or because the model is tailored to a specific task. Accordingly, it is hard to understand how well a certain…

    Submitted 13 September, 2017; originally announced September 2017.

    Comments: Presented at WASSA 2017

    Journal ref: In Proceedings of WASSA (2017). 2 - 12