
Showing 1–24 of 24 results for author: Apidianaki, M

  1. Learning Translations via Matrix Completion

    Authors: Derry Wijaya, Brendan Callahan, John Hewitt, Jie Gao, Xiao Ling, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Bilingual Lexicon Induction is the task of learning word translations without bilingual parallel corpora. We model this task as a matrix completion problem, and present an effective and extendable framework for completing the matrix. This method harnesses diverse bilingual and monolingual signals, each of which may be incomplete or noisy. Our model achieves state-of-the-art performance for both hi…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: This is a late posting of an old paper as Google Scholar somehow misses indexing the ACL anthology version of the paper

    ACM Class: I.2.7

    Journal ref: Volume: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Year: 2017, Pages: 1452-1463

  2. arXiv:2404.02619  [pdf, other]

    cs.CL

    Adjusting Interpretable Dimensions in Embedding Space with Human Judgments

    Authors: Katrin Erk, Marianna Apidianaki

    Abstract: Embedding spaces contain interpretable dimensions indicating gender, formality in style, or even object properties. This has been observed multiple times. Such interpretable dimensions are becoming valuable tools in different areas of study, from social science to neuroscience. The standard way to compute these dimensions uses contrasting seed words and computes difference vectors over them. This…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  3. arXiv:2403.07726  [pdf, other]

    cs.CL

    SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

    Abstract: This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applications, where correctness is often mission-critical. The shared task was conducted with a newly constructed dataset of 4000 model outputs labeled by 5 ann…

    Submitted 29 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: SemEval 2024 shared task. Pre-review version

  4. arXiv:2402.13904  [pdf, other]

    cs.CL

    Calibrating Large Language Models with Sample Consistency

    Authors: Qing Lyu, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar, Niket Tandon, Marianna Apidianaki, Mrinmaya Sachan, Chris Callison-Burch

    Abstract: Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional calibration techniques due to their proprietary nature and massive scale. In this work, we explore the potential of deriving confidence from the distribution of multiple randomly sampled model generati…

    Submitted 21 February, 2024; originally announced February 2024.

  5. arXiv:2305.18657  [pdf, other]

    cs.CL

    Representation Of Lexical Stylistic Features In Language Models' Embedding Space

    Authors: Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: The representation space of pretrained Language Models (LMs) encodes rich information about words and their relationships (e.g., similarity, hypernymy, polysemy) as well as abstract semantic notions (e.g., intensity). In this paper, we demonstrate that lexical stylistic notions such as complexity, formality, and figurativeness, can also be identified in this space. We show that it is possible to d…

    Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at *SEM 2023

  6. arXiv:2305.14724  [pdf, other]

    cs.CL cs.AI cs.CV cs.HC

    I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors

    Authors: Tuhin Chakrabarty, Arkadiy Saakyan, Olivia Winn, Artemis Panagopoulou, Yue Yang, Marianna Apidianaki, Smaranda Muresan

    Abstract: Visual metaphors are powerful rhetorical devices used to persuade or communicate creative ideas through images. Similar to linguistic metaphors, they convey meaning implicitly through symbolism and juxtaposition of the symbols. We propose a new task of generating visual metaphors from linguistic metaphors. This is a challenging task for diffusion-based text-to-image models, such as DALL$\cdot$E 2,…

    Submitted 14 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (Findings)

  7. arXiv:2305.04990  [pdf, other]

    cs.CL cs.LG

    Explanation-based Finetuning Makes Models More Robust to Spurious Cues

    Authors: Josh Magnus Ludan, Yixuan Meng, Tai Nguyen, Saurabh Shah, Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Large Language Models (LLMs) are so powerful that they sometimes learn correlations between labels and features that are irrelevant to the task, leading to poor generalization on out-of-distribution data. We propose explanation-based finetuning as a general approach to mitigate LLMs' reliance on spurious correlations. Unlike standard finetuning where the model only predicts the answer given the in…

    Submitted 6 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  8. arXiv:2301.13379  [pdf, other]

    cs.CL

    Faithful Chain-of-Thought Reasoning

    Authors: Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch

    Abstract: While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\rightarrow$ symbolic reasoning chain) and Problem Solving…

    Submitted 20 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: IJCNLP-AACL 2023 camera-ready version

  9. arXiv:2210.12905  [pdf, other]

    cs.CL

    Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction

    Authors: Yue Yang, Artemis Panagopoulou, Marianna Apidianaki, Mark Yatskar, Chris Callison-Burch

    Abstract: Neural language models encode rich knowledge about entities and their relationships which can be extracted from their representations using probing. Common properties of nouns (e.g., red strawberries, small ant) are, however, more challenging to extract compared to other types of knowledge because they are rarely explicitly stated in texts. We hypothesize this to mainly be the case for perceptual…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022; The first two authors contributed equally

    Journal ref: Findings of EMNLP 2022

  10. arXiv:2209.11326  [pdf, other]

    cs.CL

    Towards Faithful Model Explanation in NLP: A Survey

    Authors: Qing Lyu, Marianna Apidianaki, Chris Callison-Burch

    Abstract: End-to-end neural Natural Language Processing (NLP) models are notoriously difficult to understand. This has given rise to numerous efforts towards model explainability in recent years. One desideratum of model explanation is faithfulness, i.e. an explanation should accurately represent the reasoning process behind the model's prediction. In this survey, we review over 110 model explanation method…

    Submitted 12 January, 2024; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: Added acknowledgements; Accepted to the Computational Linguistics Journal (June 2024 issue)

  11. arXiv:2201.04467  [pdf, other]

    cs.CL

    How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets

    Authors: Aarne Talman, Marianna Apidianaki, Stergios Chatzikyriakidis, Jörg Tiedemann

    Abstract: A central question in natural language understanding (NLU) research is whether high performance demonstrates the models' strong reasoning capabilities. We present an extensive series of controlled experiments where pre-trained language models are exposed to data that have undergone specific corruption transformations. These involve removing instances of specific word classes and often lead to non-…

    Submitted 15 May, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: *SEM 2022 camera ready version

  12. arXiv:2112.08326  [pdf, other]

    cs.CL

    Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases

    Authors: Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Recursive noun phrases (NPs) have interesting semantic properties. For example, "my favorite new movie" is not necessarily my favorite movie, whereas "my new favorite movie" is. This is common sense to humans, yet it is unknown whether language models have such knowledge. We introduce the Recursive Noun Phrase Challenge (RNPC), a dataset of three textual inference tasks involving textual entailmen…

    Submitted 8 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  13. arXiv:2110.06376  [pdf, other]

    cs.CL

    ALL Dolphins Are Intelligent and SOME Are Friendly: Probing BERT for Nouns' Semantic Properties and their Prototypicality

    Authors: Marianna Apidianaki, Aina Garí Soler

    Abstract: Large scale language models encode rich commonsense knowledge acquired through exposure to massive data during pre-training, but their understanding of entities and their semantic properties is unclear. We probe BERT (Devlin et al., 2019) for the properties of English nouns as expressed by adjectives that do not restrict the reference scope of the noun they modify (as in "red car"), but instead em…

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to BlackboxNLP 2021

  14. arXiv:2105.01180  [pdf, other]

    cs.CL

    Scalar Adjective Identification and Multilingual Ranking

    Authors: Aina Garí Soler, Marianna Apidianaki

    Abstract: The intensity relationship that holds between scalar adjectives (e.g., nice < great < wonderful) is highly relevant for natural language inference and common-sense reasoning. Previous research on scalar adjective ranking has focused on English, mainly due to the availability of datasets for evaluation. We introduce a new multilingual dataset in order to promote research on scalar adjectives in new…

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted at NAACL-HLT 2021

  15. arXiv:2104.14694  [pdf, other]

    cs.CL

    Let's Play Mono-Poly: BERT Can Reveal Words' Polysemy Level and Partitionability into Senses

    Authors: Aina Garí Soler, Marianna Apidianaki

    Abstract: Pre-trained language models (LMs) encode rich information about linguistic structure but their knowledge about lexical polysemy remains unclear. We propose a novel experimental setup for analysing this knowledge in LMs specifically trained for different languages (English, French, Spanish and Greek) and in multilingual BERT. We perform our analysis on datasets carefully designed to reflect differe…

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted to TACL. Pre-MIT Press publication version

  16. arXiv:2104.04751  [pdf, other]

    cs.CL

    NLI Data Sanity Check: Assessing the Effect of Data Corruption on Model Performance

    Authors: Aarne Talman, Marianna Apidianaki, Stergios Chatzikyriakidis, Jörg Tiedemann

    Abstract: Pre-trained neural language models give high performance on natural language inference (NLI) tasks. But whether they actually understand the meaning of the processed sequences remains unclear. We propose a new diagnostics test suite which allows to assess whether a dataset constitutes a good testbed for evaluating the models' meaning understanding capabilities. We specifically apply controlled cor…

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: NoDaLiDa 2021 camera ready

  17. arXiv:2012.12382  [pdf, other]

    cs.CL

    Simple-QE: Better Automatic Quality Estimation for Text Simplification

    Authors: Reno Kriz, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Text simplification systems generate versions of texts that are easier to understand for a broader audience. The quality of simplified texts is generally estimated using metrics that compare to human references, which can be difficult to obtain. We propose Simple-QE, a BERT-based quality estimation (QE) model adapted from prior summarization QE work, and show that it correlates well with human qua…

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 4 pages, 1 figure, 2 tables

  18. arXiv:2010.02686  [pdf, other]

    cs.CL

    BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations

    Authors: Aina Garí Soler, Marianna Apidianaki

    Abstract: Adjectives like pretty, beautiful and gorgeous describe positive properties of the nouns they modify but with different intensity. These differences are important for natural language understanding and reasoning. We propose a novel BERT-based approach to intensity detection for scalar adjectives. We model intensity by vectors directly derived from contextualised representations and show they can s…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  19. arXiv:2007.12432  [pdf, other]

    cs.CL

    MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning

    Authors: Aina Garí Soler, Marianna Apidianaki

    Abstract: We present the MULTISEM systems submitted to SemEval 2020 Task 3: Graded Word Similarity in Context (GWSC). We experiment with injecting semantic knowledge into pre-trained BERT models through fine-tuning on lexical semantic tasks related to GWSC. We use existing semantically annotated datasets and propose to approximate similarity through automatically generated lexical substitutes in context. We…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: 8 pages, 2 tables. Accepted at the 14th International Workshop on Semantic Evaluation (SemEval-2020)

  20. arXiv:1909.00578  [pdf, other]

    cs.CL

    SumQE: a BERT-based Summary Quality Estimation Model

    Authors: Stratos Xenouleas, Prodromos Malakasiotis, Marianna Apidianaki, Ion Androutsopoulos

    Abstract: We propose SumQE, a novel Quality Estimation model for summarization based on BERT. The model addresses linguistic quality aspects that are only indirectly captured by content-based approaches to summary evaluation, without involving comparison with human references. SumQE achieves very high correlations with human ratings, outperforming simpler models addressing these linguistic aspects. Predicti…

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019), Hong Kong, China, 2019

  21. arXiv:1906.05939  [pdf, other]

    cs.AI cs.CL

    Embedding Biomedical Ontologies by Jointly Encoding Network Structure and Textual Node Descriptors

    Authors: Sotiris Kotitsas, Dimitris Pappas, Ion Androutsopoulos, Ryan McDonald, Marianna Apidianaki

    Abstract: Network Embedding (NE) methods, which map network nodes to low-dimensional feature vectors, have wide applications in network analysis and bioinformatics. Many existing NE methods rely only on network structure, overlooking other information associated with the nodes, e.g., text describing the nodes. Recent attempts to combine the two sources of information only consider local network structure. W…

    Submitted 20 June, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 18th Workshop on Biomedical Natural Language Processing (BioNLP 2019) of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Florence, Italy, 2019

  22. arXiv:1905.08377  [pdf, other]

    cs.CL

    Word Usage Similarity Estimation with Sentence Representations and Automatic Substitutes

    Authors: Aina Garí Soler, Marianna Apidianaki, Alexandre Allauzen

    Abstract: Usage similarity estimation addresses the semantic proximity of word instances in different contexts. We apply contextualized (ELMo and BERT) word and sentence embeddings to this task, and propose supervised models that leverage these representations for prediction. Our models are further assisted by lexical substitute annotations automatically assigned to word instances by context2vec, a neural m…

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: *SEM 2019

  23. arXiv:1904.02767  [pdf, other]

    cs.CL

    Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification

    Authors: Reno Kriz, João Sedoc, Marianna Apidianaki, Carolina Zheng, Gaurav Kumar, Eleni Miltsakaki, Chris Callison-Burch

    Abstract: Sentence simplification is the task of rewriting texts so they are easier to understand. Recent research has applied sequence-to-sequence (Seq2Seq) models to this task, focusing largely on training-time improvements via reinforcement learning and memory augmentation. One of the main problems with applying generic Seq2Seq models for simplification is that these models tend to copy directly from the…

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: 11 pages, North American Association of Computational Linguistics (NAACL 2019)

  24. arXiv:1810.11190  [pdf, other]

    cs.CL cs.AI

    Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package

    Authors: Ajay Patel, Alexander Sands, Chris Callison-Burch, Marianna Apidianaki

    Abstract: Vector space embedding models like word2vec, GloVe, fastText, and ELMo are extremely popular representations in natural language processing (NLP) applications. We present Magnitude, a fast, lightweight tool for utilizing and processing embeddings. Magnitude is an open source Python package with a compact vector storage file format that allows for efficient manipulation of huge numbers of embedding…

    Submitted 26 October, 2018; originally announced October 2018.