Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Spector, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04560  [pdf, other

    cs.CL

    Conversational Prompt Engineering

    Authors: Liat Ein-Dor, Orith Toledo-Ronen, Artem Spector, Shai Gretz, Lena Dankin, Alon Halfon, Yoav Katz, Noam Slonim

    Abstract: Prompts are how humans communicate with LLMs. Informative prompts are essential for guiding LLMs to produce the desired output. However, prompt engineering is often tedious and time-consuming, requiring significant expertise, limiting its widespread use. We propose Conversational Prompt Engineering (CPE), a user-friendly tool that helps users create personalized prompts for their specific tasks. C… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2407.18990  [pdf, other

    cs.LG cs.AI cs.CL

    Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications

    Authors: Alon Halfon, Shai Gretz, Ofir Arviv, Artem Spector, Orith Toledo-Ronen, Yoav Katz, Liat Ein-Dor, Michal Shmueli-Scheuer, Noam Slonim

    Abstract: Fine-tuning Large Language Models (LLMs) is an effective method to enhance their performance on downstream tasks. However, choosing the appropriate setting of tuning hyperparameters (HPs) is a labor-intensive and computationally expensive process. Here, we provide recommended HP configurations for practical use-cases that represent a better starting point for practitioners, when considering two SO… ▽ More

    Submitted 7 August, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

  3. arXiv:2201.02026  [pdf, other

    cs.CL

    Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis

    Authors: Liat Ein-Dor, Ilya Shnayderman, Artem Spector, Lena Dankin, Ranit Aharonov, Noam Slonim

    Abstract: In recent years, pretrained language models have revolutionized the NLP world, while achieving state of the art performance in various downstream tasks. However, in many cases, these models do not perform well when labeled data is scarce and the model is expected to perform in the zero or few shot setting. Recently, several works have shown that continual pretraining or performing a second phase o… ▽ More

    Submitted 5 April, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

    Comments: Published in AAAI 2022

  4. arXiv:2012.14541  [pdf, other

    cs.CL cs.IR cs.LG

    YASO: A Targeted Sentiment Analysis Evaluation Dataset for Open-Domain Reviews

    Authors: Matan Orbach, Orith Toledo-Ronen, Artem Spector, Ranit Aharonov, Yoav Katz, Noam Slonim

    Abstract: Current TSA evaluation in a cross-domain setup is restricted to the small set of review domains available in existing datasets. Such an evaluation is limited, and may not reflect true performance on sites like Amazon or Yelp that host diverse reviews from many domains. To address this gap, we present YASO - a new TSA evaluation dataset of open-domain user reviews. YASO contains 2,215 English sente… ▽ More

    Submitted 13 September, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Accepted to EMNLP 2021 (long paper). To download YASO, see https://github.com/IBM/yaso-tsa

  5. arXiv:2010.06432  [pdf, other

    cs.CL cs.AI cs.LG

    Multilingual Argument Mining: Datasets and Analysis

    Authors: Orith Toledo-Ronen, Matan Orbach, Yonatan Bilu, Artem Spector, Noam Slonim

    Abstract: The growing interest in argument mining and computational argumentation brings with it a plethora of Natural Language Understanding (NLU) tasks and corresponding datasets. However, as with many other NLU tasks, the dominant language is English, with resources in other languages being few and far between. In this work, we explore the potential of transfer learning using the multilingual BERT model… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to Findings of EMNLP 2020 (Long Paper). For the associated multilingual arguments and evidence corpus, see https://www.research.ibm.com/haifa/dept/vst/debating_data.shtml#Multilingual%20Argument%20Mining

  6. arXiv:1908.06785  [pdf, other

    cs.CL

    Fast End-to-End Wikification

    Authors: Ilya Shnayderman, Liat Ein-Dor, Yosi Mass, Alon Halfon, Benjamin Sznajder, Artem Spector, Yoav Katz, Dafna Sheinwald, Ranit Aharonov, Noam Slonim

    Abstract: Wikification of large corpora is beneficial for various NLP applications. Existing methods focus on quality performance rather than run-time, and are therefore non-feasible for large data. Here, we introduce RedW, a run-time oriented Wikification solution, based on Wikipedia redirects, that can Wikify massive corpora with competitive performance. We further propose an efficient method for estimati… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  7. arXiv:1809.01285  [pdf, ps, other

    cs.CL

    Learning Concept Abstractness Using Weak Supervision

    Authors: Ella Rabinovich, Benjamin Sznajder, Artem Spector, Ilya Shnayderman, Ranit Aharonov, David Konopnicki, Noam Slonim

    Abstract: We introduce a weakly supervised approach for inferring the property of abstractness of words and expressions in the complete absence of labeled data. Exploiting only minimal linguistic clues and the contextual usage of a concept as manifested in textual data, we train sufficiently powerful classifiers, obtaining high correlation with human labels. The results imply the applicability of this appro… ▽ More

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: 6 pages, EMNLP 2018