Skip to main content

Showing 1–10 of 10 results for author: Soboroff, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11139  [pdf, other

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Under review

  2. On the Evaluation of Machine-Generated Reports

    Authors: James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi, Kate Sanders, Marc Mason, Noah Hibbler

    Abstract: Large Language Models (LLMs) have enabled new ways to satisfy information needs. Although great strides have been made in applying them to settings like document ranking and short-form text generation, they still struggle to compose complete, accurate, and verifiable long-form reports. Reports with these qualities are necessary to satisfy the complex, nuanced, or multi-faceted information needs of… ▽ More

    Submitted 9 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper

  3. arXiv:2210.10266  [pdf, ps, other

    cs.IR

    Corrected Evaluation Results of the NTCIR WWW-2, WWW-3, and WWW-4 English Subtasks

    Authors: Tetsuya Sakai, Sijie Tao, Maria Maistro, Zhumin Chu, Yujing Li, Nuo Chen, Nicola Ferro, Junjie Wang, Ian Soboroff, Yiqun Liu

    Abstract: Unfortunately, the official English (sub)task results reported in the NTCIR-14 WWW-2, NTCIR-15 WWW-3, and NTCIR-16 WWW-4 overview papers are incorrect due to noise in the official qrels files; this paper reports results based on the corrected qrels files. The noise is due to a fatal bug in the backend of our relevance assessment interface. More specifically, at WWW-2, WWW-3, and WWW-4, two version… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 24 pages

  4. arXiv:2201.11086  [pdf, other

    cs.IR

    Can Old TREC Collections Reliably Evaluate Modern Neural Retrieval Models?

    Authors: Ellen M. Voorhees, Ian Soboroff, Jimmy Lin

    Abstract: Neural retrieval models are generally regarded as fundamentally different from the retrieval techniques used in the late 1990's when the TREC ad hoc test collections were constructed. They thus provide the opportunity to empirically test the claim that pooling-built test collections can reliably evaluate retrieval systems that did not contribute to the construction of the collection (in other word… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  5. Podcast Metadata and Content: Episode Relevance andAttractiveness in Ad Hoc Search

    Authors: Ben Carterette, Rosie Jones, Gareth F. Jones, Maria Eskevich, Sravana Reddy, Ann Clifton, Yongze Yu, Jussi Karlgren, Ian Soboroff

    Abstract: Rapidly growing online podcast archives contain diverse content on a wide range of topics. These archives form an important resource for entertainment and professional use, but their value can only be realized if users can rapidly and reliably locate content of interest. Search for relevant content can be based on metadata provided by content creators, but also on transcripts of the spoken content… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

  6. arXiv:2104.09632  [pdf

    cs.IR

    Searching for Scientific Evidence in a Pandemic: An Overview of TREC-COVID

    Authors: Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen Voorhees, Lucy Lu Wang, William R Hersh

    Abstract: We present an overview of the TREC-COVID Challenge, an information retrieval (IR) shared task to evaluate search on scientific literature related to COVID-19. The goals of TREC-COVID include the construction of a pandemic search test collection and the evaluation of IR methods for COVID-19. The challenge was conducted over five rounds from April to July, 2020, with participation from 92 unique tea… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  7. arXiv:2104.09399  [pdf, other

    cs.IR cs.AI cs.LG

    TREC Deep Learning Track: Reusable Test Collections in the Large Data Regime

    Authors: Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Ellen M. Voorhees, Ian Soboroff

    Abstract: The TREC Deep Learning (DL) Track studies ad hoc search in the large data regime, meaning that a large set of human-labeled training data is available. Results so far indicate that the best models with large data may be deep neural networks. This paper supports the reuse of the TREC DL test collections in three ways. First we describe the data sets in detail, documenting clearly and in one place s… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2003.07820

  8. How to Measure the Reproducibility of System-oriented IR Experiments

    Authors: Timo Breuer, Nicola Ferro, Norbert Fuhr, Maria Maistro, Tetsuya Sakai, Philipp Schaer, Ian Soboroff

    Abstract: Replicability and reproducibility of experimental results are primary concerns in all the areas of science and IR is not an exception. Besides the problem of moving the field towards more reproducible experimental practices and protocols, we also face a severe methodological issue: we do not have any means to assess when reproduced is reproduced. Moreover, we lack any reproducibility-oriented data… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: SIGIR2020 Full Conference Paper

  9. arXiv:2005.04474  [pdf, other

    cs.IR

    TREC-COVID: Constructing a Pandemic Information Retrieval Test Collection

    Authors: Ellen Voorhees, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, William R Hersh, Kyle Lo, Kirk Roberts, Ian Soboroff, Lucy Lu Wang

    Abstract: TREC-COVID is a community evaluation designed to build a test collection that captures the information needs of biomedical researchers using the scientific literature during a pandemic. One of the key characteristics of pandemic search is the accelerated rate of change: the topics of interest evolve as the pandemic progresses and the scientific literature in the area explodes. The COVID-19 pandemi… ▽ More

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: 10 pages, 5 figures. TREC-COVID web site: http://ir.nist.gov/covidSubmit/ Will also appear in June 2020 issue of ACM SIGIR Forum

    ACM Class: H.3.0

  10. arXiv:2005.00463  [pdf, other

    cs.AI cs.CL cs.CV

    HLVU : A New Challenge to Test Deep Understanding of Movies the Way Humans do

    Authors: Keith Curtis, George Awad, Shahzad Rajput, Ian Soboroff

    Abstract: In this paper we propose a new evaluation challenge and direction in the area of High-level Video Understanding. The challenge we are proposing is designed to test automatic video analysis and understanding, and how accurately systems can comprehend a movie in terms of actors, entities, events and their relationship to each other. A pilot High-Level Video Understanding (HLVU) dataset of open sourc… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.