Zum Hauptinhalt springen

Showing 1–43 of 43 results for author: Rosenthal, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17347  [pdf, other

    cs.SE cs.HC

    InspectorRAGet: An Introspection Platform for RAG Evaluation

    Authors: Kshitij Fadnis, Siva Sankalp Patel, Odellia Boni, Yannis Katsis, Sara Rosenthal, Benjamin Sznajder, Marina Danilevsky

    Abstract: Large Language Models (LLM) have become a popular approach for implementing Retrieval Augmented Generation (RAG) systems, and a significant amount of effort has been spent on building good models and metrics. In spite of increased recognition of the need for rigorous evaluation of RAG systems, few tools exist that go beyond the creation of model output and automatic calculation. We present Inspect… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2404.02103  [pdf, other

    cs.CL

    CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems

    Authors: Sara Rosenthal, Avirup Sil, Radu Florian, Salim Roukos

    Abstract: Retrieval Augmented Generation (RAG) has become a popular application for large language models. It is preferable that successful RAG systems provide accurate answers that are supported by being grounded in a passage without any hallucinations. While considerable work is required for building a full RAG pipeline, being able to benchmark performance is also necessary. We present ClapNQ, a benchmark… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 25 pages

  3. arXiv:2401.13588  [pdf

    cs.CL cs.AI cs.SE

    Evaluation of General Large Language Models in Contextually Assessing Semantic Concepts Extracted from Adult Critical Care Electronic Health Record Notes

    Authors: Darren Liu, Cheng Ding, Delgersuren Bold, Monique Bouvier, Jiaying Lu, Benjamin Shickel, Craig S. Jabaley, Wenhui Zhang, Soojin Park, Michael J. Young, Mark S. Wainwright, Gilles Clermont, Parisa Rashidi, Eric S. Rosenthal, Laurie Dimisko, Ran Xiao, Joo Heung Yoon, Carl Yang, Xiao Hu

    Abstract: The field of healthcare has increasingly turned its focus towards Large Language Models (LLMs) due to their remarkable performance. However, their performance in actual clinical applications has been underexplored. Traditional evaluations based on question-answering tasks don't fully capture the nuanced contexts. This gap highlights the need for more in-depth and practical assessments of LLMs in r… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2312.11344  [pdf, other

    cs.CL cs.AI cs.HC

    Muted: Multilingual Targeted Offensive Speech Identification and Visualization

    Authors: Christoph Tillmann, Aashka Trivedi, Sara Rosenthal, Santosh Borse, Rong Zhang, Avirup Sil, Bishwaranjan Bhattacharjee

    Abstract: Offensive language such as hate, abuse, and profanity (HAP) occurs in various content on the web. While previous work has mostly dealt with sentence level annotations, there have been a few recent attempts to identify offensive spans as well. We build upon this work and introduce Muted, a system to identify multilingual HAP content by displaying offensive arguments and their targets using heat map… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Journal ref: EMNLP 2023 Demo Track

  5. Can Large Language Models Capture Public Opinion about Global Warming? An Empirical Assessment of Algorithmic Fidelity and Bias

    Authors: S. Lee, T. Q. Peng, M. H. Goldberg, S. A. Rosenthal, J. E. Kotcher, E. W. Maibach, A. Leiserowitz

    Abstract: Large language models (LLMs) have demonstrated their potential in social science research by emulating human perceptions and behaviors, a concept referred to as algorithmic fidelity. This study assesses the algorithmic fidelity and bias of LLMs by utilizing two nationally representative climate change surveys. The LLMs were conditioned on demographics and/or psychological covariates to simulate su… ▽ More

    Submitted 7 February, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 34 pages, 6 figures, 1 table

    Journal ref: PLOS Climate, 3(2024), e0000429

  6. arXiv:2301.09715  [pdf, other

    cs.CL cs.IR cs.LG

    PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development

    Authors: Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis, Mihaela Bornea, Sara Rosenthal, Scott McCarley, Rong Zhang, Vishwajeet Kumar, Yulong Li, Md Arafat Sultan, Riyaz Bhat, Radu Florian, Salim Roukos

    Abstract: The field of Question Answering (QA) has made remarkable progress in recent years, thanks to the advent of large pre-trained language models, newer realistic benchmark datasets with leaderboards, and novel algorithms for key components such as retrievers and readers. In this paper, we introduce PRIMEQA: a one-stop and open-source QA repository with an aim to democratize QA re-search and facilitate… ▽ More

    Submitted 25 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  7. arXiv:2206.08441  [pdf, other

    cs.CL

    GAAMA 2.0: An Integrated System that Answers Boolean and Extractive Questions

    Authors: Scott McCarley, Mihaela Bornea, Sara Rosenthal, Anthony Ferritto, Md Arafat Sultan, Avirup Sil, Radu Florian

    Abstract: Recent machine reading comprehension datasets include extractive and boolean questions but current approaches do not offer integrated support for answering both question types. We present a multilingual machine reading comprehension system and front-end demo that handles boolean questions by providing both a YES/NO answer and highlighting supporting evidence, and handles extractive questions by hi… ▽ More

    Submitted 21 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

  8. arXiv:2206.06705  [pdf, other

    cs.CL cs.LG

    Task Transfer and Domain Adaptation for Zero-Shot Question Answering

    Authors: Xiang Pan, Alex Sheng, David Shimshoni, Aditya Singhal, Sara Rosenthal, Avirup Sil

    Abstract: Pretrained language models have shown success in various areas of natural language processing, including reading comprehension tasks. However, when applying machine learning methods to new domains, labeled data may not always be available. To address this, we use supervised pretraining on source-domain data to reduce sample complexity on domain-specific downstream tasks. We evaluate zero-shot perf… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: NAACL 2022 Deep Learning for Low-Resource NLP Workshop Paper

    MSC Class: 68T50 ACM Class: I.2.7

  9. arXiv:2112.07772  [pdf, other

    cs.CL

    Do Answers to Boolean Questions Need Explanations? Yes

    Authors: Sara Rosenthal, Mihaela Bornea, Avirup Sil, Radu Florian, Scott McCarley

    Abstract: Existing datasets that contain boolean questions, such as BoolQ and TYDI QA , provide the user with a YES/NO response to the question. However, a one word response is not sufficient for an explainable system. We promote explainability by releasing a new set of annotations marking the evidence in existing TyDi QA and BoolQ datasets. We show that our annotations can be used to train a model that ext… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 9 pages

  10. arXiv:2109.10231  [pdf

    cs.HC cs.AI

    SalienTrack: providing salient information for semi-automated self-tracking feedback with model explanations

    Authors: Yunlong Wang, Jiaying Liu, Homin Park, Jordan Schultz-McArdle, Stephanie Rosenthal, Judy Kay, Brian Y. Lim

    Abstract: Self-tracking can improve people's awareness of their unhealthy behaviors and support reflection to inform behavior change. Increasingly, new technologies make tracking easier, leading to large amounts of tracked data. However, much of that information is not salient for reflection and self-awareness. To tackle this burden for reflection, we created the SalienTrack framework, which aims to 1) iden… ▽ More

    Submitted 16 February, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  11. arXiv:2105.13995  [pdf, other

    cs.CL

    SemEval-2021 Task 9: Fact Verification and Evidence Finding for Tabular Data in Scientific Documents (SEM-TAB-FACTS)

    Authors: Nancy X. R. Wang, Diwakar Mahajan, Marina Danilevsky, Sara Rosenthal

    Abstract: Understanding tables is an important and relevant task that involves understanding table structure as well as being able to compare and contrast information within cells. In this paper, we address this challenge by presenting a new dataset and tasks that addresses this goal in a shared task in SemEval 2020 Task 9: Fact Verification and Evidence Finding for Tabular Data in Scientific Documents (SEM… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: To Appear in SemEval 2021

  12. arXiv:2104.07646  [pdf, other

    cs.CL

    Are Multilingual BERT models robust? A Case Study on Adversarial Attacks for Multilingual Question Answering

    Authors: Sara Rosenthal, Mihaela Bornea, Avirup Sil

    Abstract: Recent approaches have exploited weaknesses in monolingual question answering (QA) models by adding adversarial statements to the passage. These attacks caused a reduction in state-of-the-art performance by almost 50%. In this paper, we are the first to explore and successfully attack a multilingual QA (MLQA) system pre-trained on multilingual BERT using several attack strategies for the adversari… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  13. arXiv:2101.10813  [pdf, other

    cs.RO cs.AI

    Impact of Explanation on Trust of a Novel Mobile Robot

    Authors: Stephanie Rosenthal, Elizabeth J. Carter

    Abstract: One challenge with introducing robots into novel environments is misalignment between supervisor expectations and reality, which can greatly affect a user's trust and continued use of the robot. We performed an experiment to test whether the presence of an explanation of expected robot behavior affected a supervisor's trust in an autonomous robot. We measured trust both subjectively through survey… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 9 pages, 3 figures

    Journal ref: Proceedings of the AAAI Fall Symposium Series - Artificial Intelligence for Human-Robot Interaction: Trust Explainability in Artificial Intelligence for Human-Robot Interaction AI-HRI (AI-HRI '20), November 13-14, 2020, Washington DC, USA

  14. arXiv:2012.05958  [pdf, ps, other

    cs.CL

    Multilingual Transfer Learning for QA Using Translation as Data Augmentation

    Authors: Mihaela Bornea, Lin Pan, Sara Rosenthal, Radu Florian, Avirup Sil

    Abstract: Prior work on multilingual question answering has mostly focused on using large multilingual pre-trained language models (LM) to perform zero-shot language-wise learning: train a QA model on English and test on other languages. In this work, we explore strategies that improve cross-lingual transfer by bringing the multilingual embeddings closer in the semantic space. Our first strategy augments th… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

    Journal ref: AAAI 2021

  15. Introducing a new high-resolution handwritten digits data set with writer characteristics

    Authors: Cédric Beaulac, Jeffrey S. Rosenthal

    Abstract: The contributions in this article are two-fold. First, we introduce a new hand-written digit data set that we collected. It contains high-resolution images of hand-written The contributions in this article are two-fold. First, we introduce a new handwritten digit data set that we collected. It contains high-resolution images of handwritten digits together with various writer characteristics which… ▽ More

    Submitted 13 April, 2022; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Data set available here : https://drive.google.com/drive/folders/1f2o1kjXLvcxRgtmMMuDkA2PQ5Zato4Or?usp=sharing

    Journal ref: SN COMPUT. SCI. 4, 66 (2023)

  16. arXiv:2006.07235  [pdf, ps, other

    cs.CL

    SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)

    Authors: Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çağrı Çöltekin

    Abstract: We present the results and main findings of SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social Media (OffensEval 2020). The task involves three subtasks corresponding to the hierarchical taxonomy of the OLID schema (Zampieri et al., 2019a) from OffensEval 2019. The task featured five languages: English, Arabic, Danish, Greek, and Turkish for Subtask A. In addition, En… ▽ More

    Submitted 30 September, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Proceedings of the International Workshop on Semantic Evaluation (SemEval-2020)

    MSC Class: 68T50; 68T07 ACM Class: I.2.7

  17. arXiv:2004.14454  [pdf, other

    cs.CL

    SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification

    Authors: Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri, Preslav Nakov

    Abstract: The widespread use of offensive content in social media has led to an abundance of research in detecting language such as hate speech, cyberbullying, and cyber-aggression. Recent work presented the OLID dataset, which follows a taxonomy for offensive language identification that provides meaningful information for understanding the type and the target of offensive messages. However, it is limited… ▽ More

    Submitted 24 September, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: offensive language, hate speech, cyberbullying, cyber-aggression, taxonomy for offensive language identification

    MSC Class: 68T50; 68T07 ACM Class: F.2.2; I.2.7

    Journal ref: ACL-2021 (Findings)

  18. arXiv:1912.06806  [pdf, other

    cs.CL cs.IR cs.LG

    SemEval-2013 Task 2: Sentiment Analysis in Twitter

    Authors: Preslav Nakov, Zornitsa Kozareva, Alan Ritter, Sara Rosenthal, Veselin Stoyanov, Theresa Wilson

    Abstract: In recent years, sentiment analysis in social media has attracted a lot of research interest and has been used for a number of applications. Unfortunately, research has been hindered by the lack of suitable datasets, complicating the comparison between approaches. To address this issue, we have proposed SemEval-2013 Task 2: Sentiment Analysis in Twitter, which included two subtasks: A, an expressi… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, microblog sentiment analysis, Twitter opinion mining, SMS

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2013

  19. arXiv:1912.02990  [pdf, ps, other

    cs.CL cs.IR cs.LG cs.SI

    SemEval-2014 Task 9: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Preslav Nakov, Alan Ritter, Veselin Stoyanov

    Abstract: We describe the Sentiment Analysis in Twitter task, ran as part of SemEval-2014. It is a continuation of the last year's task that ran successfully as part of SemEval-2013. As in 2013, this was the most popular SemEval task; a total of 46 teams contributed 27 submissions for subtask A (21 teams) and 50 submissions for subtask B (44 teams). This year, we introduced three new test sets: (i) regular… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, microblog sentiment analysis, Twitter opinion mining, sarcasm, LiveJournal, SMS

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2014

  20. arXiv:1912.02387  [pdf, other

    cs.CL cs.IR cs.LG

    SemEval-2015 Task 10: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Saif M Mohammad, Preslav Nakov, Alan Ritter, Svetlana Kiritchenko, Veselin Stoyanov

    Abstract: In this paper, we describe the 2015 iteration of the SemEval shared task on Sentiment Analysis in Twitter. This was the most popular sentiment analysis shared task to date with more than 40 teams participating in each of the last three years. This year's shared task competition consisted of five sentiment prediction subtasks. Two were reruns from previous years: (A) sentiment expressed by a phrase… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, sentiment towards a topic, quantification, microblog sentiment analysis; Twitter opinion mining

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2015

  21. arXiv:1912.01973  [pdf, other

    cs.CL cs.IR

    SemEval-2016 Task 4: Sentiment Analysis in Twitter

    Authors: Preslav Nakov, Alan Ritter, Sara Rosenthal, Fabrizio Sebastiani, Veselin Stoyanov

    Abstract: This paper discusses the fourth year of the ``Sentiment Analysis in Twitter Task''. SemEval-2016 Task 4 comprises five subtasks, three of which represent a significant departure from previous editions. The first two subtasks are reruns from prior years and ask to predict the overall sentiment, and the sentiment towards a topic in a tweet. The three new subtasks focus on two variants of the basic `… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: Sentiment analysis, sentiment towards a topic, quantification, microblog sentiment analysis; Twitter opinion mining. arXiv admin note: text overlap with arXiv:1912.00741

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: Final version published in the Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016), San Diego, US, 2016, pp. 1-18

  22. arXiv:1912.00741  [pdf, ps, other

    cs.CL cs.IR cs.LG

    SemEval-2017 Task 4: Sentiment Analysis in Twitter

    Authors: Sara Rosenthal, Noura Farra, Preslav Nakov

    Abstract: This paper describes the fifth year of the Sentiment Analysis in Twitter task. SemEval-2017 Task 4 continues with a rerun of the subtasks of SemEval-2016 Task 4, which include identifying the overall sentiment of the tweet, sentiment towards a topic with classification on a two-point and on a five-point ordinal scale, and quantification of the distribution of sentiment towards a topic across a num… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: sentiment analysis, Twitter, classification, quantification, ranking, English, Arabic

    Report number: SemEval-2017 MSC Class: 68T50 ACM Class: I.2.7

  23. arXiv:1908.02233  [pdf, ps, other

    math.DS cs.LG

    Koopman Representations of Dynamic Systems with Control

    Authors: Craig Bakker, Steven Rosenthal, Kathleen E. Nowak

    Abstract: The design and analysis of optimal control policies for dynamical systems can be complicated by nonlinear dependence in the state variables. Koopman operators have been used to simplify the analysis of dynamical systems by mapping the flow of the system onto a space of observables where the dynamics are linear (and possibly infinte). This paper focuses on the development of consistent Koopman repr… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

  24. arXiv:1903.08983  [pdf, other

    cs.CL

    SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval)

    Authors: Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar

    Abstract: We present the results and the main findings of SemEval-2019 Task 6 on Identifying and Categorizing Offensive Language in Social Media (OffensEval). The task was based on a new dataset, the Offensive Language Identification Dataset (OLID), which contains over 14,000 English tweets. It featured three sub-tasks. In sub-task A, the goal was to discriminate between offensive and non-offensive posts. I… ▽ More

    Submitted 26 April, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: Proceedings of the International Workshop on Semantic Evaluation (SemEval)

  25. arXiv:1902.09666  [pdf, ps, other

    cs.CL

    Predicting the Type and Target of Offensive Posts in Social Media

    Authors: Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar

    Abstract: As offensive content has become pervasive in social media, there has been much research in identifying potentially offensive messages. However, previous work on this topic did not consider the problem as a whole, but rather focused on detecting very specific types of offensive content, e.g., hate speech, cyberbulling, or cyber-aggression. In contrast, here we target several different kinds of offe… ▽ More

    Submitted 16 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)

  26. arXiv:1811.12323  [pdf, other

    stat.ML cs.LG

    A Deep Latent-Variable Model Application to Select Treatment Intensity in Survival Analysis

    Authors: Cédric Beaulac, Jeffrey S. Rosenthal, David Hodgson

    Abstract: In the following short article we adapt a new and popular machine learning model for inference on medical data sets. Our method is based on the Variational AutoEncoder (VAE) framework that we adapt to survival analysis on small data sets with missing values. In our model, the true health status appears as a set of latent variables that affects the observed covariates and the survival chances. We s… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/53

  27. arXiv:1810.08055  [pdf

    cs.OH cs.CY

    Ten Simple Rules for Reproducible Research in Jupyter Notebooks

    Authors: Adam Rule, Amanda Birmingham, Cristal Zuniga, Ilkay Altintas, Shih-Cheng Huang, Rob Knight, Niema Moshiri, Mai H. Nguyen, Sara Brin Rosenthal, Fernando Pérez, Peter W. Rose

    Abstract: Reproducibility of computational studies is a hallmark of scientific methodology. It enables researchers to build with confidence on the methods and findings of others, reuse and extend computational pipelines, and thereby drive scientific progress. Since many experimental studies rely on computational analyses, biologists need guidance on how to set up and document reproducible data analyses or s… ▽ More

    Submitted 13 October, 2018; originally announced October 2018.

  28. BEST : A decision tree algorithm that handles missing values

    Authors: Cédric Beaulac, Jeffrey S. Rosenthal

    Abstract: The main contribution of this paper is the development of a new decision tree algorithm. The proposed approach allows users to guide the algorithm through the data partitioning process. We believe this feature has many applications but in this paper we demonstrate how to utilize this algorithm to analyse data sets containing missing values. We tested our algorithm against simulated data sets with… ▽ More

    Submitted 14 April, 2020; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: To appear in Computational Statistics

    Journal ref: Computational Statistics 2020

  29. arXiv:1802.03675  [pdf, other

    cs.LG cs.CY stat.ML

    Understanding Convolutional Networks with APPLE : Automatic Patch Pattern Labeling for Explanation

    Authors: Sandeep Konam, Ian Quah, Stephanie Rosenthal, Manuela Veloso

    Abstract: With the success of deep learning, recent efforts have been focused on analyzing how learned networks make their classifications. We are interested in analyzing the network output based on the network structure and information flow through the network layers. We contribute an algorithm for 1) analyzing a deep network to find neurons that are 'important' in terms of the network classification outco… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

    Comments: AAAI/ACM Conference on AI, Ethics, and Society

  30. Predicting University Students' Academic Success and Major using Random Forests

    Authors: Cédric Beaulac, Jeffrey S. Rosenthal

    Abstract: In this article, a large data set containing every course taken by every undergraduate student in a major university in Canada over 10 years is analysed. Modern machine learning algorithms can use large data sets to build useful tools for the data provider, in this case, the university. In this article, two classifiers are constructed using random forests. To begin, the first two semesters of cour… ▽ More

    Submitted 12 January, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Journal ref: Research in Higher Education 2019

  31. arXiv:1709.08831  [pdf, other

    cs.RO cs.CV

    UAV and Service Robot Coordination for Indoor Object Search Tasks

    Authors: Sandeep Konam, Stephanie Rosenthal, Manuela Veloso

    Abstract: Our CoBot robots have successfully performed a variety of service tasks in our multi-building environment including accompanying people to meetings and delivering objects to offices due to its navigation and localization capabilities. However, they lack the capability to visually search over desks and other confined locations for an object of interest. Conversely, an inexpensive GPS-denied quadcop… ▽ More

    Submitted 26 September, 2017; originally announced September 2017.

    Comments: IJCAI-2016 Workshop on Autonomous Mobile Service Robots

  32. arXiv:1707.09641  [pdf, other

    cs.LG stat.ML

    Towards Visual Explanations for Convolutional Neural Networks via Input Resampling

    Authors: Benjamin J. Lengerich, Sandeep Konam, Eric P. Xing, Stephanie Rosenthal, Manuela Veloso

    Abstract: The predictive power of neural networks often costs model interpretability. Several techniques have been developed for explaining model outputs in terms of input features; however, it is difficult to translate such interpretations into actionable insight. Here, we propose a framework to analyze predictions in terms of the model's internal features by inspecting information flow through the network… ▽ More

    Submitted 16 August, 2017; v1 submitted 30 July, 2017; originally announced July 2017.

    Comments: Presented at ICML 2017 Workshop on Visualization for Deep Learning

  33. arXiv:1704.08759  [pdf, other

    cs.RO cs.CV

    Obstacle Avoidance through Deep Networks based Intermediate Perception

    Authors: Shichao Yang, Sandeep Konam, Chen Ma, Stephanie Rosenthal, Manuela Veloso, Sebastian Scherer

    Abstract: Obstacle avoidance from monocular images is a challenging problem for robots. Though multi-view structure-from-motion could build 3D maps, it is not robust in textureless environments. Some learning based methods exploit human demonstration to predict a steering command directly from a single image. However, this method is usually biased towards certain tasks or demonstration scenarios and also bi… ▽ More

    Submitted 27 April, 2017; originally announced April 2017.

  34. arXiv:1605.06154  [pdf, other

    cs.DL

    Web Infrastructure to Support e-Journal Preservation (and More)

    Authors: Herbert Van de Sompel, David S. H. Rosenthal, Michael L. Nelson

    Abstract: E-journal preservation systems have to ingest millions of articles each year. Ingest, especially of the "long tail" of journals from small publishers, is the largest element of their cost. Cost is the major reason that archives contain less than half the content they should. Automation is essential to minimize these costs. This paper examines the potential for automation beyond the status quo base… ▽ More

    Submitted 19 May, 2016; originally announced May 2016.

    Comments: 23 pages, 5 figures

    ACM Class: H.3.7

  35. arXiv:cs/0509018  [pdf, ps, other

    cs.DL

    Requirements for Digital Preservation Systems: A Bottom-Up Approach

    Authors: David S. H. Rosenthal, Thomas S. Robertson, Tom Lipkis, Vicky Reich, Seth Morabito

    Abstract: The field of digital preservation is being defined by a set of standards developed top-down, starting with an abstract reference model (OAIS) and gradually adding more specific detail. Systems claiming conformance to these standards are entering production use. Work is underway to certify that systems conform to requirements derived from OAIS. We complement these requirements derived top-down… ▽ More

    Submitted 6 September, 2005; v1 submitted 6 September, 2005; originally announced September 2005.

    ACM Class: H.3.7

  36. arXiv:cs/0508130  [pdf, ps, other

    cs.DL cs.DB cs.OS

    A Fresh Look at the Reliability of Long-term Digital Storage

    Authors: Mary Baker, Mehul Shah, David S. H. Rosenthal, Mema Roussopoulos, Petros Maniatis, TJ Giuli, Prashanth Bungale

    Abstract: Many emerging Web services, such as email, photo sharing, and web site archives, need to preserve large amounts of quickly-accessible data indefinitely into the future. In this paper, we make the case that these applications' demands on large scale storage systems over long time horizons require us to re-evaluate traditional storage system designs. We examine threats to long-lived data from an e… ▽ More

    Submitted 30 August, 2005; originally announced August 2005.

  37. arXiv:cs/0411078  [pdf, ps, other

    cs.DL

    Notes On The Design Of An Internet Adversary

    Authors: David S. H. Rosenthal, Petros Maniatis, Mema Roussopoulos, T. J. Giuli, Mary Baker

    Abstract: The design of the defenses Internet systems can deploy against attack, especially adaptive and resilient defenses, must start from a realistic model of the threat. This requires an assessment of the capabilities of the adversary. The design typically evolves through a process of simulating both the system and the adversary. This requires the design and implementation of a simulated adversary bas… ▽ More

    Submitted 21 November, 2004; originally announced November 2004.

    ACM Class: H.3.7

    Journal ref: Second Annual Adaptive and Resilient Computing Security Workshop, Santa Fe, 2003

  38. arXiv:cs/0411077  [pdf, ps, other

    cs.DL

    Transparent Format Migration of Preserved Web Content

    Authors: David S. H. Rosenthal, Thomas Lipkis, Thomas Robertson, Seth Morabito

    Abstract: The LOCKSS digital preservation system collects content by crawling the web and preserves it in the format supplied by the publisher. Eventually, browsers will no longer understand that format. A process called format migration converts it to a newer format that the browsers do understand. The LOCKSS program has designed and tested an initial implementation of format migration for Web content th… ▽ More

    Submitted 21 November, 2004; originally announced November 2004.

    Comments: 6 pages, 2 figures

    ACM Class: H.5.4; H.3.7

  39. arXiv:cs/0405111  [pdf, ps, other

    cs.CR

    Attrition Defenses for a Peer-to-Peer Digital Preservation System

    Authors: T. J. Giuli, Petros Maniatis, Mary Baker, David S. H. Rosenthal, Mema Roussopoulos

    Abstract: In peer-to-peer systems, attrition attacks include both traditional, network-level denial of service attacks as well as application-level attacks in which malign peers conspire to waste loyal peers' resources. We describe several defenses for LOCKSS, a peer-to-peer digital preservation system, that help ensure that application-level attacks even from powerful adversaries are less effective than… ▽ More

    Submitted 27 November, 2004; v1 submitted 28 May, 2004; originally announced May 2004.

    Comments: 14 pages, 8 figures. version 2: Reworked the paper according to reviews. Expanded the evaluation section with experiments with more AUs

    ACM Class: C.2.4; D.4.6

  40. arXiv:cs/0311017  [pdf, ps, other

    cs.NI cs.AR

    2 P2P or Not 2 P2P?

    Authors: Mema Roussopoulos, Mary Baker, David S. H. Rosenthal, TJ Giuli, Petros Maniatis, Jeff Mogul

    Abstract: In the hope of stimulating discussion, we present a heuristic decision tree that designers can use to judge the likely suitability of a P2P architecture for their applications. It is based on the characteristics of a wide range of P2P systems from the literature, both proposed and deployed.

    Submitted 14 November, 2003; originally announced November 2003.

    Comments: 6 pages, 1 figure

    ACM Class: C.2.4

  41. arXiv:cs/0311005  [pdf, ps, other

    cs.CR cs.DL

    On The Cost Distribution of a Memory Bound Function

    Authors: David S. H. Rosenthal

    Abstract: Memory Bound Functions have been proposed for fighting spam, resisting Sybil attacks and other purposes. A particular implementation of such functions has been proposed in which the average effort required to generate a proof of effort is set by parameters E and l to E * l. The distribution of effort required to generate an individual proof about this average is fairly broad. When particular use… ▽ More

    Submitted 6 November, 2003; originally announced November 2003.

    Comments: 8 pages

    Report number: LOCKSS TR2003-02 ACM Class: D.4.6

  42. arXiv:cs/0303033  [pdf, ps, other

    cs.DC cs.DL

    A Digital Preservation Appliance Based on OpenBSD

    Authors: David S. H. Rosenthal

    Abstract: The LOCKSS program has developed and deployed in a world-wide test a system for preserving access to academic journals published on the Web. The fundamental problem for any digital preservation system is that it must be affordable for the long term. To reduce the cost of ownership, the LOCKSS system uses generic PC hardware, open source software, and peer-to-peer technology. It is packaged as a… ▽ More

    Submitted 21 November, 2004; v1 submitted 30 March, 2003; originally announced March 2003.

    Comments: 12 pages

    ACM Class: D.4.5

    Journal ref: Proceedings of BSDcon, 2003

  43. arXiv:cs/0303026  [pdf, ps, other

    cs.DC cs.DL

    Preserving Peer Replicas By Rate-Limited Sampled Voting in LOCKSS

    Authors: Petros Maniatis, Mema Roussopoulos, TJ Giuli, David S. H. Rosenthal, Mary Baker, Yanto Muliadi

    Abstract: The LOCKSS project has developed and deployed in a world-wide test a peer-to-peer system for preserving access to journals and other archival information published on the Web. It consists of a large number of independent, low-cost, persistent web caches that cooperate to detect and repair damage to their content by voting in "opinion polls." Based on this experience, we present a design for and… ▽ More

    Submitted 17 October, 2003; v1 submitted 25 March, 2003; originally announced March 2003.

    Comments: 25 Pages, 10 figures. Extended version of conference paper

    ACM Class: C.2.4; H.3.7; D.4.5