Zum Hauptinhalt springen

Showing 1–50 of 56 results for author: Ewerth, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04515  [pdf

    cs.CV

    Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions

    Authors: Evelyn Navarrete, Ralph Ewerth, Anett Hoppe

    Abstract: Identifying the regions of a learning resource that a learner pays attention to is crucial for assessing the material's impact and improving its design and related support systems. Saliency detection in videos addresses the automatic recognition of attention-drawing regions in single frames. In educational settings, the recognition of pertinent regions in a video's visual stream can enhance conten… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2407.14321  [pdf, other

    cs.CL cs.IR cs.MM

    Multimodal Misinformation Detection using Large Vision-Language Models

    Authors: Sahar Tahmasebi, Eric Müller-Budack, Ralph Ewerth

    Abstract: The increasing proliferation of misinformation and its alarming impact have motivated both industry and academia to develop approaches for misinformation detection and fact checking. Recent advances on large language models (LLMs) have shown remarkable performance in various tasks, but whether and how LLMs could help with misinformation detection remains relatively underexplored. Most of existing… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in: Conference on Information and Knowledge Management (CIKM) 2024

  3. arXiv:2401.05148  [pdf, ps, other

    cs.IR

    On the Influence of Reading Sequences on Knowledge Gain during Web Search

    Authors: Wolfgang Gritz, Anett Hoppe, Ralph Ewerth

    Abstract: Nowadays, learning increasingly involves the usage of search engines and web resources. The related interdisciplinary research field search as learning aims to understand how people learn on the web. Previous work has investigated several feature classes to predict, for instance, the expected knowledge gain during web search. Therein, eye-tracking features have not been extensively studied so far.… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. Accepted at ECIR 2024

  4. arXiv:2307.10471  [pdf, other

    cs.CV cs.AI cs.DL cs.IR cs.LG

    Classification of Visualization Types and Perspectives in Patents

    Authors: Junaid Ahmed Ghauri, Eric Müller-Budack, Ralph Ewerth

    Abstract: Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted in International Conference on Theory and Practice of Digital Libraries (TPDL) 2023 (They have the copyright to publish camera-ready version of this work)

  5. arXiv:2305.18599  [pdf, other

    cs.CL cs.IR cs.LG cs.MM

    Improving Generalization for Multimodal Fake News Detection

    Authors: Sahar Tahmasebi, Sherzod Hakimov, Ralph Ewerth, Eric Müller-Budack

    Abstract: The increasing proliferation of misinformation and its alarming impact have motivated both industry and academia to develop approaches for fake news detection. However, state-of-the-art approaches are usually trained on datasets of smaller size or with a limited set of specific topics. As a consequence, these models lack generalization capabilities and are not applicable to real-world data. In thi… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted for ICMR 2023

  6. arXiv:2301.13617  [pdf, other

    cs.MM

    A Closer Look into Recent Video-based Learning Research: A Comprehensive Review of Video Characteristics, Tools, Technologies, and Learning Effectiveness

    Authors: Evelyn Navarrete, Andreas Nehring, Sascha Schanze, Ralph Ewerth, Anett Hoppe

    Abstract: People increasingly use videos on the Web as a source for learning. To support this way of learning, researchers and developers are continuously developing tools, proposing guidelines, analyzing data, and conducting experiments. However, it is still not clear what characteristics a video should have to be an effective learning medium. In this paper, we present a comprehensive review of 257 article… ▽ More

    Submitted 11 August, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

  7. Predicting Knowledge Gain for MOOC Video Consumption

    Authors: Christian Otto, Markos Stamatakis, Anett Hoppe, Ralph Ewerth

    Abstract: Informal learning on the Web using search engines as well as more structured learning on MOOC platforms have become very popular in recent years. As a result of the vast amount of available learning resources, intelligent retrieval and recommendation methods are indispensable -- this is true also for MOOC videos. However, the automatic assessment of this content with regard to predicting (potentia… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 13 pages, 1 figure, 3 tables

    Journal ref: AIED 2022. Lecture Notes in Computer Science, vol 13356, pp. 458-462

  8. arXiv:2211.08042  [pdf, other

    cs.IR

    MM-Locate-News: Multimodal Focus Location Estimation in News

    Authors: Golsa Tahmasebzadeh, Eric Müller-Budack, Sherzod Hakimov, Ralph Ewerth

    Abstract: The consumption of news has changed significantly as the Web has become the most influential medium for information. To analyze and contextualize the large amount of news published every day, the geographic focus of an article is an important aspect in order to enable content-based news retrieval. There are methods and datasets for geolocation estimation from text or photos, but they are typically… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  9. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao , et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  10. arXiv:2207.11709  [pdf, other

    cs.CV

    TVCalib: Camera Calibration for Sports Field Registration in Soccer

    Authors: Jonas Theiner, Ralph Ewerth

    Abstract: Sports field registration in broadcast videos is typically interpreted as the task of homography estimation, which provides a mapping between a planar field and the corresponding visible area of the image. In contrast to previous approaches, we consider the task as a camera calibration problem. First, we introduce a differentiable objective function that is able to learn the camera pose and focal… ▽ More

    Submitted 1 October, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted for publication at WACV'23

  11. arXiv:2207.02976  [pdf, other

    cs.CV cs.IR

    Semi-supervised Human Pose Estimation in Art-historical Images

    Authors: Matthias Springstein, Stefanie Schneider, Christian Althaus, Ralph Ewerth

    Abstract: Gesture as language of non-verbal communication has been theoretically established since the 17th century. However, its relevance for the visual arts has been expressed only sporadically. This may be primarily due to the sheer overwhelming amount of data that traditionally had to be processed by hand. With the steady progress of digitization, though, a growing number of historical artifacts have b… ▽ More

    Submitted 15 August, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted at ACM MM 2022 as a conference paper

  12. arXiv:2205.01989  [pdf, other

    cs.CL cs.AI cs.CV cs.MM cs.SI

    MM-Claims: A Dataset for Multimodal Claim Detection in Social Media

    Authors: Gullal S. Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth

    Abstract: In recent years, the problem of misinformation on the web has become widespread across languages, countries, and various social media platforms. Although there has been much work on automated fake news detection, the role of images and their variety are not well explored. In this paper, we investigate the roles of image and text at an earlier stage of the fake news detection pipeline, called claim… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of NAACL 2022

  13. arXiv:2204.06299  [pdf, other

    cs.CL cs.AI cs.CV

    TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes

    Authors: Sherzod Hakimov, Gullal S. Cheema, Ralph Ewerth

    Abstract: The detection of offensive, hateful content on social media is a challenging problem that affects many online users on a daily basis. Hateful content is often used to target a group of people based on ethnicity, gender, religion and other factors. The hate or contempt toward women has been increasing on social platforms. Misogynous content detection is especially challenging when textual and visua… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted for publication at SemEval-2022 Workshop, Task 5: MAMI - Multimedia Automatic Misogyny Identification co-located with NAACL 2022

  14. SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web Search

    Authors: Christian Otto, Markus Rokicki, Georg Pardi, Wolfgang Gritz, Daniel Hienert, Ran Yu, Johannes von Hoyer, Anett Hoppe, Stefan Dietze, Peter Holtz, Yvonne Kammerer, Ralph Ewerth

    Abstract: The emerging research field Search as Learning investigates how the Web facilitates learning through modern information retrieval systems. SAL research requires significant amounts of data that capture both search behavior of users and their acquired knowledge in order to obtain conclusive insights or train supervised machine learning models. However, the creation of such datasets is costly and re… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: To be published at the 2022 ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR '22)

  15. arXiv:2112.04803  [pdf, other

    cs.CL cs.LG

    Combining Textual Features for the Detection of Hateful and Offensive Language

    Authors: Sherzod Hakimov, Ralph Ewerth

    Abstract: The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: HASOC 2021, Forum for Information Retrieval Evaluation, 2021

  16. arXiv:2110.11107  [pdf, other

    cs.CV

    Extraction of Positional Player Data from Broadcast Soccer Videos

    Authors: Jonas Theiner, Wolfgang Gritz, Eric Müller-Budack, Robert Rein, Daniel Memmert, Ralph Ewerth

    Abstract: Computer-aided support and analysis are becoming increasingly important in the modern world of sports. The scouting of potential prospective players, performance as well as match analysis, and the monitoring of training programs rely more and more on data-driven technologies to ensure success. Therefore, many approaches require large amounts of data, which are, however, not easy to obtain in gener… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted for publication at WACV'22; Preprint

  17. arXiv:2108.11149  [pdf, other

    cs.CV

    A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games

    Authors: Henrik Biermann, Jonas Theiner, Manuel Bassek, Dominik Raabe, Daniel Memmert, Ralph Ewerth

    Abstract: The automatic detection of events in complex sports games like soccer and handball using positional or video data is of large interest in research and industry. One requirement is a fundamental understanding of underlying concepts, i.e., events that occur on the pitch. Previous work often deals only with so-called low-level events based on well-defined rules such as free kicks, free throws, or goa… ▽ More

    Submitted 26 August, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

  18. iART: A Search Engine for Art-Historical Images to Support Research in the Humanities

    Authors: Matthias Springstein, Stefanie Schneider, Javad Rahnama, Eyke Hüllermeier, Hubertus Kohle, Ralph Ewerth

    Abstract: In this paper, we introduce iART: an open Web platform for art-historical research that facilitates the process of comparative vision. The system integrates various machine learning techniques for keyword- and content-based image retrieval as well as category formation via clustering. An intuitive GUI supports users to define queries and explore results. By using a state-of-the-art cross-modal dee… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Journal ref: ACM Multimedia Conference 2021

  19. Evaluation of Automated Image Descriptions for Visually Impaired Students

    Authors: Anett Hoppe, David Morris, Ralph Ewerth

    Abstract: Illustrations are widely used in education, and sometimes, alternatives are not available for visually impaired students. Therefore, those students would benefit greatly from an automatic illustration description system, but only if those descriptions were complete, correct, and easily understandable using a screenreader. In this paper, we report on a study for the assessment of automated image de… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 6 pages, 12 references. Accepted for publication at the 22nd International Conference on Artificial Intelligence in Education (AIED 2021), June 14-16 2021, Utrecht, The Netherlands

    ACM Class: H.5.2; I.4; K.3.1

    Journal ref: Hoppe A., Morris D., Ewerth R. (2021) Evaluation of Automated Image Descriptions for Visually Impaired Students. In: Roll I., McNamara D., Sosnovsky S., Luckin R., Dimitrova V. (eds) AIED 2021. LNCS vol 12749. Springer, Cham

  20. arXiv:2106.09432  [pdf, other

    cs.CV cs.LG

    Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

    Authors: Matthias Springstein, Eric Müller-Budack, Ralph Ewerth

    Abstract: The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system tha… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in: ACM International Conference on Multimedia Retrieval (ICMR) Workshop 2021

  21. arXiv:2106.08829  [pdf, other

    cs.SI cs.CL cs.CV

    A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods

    Authors: Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Opinion and sentiment analysis is a vital task to characterize subjective information in social media posts. In this paper, we present a comprehensive experimental evaluation and comparison with six state-of-the-art methods, from which we have re-implemented one of them. In addition, we investigate different textual and visual feature embeddings that cover different aspects of the content, as well… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted in Workshop on Multi-ModalPre-Training for Multimedia Understanding (MMPT 2021), co-located with ICMR 2021

  22. arXiv:2106.06244  [pdf, other

    cs.IR

    Predicting Knowledge Gain during Web Search based on Multimedia Resource Consumption

    Authors: Christian Otto, Ran Yu, Georg Pardi, Johannes von Hoyer, Markus Rokicki, Anett Hoppe, Peter Holtz, Yvonne Kammerer, Stefan Dietze, Ralph Ewerth

    Abstract: In informal learning scenarios the popularity of multimedia content, such as video tutorials or lectures, has significantly increased. Yet, the users' interactions, navigation behavior, and consequently learning outcome, have not been researched extensively. Related work in this field, also called search as learning, has focused on behavioral or text resource features to predict learning outcome a… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: 13 pages, 2 figures, 2 tables

  23. arXiv:2106.05633  [pdf, other

    cs.DL cs.IR

    Citation Recommendation for Research Papers via Knowledge Graphs

    Authors: Arthur Brack, Anett Hoppe, Ralph Ewerth

    Abstract: Citation recommendation for research papers is a valuable task that can help researchers improve the quality of their work by suggesting relevant related work. Current approaches for this task rely primarily on the text of the papers and the citation network. In this paper, we propose to exploit an additional source of information, namely research knowledge graphs (KG) that interlink research pape… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in 25th International Conference on Theory and Practice of Digital Libraries (TPDL), 2021

  24. arXiv:2105.12532  [pdf, other

    cs.CV cs.AI

    Unsupervised Video Summarization via Multi-source Features

    Authors: Hussain Kanafani, Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: Video summarization aims at generating a compact yet representative visual summary that conveys the essence of the original video. The advantage of unsupervised approaches is that they do not require human annotations to learn the summarization capability and generalize to a wider range of domains. Previous work relies on the same type of deep features, typically based on a model pre-trained on Im… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at the ACM International Conference on Multimedia Retrieval (ICMR) 2021

  25. arXiv:2104.14995  [pdf, other

    cs.CV

    Interpretable Semantic Photo Geolocation

    Authors: Jonas Theiner, Eric Mueller-Budack, Ralph Ewerth

    Abstract: Planet-scale photo geolocalization is the complex task of estimating the location depicted in an image solely based on its visual content. Due to the success of convolutional neural networks (CNNs), current approaches achieve super-human performance. However, previous work has exclusively focused on optimizing geolocalization accuracy. Due to the black-box property of deep learning systems, their… ▽ More

    Submitted 20 October, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted for publication at WACV'22

  26. arXiv:2104.14994  [pdf, other

    cs.IR cs.MM

    GeoWINE: Geolocation based Wiki, Image,News and Event Retrieval

    Authors: Golsa Tahmasebzadeh, Endri Kacupaj, Eric Müller-Budack, Sherzod Hakimov, Jens Lehmann, Ralph Ewerth

    Abstract: In the context of social media, geolocation inference on news or events has become a very important task. In this paper, we present the GeoWINE (Geolocation-based Wiki-Image-News-Event retrieval) demonstrator, an effective modular system for multimodal retrieval which expects only a single image as input. The GeoWINE system consists of five modules in order to retrieve related information from var… ▽ More

    Submitted 4 May, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in: International ACM SIGIR Conference on Research and Development in Information Retrieval 2021

  27. arXiv:2104.13748  [pdf, other

    cs.IR cs.MM

    QuTI! Quantifying Text-Image Consistency in Multimodal Documents

    Authors: Matthias Springstein, Eric Müller-Budack, Ralph Ewerth

    Abstract: The World Wide Web and social media platforms have become popular sources for news and information. Typically, multimodal information, e.g., image and text is used to convey information more effectively and to attract attention. While in most cases image content is decorative or depicts additional information, it has also been leveraged to spread misinformation and rumors in recent years. In this… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in: International ACM SIGIR Conference on Research and Development in Information Retrieval 2021

  28. arXiv:2104.11530  [pdf, other

    cs.CV cs.AI cs.IR cs.LG cs.MM

    Supervised Video Summarization via Multiple Feature Sets with Parallel Attention

    Authors: Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: The assignment of importance scores to particular frames or (short) segments in a video is crucial for summarization, but also a difficult task. Previous work utilizes only one source of visual features. In this paper, we suggest a novel model architecture that combines three feature sets for visual content and motion to predict importance scores. The proposed architecture utilizes an attention me… ▽ More

    Submitted 13 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted in IEEE International Conference on Multimedia and Expo (ICME) 2021 (They have copyright to publish camera ready version of this work)

  29. arXiv:2103.09602  [pdf, other

    cs.SI cs.CL cs.CV

    On the Role of Images for Analyzing Claims in Social Media

    Authors: Gullal S. Cheema, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Fake news is a severe problem in social media. In this paper, we present an empirical study on visual, textual, and multimodal models for the tasks of claim, claim check-worthiness, and conspiracy detection, all of which are related to fake news detection. Recent work suggests that images are more influential than text and often appear alongside fake text. To this end, several multimodal models ha… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: CLEOPATRA-2021 Workshop co-located with The Web Conf 2021

  30. arXiv:2102.06021  [pdf, other

    cs.DL cs.IR

    Analysing the Requirements for an Open Research Knowledge Graph: Use Cases, Quality Requirements and Construction Strategies

    Authors: Arthur Brack, Anett Hoppe, Markus Stocker, Sören Auer, Ralph Ewerth

    Abstract: Current science communication has a number of drawbacks and bottlenecks which have been subject of discussion lately: Among others, the rising number of published articles makes it nearly impossible to get a full overview of the state of the art in a certain field, or reproducibility is hampered by fixed-length, document-based publications which normally cannot cover all details of a research work… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:2005.10334

  31. arXiv:2102.06008  [pdf, other

    cs.CL cs.IR cs.LG

    Cross-Domain Multi-Task Learning for Sequential Sentence Classification in Research Papers

    Authors: Arthur Brack, Anett Hoppe, Pascal Buschermöhle, Ralph Ewerth

    Abstract: Sequential sentence classification deals with the categorisation of sentences based on their content and context. Applied to scientific texts, it enables the automatic structuring of research papers and the improvement of academic search engines. However, previous work has not investigated the potential of transfer learning for sentence classification across different scientific domains and the is… ▽ More

    Submitted 18 March, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2022

  32. arXiv:2101.03529  [pdf, other

    cs.SI cs.CL

    TIB's Visual Analytics Group at MediaEval '20: Detecting Fake News on Corona Virus and 5G Conspiracy

    Authors: Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth

    Abstract: Fake news on social media has become a hot topic of research as it negatively impacts the discourse of real news in the public. Specifically, the ongoing COVID-19 pandemic has seen a rise of inaccurate and misleading information due to the surrounding controversies and unknown details at the beginning of the pandemic. The FakeNews task at MediaEval 2020 tackles this problem by creating a challenge… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: MediaEval 2020 Fake News Task

  33. arXiv:2101.00884  [pdf, ps, other

    cs.IR

    Coreference Resolution in Research Papers from Multiple Domains

    Authors: Arthur Brack, Daniel Uwe Müller, Anett Hoppe, Ralph Ewerth

    Abstract: Coreference resolution is essential for automatic text understanding to facilitate high-level information retrieval tasks such as text summarisation or question answering. Previous work indicates that the performance of state-of-the-art approaches (e.g. based on BERT) noticeably declines when applied to scientific papers. In this paper, we investigate the task of coreference resolution in research… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in 43rd European Conference on Information Retrieval (ECIR), 2021

  34. arXiv:2011.04714  [pdf, other

    cs.CV

    Ontology-driven Event Type Classification in Images

    Authors: Eric Müller-Budack, Matthias Springstein, Sherzod Hakimov, Kevin Mrutzek, Ralph Ewerth

    Abstract: Event classification can add valuable information for semantic search and the increasingly important topic of fact validation in news. So far, only few approaches address image classification for newsworthy event types such as natural disasters, sports events, or elections. Previous work distinguishes only between a limited number of event types and relies on rather small datasets for training. In… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in: IEEE Winter Conference on Applications of Computer Vision (WACV) 2021

  35. arXiv:2010.13626  [pdf, other

    cs.CV cs.LG

    Classification of Important Segments in Educational Videos using Multimodal Features

    Authors: Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth

    Abstract: Videos are a commonly-used type of content in learning during Web search. Many e-learning platforms provide quality content, but sometimes educational videos are long and cover many topics. Humans are good in extracting important sections from videos, but it remains a significant challenge for computers. In this paper, we address the problem of assigning importance scores to video segments, that i… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Proceedings of the CIKM 2020 Workshops, October 19 to 20, Galway, Ireland

  36. arXiv:2010.13118  [pdf, other

    cs.CV cs.LG

    Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model

    Authors: Julian Lienen, Eyke Hüllermeier, Ralph Ewerth, Nils Nommensen

    Abstract: In many real-world applications, the relative depth of objects in an image is crucial for scene understanding. Recent approaches mainly tackle the problem of depth prediction in monocular images by treating the problem as a regression task. Yet, being interested in an order relation in the first place, ranking methods suggest themselves as a natural alternative to regression, and indeed, ranking a… ▽ More

    Submitted 7 July, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: 15 pages, 5 figures, 7 tables, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 14595-14604

  37. MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities

    Authors: Jason Armitage, Endri Kacupaj, Golsa Tahmasebzadeh, Swati, Maria Maleshkova, Ralph Ewerth, Jens Lehmann

    Abstract: In this paper, we introduce the MLM (Multiple Languages and Modalities) dataset - a new resource to train and evaluate multitask systems on samples in multiple modalities and three languages. The generation process and inclusion of semantic data provide a resource that further tests the ability for multitask systems to learn relationships between entities. The dataset is designed for researchers a… ▽ More

    Submitted 4 September, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

    Journal ref: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 2967-2974. 2020

  38. arXiv:2007.10534  [pdf, other

    cs.CL cs.SI

    Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features

    Authors: Gullal S. Cheema, Sherzod Hakimov, Ralph Ewerth

    Abstract: In this digital age of news consumption, a news reader has the ability to react, express and share opinions with others in a highly interactive and fast manner. As a consequence, fake news has made its way into our daily life because of very limited capacity to verify news on the Internet by large companies as well as individuals. In this paper, we focus on solving two problems which are part of t… ▽ More

    Submitted 20 September, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: CLEF2020-CheckThat!

  39. arXiv:2007.06390  [pdf, other

    cs.CL cs.IR cs.LG

    A Feature Analysis for Multimodal News Retrieval

    Authors: Golsa Tahmasebzadeh, Sherzod Hakimov, Eric Müller-Budack, Ralph Ewerth

    Abstract: Content-based information retrieval is based on the information contained in documents rather than using metadata such as keywords. Most information retrieval methods are either based on text or image. In this paper, we investigate the usefulness of multimodal features for cross-lingual news search in various domains: politics, health, environment, sport, and finance. To this end, we consider five… ▽ More

    Submitted 1 October, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: CLEOPATRA Workshop co-located with ESWC 2020

    Journal ref: CLEOPATRA Workshop co-located with ESWC 2020

  40. Investigating Correlations of Automatically Extracted Multimodal Features and Lecture Video Quality

    Authors: Jianwei Shi, Christian Otto, Anett Hoppe, Peter Holtz, Ralph Ewerth

    Abstract: Ranking and recommendation of multimedia content such as videos is usually realized with respect to the relevance to a user query. However, for lecture videos and MOOCs (Massive Open Online Courses) it is not only required to retrieve relevant videos, but particularly to find lecture videos of high quality that facilitate learning, for instance, independent of the video's or speaker's popularity.… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    ACM Class: H.5.1

    Journal ref: SALMM '19: Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information, co-located with ACM Multimedia 2019

  41. arXiv:2005.10595  [pdf, other

    cs.CY

    A Recommender System For Open Educational Videos Based On Skill Requirements

    Authors: Mohammadreza Tavakoli, Sherzod Hakimov, Ralph Ewerth, Gábor Kismihók

    Abstract: In this paper, we suggest a novel method to help learners find relevant open educational videos to master skills demanded on the labour market. We have built a prototype, which 1) applies text classification and text mining methods on job vacancy announcements to match jobs and their required skills; 2) predicts the quality of videos; and 3) creates an open educational video recommender system to… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: This paper has been accepted to be published in the proceedings of International Conference on Advanced Learning Technologies (ICALT) 2020 by IEEE Computer Society

  42. Requirements Analysis for an Open Research Knowledge Graph

    Authors: Arthur Brack, Anett Hoppe, Markus Stocker, Sören Auer, Ralph Ewerth

    Abstract: Current science communication has a number of drawbacks and bottlenecks which have been subject of discussion lately: Among others, the rising number of published articles makes it nearly impossible to get an overview of the state of the art in a certain field, or reproducibility is hampered by fixed-length, document-based publications which normally cannot cover all details of a research work. Re… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted for publishing in 24th International Conference on Theory and Practice of Digital Libraries, TPDL 2020

    Journal ref: Digital Libraries for Open Knowledge. TPDL 2020. Lecture Notes in Computer Science, vol 12246. Springer, Cham

  43. arXiv:2003.10421  [pdf, other

    cs.CL cs.IR cs.MM

    Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

    Authors: Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth

    Abstract: The World Wide Web has become a popular source for gathering information and news. Multimodal information, e.g., enriching text with photos, is typically used to convey the news more effectively or to attract attention. Photo content can range from decorative, depict additional important information, or can even contain misleading information. Therefore, automatic approaches to quantify cross-moda… ▽ More

    Submitted 23 October, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in: International Conference on Multimedia Retrieval (ICMR), Dublin, 2020

  44. arXiv:2003.01006  [pdf, other

    cs.IR cs.AI cs.CL cs.DL

    The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources

    Authors: Jennifer D'Souza, Anett Hoppe, Arthur Brack, Mohamad Yaser Jaradeh, Sören Auer, Ralph Ewerth

    Abstract: We introduce the STEM (Science, Technology, Engineering, and Medicine) Dataset for Scientific Entity Extraction, Classification, and Resolution, version 1.0 (STEM-ECR v1.0). The STEM-ECR v1.0 dataset has been developed to provide a benchmark for the evaluation of scientific entity extraction, classification, and resolution tasks in a domain-independent fashion. It comprises abstracts in 10 STEM di… ▽ More

    Submitted 28 July, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Published in LREC 2020. Publication URL https://www.aclweb.org/anthology/2020.lrec-1.268/; Dataset DOI https://doi.org/10.25835/0017546

  45. arXiv:2001.06823  [pdf, other

    cs.CV cs.IR

    SlideImages: A Dataset for Educational Image Classification

    Authors: David Morris, Eric Müller-Budack, Ralph Ewerth

    Abstract: In the past few years, convolutional neural networks (CNNs) have achieved impressive results in computer vision tasks, which however mainly focus on photos with natural scene content. Besides, non-sensor derived images such as illustrations, data visualizations, figures, etc. are typically used to convey complex information or to explore large datasets. However, this kind of images has received li… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 8 pages, 2 figures, to be presented at ECIR 2020

  46. Domain-independent Extraction of Scientific Concepts from Research Articles

    Authors: Arthur Brack, Jennifer D'Souza, Anett Hoppe, Sören Auer, Ralph Ewerth

    Abstract: We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions. First, we suggest a set of generic scientific concepts that have been identified in a systematic annotation process. This set of concepts is utilised to annotate a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine at the… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: Accepted for publishing in 42nd European Conference on IR Research, ECIR 2020

    Journal ref: Advances in Information Retrieval. 2020

  47. Visual Summarization of Scholarly Videos using Word Embeddings and Keyphrase Extraction

    Authors: Hang Zhou, Christian Otto, Ralph Ewerth

    Abstract: Effective learning with audiovisual content depends on many factors. Besides the quality of the learning resource's content, it is essential to discover the most relevant and suitable video in order to support the learning process most effectively. Video summarization techniques facilitate this goal by providing a quick overview over the content. It is especially useful for longer recordings such… ▽ More

    Submitted 25 November, 2019; originally announced December 2019.

    Comments: 12 pages, 5 figures

  48. arXiv:1910.00412  [pdf, other

    cs.LG cs.AI

    "Does 4-4-2 exist?" -- An Analytics Approach to Understand and Classify Football Team Formations in Single Match Situations

    Authors: Eric Müller-Budack, Jonas Theiner, Robert Rein, Ralph Ewerth

    Abstract: The chances to win a football match can be significantly increased if the right tactic is chosen and the behavior of the opposite team is well anticipated. For this reason, every professional football club employs a team of game analysts. However, at present game performance analysis is done manually and therefore highly time-consuming. Consequently, automated tools to support the analysis process… ▽ More

    Submitted 2 September, 2019; originally announced October 2019.

    Comments: Accepted at MMSports 2019 (Workshop of ACM Multimedia 2019)

  49. arXiv:1907.10450  [pdf, other

    cs.CV cs.DL cs.IR cs.MM

    Investigating Correlations of Inter-coder Agreement and Machine Annotation Performance for Historical Video Data

    Authors: Kader Pustu-Iren, Markus Mühling, Nikolaus Korfhage, Joanna Bars, Sabrina Bernhöft, Angelika Hörth, Bernd Freisleben, Ralph Ewerth

    Abstract: Video indexing approaches such as visual concept classification and person recognition are essential to enable fine-grained semantic search in large-scale video archives such as the historical video collection of former German Democratic Republic (GDR) maintained by the German Broadcasting Archive (DRA). Typically, a lexicon of visual concepts has to be defined for semantic search. However, the de… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

  50. Understanding, Categorizing and Predicting Semantic Image-Text Relations

    Authors: Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth

    Abstract: Two modalities are often used to convey information in a complementary and beneficial manner, e.g., in online news, videos, educational resources, or scientific publications. The automatic understanding of semantic correlations between text and associated images as well as their interplay has a great potential for enhanced multimodal web search and recommender systems. However, automatic understan… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 8 pages, 8 Figures, 5 tables

    Journal ref: In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). ACM, New York, NY, USA, 168-176