Showing 1–50 of 54 results for author: Lange, C

Searching in archive cs.
  1. arXiv:2408.15956  [pdf, other]

    q-bio.QM cs.CV cs.LG

    Generating Binary Species Range Maps

    Authors: Filip Dorm, Christian Lange, Scott Loarie, Oisin Mac Aodha

    Abstract: Accurately predicting the geographic ranges of species is crucial for assisting conservation efforts. Traditionally, range maps were manually created by experts. However, species distribution models (SDMs) and, more recently, deep learning-based variants offer a potential automated alternative. Deep learning-based SDMs generate a continuous probability representing the predicted presence of a spec…

    Submitted 28 August, 2024; originally announced August 2024.

    Journal ref: Workshop on Computer Vision for Ecology at ECCV 2024

  2. arXiv:2403.15451  [pdf, other]

    cs.CL

    Towards Enabling FAIR Dataspaces Using Large Language Models

    Authors: Benedikt T. Arnold, Johannes Theissen-Lipp, Diego Collarana, Christoph Lange, Sandra Geisler, Edward Curry, Stefan Decker

    Abstract: Dataspaces have recently gained adoption across various sectors, including traditionally less digitized domains such as culture. Leveraging Semantic Web technologies helps to make dataspaces FAIR, but their complexity poses a significant challenge to the adoption of dataspaces and increases their cost. The advent of Large Language Models (LLMs) raises the question of how these models can support t…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 8 pages. Preprint. Under review

  3. arXiv:2402.03812  [pdf, other]

    cs.DC

    FDO Manager: Minimum Viable FAIR Digital Object Implementation

    Authors: Oussama Zoubia, Zeyd Boukhers, Nagaraj Bahubali Asundi, Sezin Dogan, Adamantios Koumpis, Christoph Lange, Oya Beyan

    Abstract: The concept of FAIR Digital Objects (FDOs) aims to revolutionise the field of digital preservation and accessibility in the next few years. Central to this revolution is the alignment of FDOs with the FAIR (Findable, Accessible, Interoperable, Reusable) Principles, particularly emphasizing machine-actionability and interoperability across diverse data ecosystems. This abstract introduces the "FDO…

    Submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2402.00851  [pdf, other]

    cs.LG q-bio.QM

    Data Augmentation Scheme for Raman Spectra with Highly Correlated Annotations

    Authors: Christoph Lange, Isabel Thiele, Lara Santolin, Sebastian L. Riedel, Maxim Borisyak, Peter Neubauer, M. Nicolas Cruz Bournazou

    Abstract: In biotechnology Raman Spectroscopy is rapidly gaining popularity as a process analytical technology (PAT) that measures cell densities, substrate- and product concentrations. As it records vibrational modes of molecules it provides that information non-invasively in a single spectrum. Typically, partial least squares (PLS) is the model of choice to infer information about variables of interest fr…

    Submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2401.09199  [pdf, other]

    cs.DC cs.DB

    Data Trading and Monetization: Challenges and Open Research Directions

    Authors: Qusai Ramadan, Zeyd Boukhers, Muath AlShaikh, Christoph Lange, Jan Jürjens

    Abstract: Traditional data monetization approaches face challenges related to data protection and logistics. In response, digital data marketplaces have emerged as intermediaries simplifying data transactions. Despite the growing establishment and acceptance of digital data marketplaces, significant challenges hinder efficient data trading. As a result, few companies can derive tangible value from their dat…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Paper accepted by the International Conference on Future Networks and Distributed Systems (ICFNDS 2023)

  6. arXiv:2312.02858  [pdf, other]

    cs.LG cs.AI physics.ao-ph stat.ME

    Towards Causal Representations of Climate Model Data

    Authors: Julien Boussard, Chandni Nagda, Julia Kaltenborn, Charlotte Emilie Elektra Lange, Philippe Brouillard, Yaniv Gurwicz, Peer Nowack, David Rolnick

    Abstract: Climate models, such as Earth system models (ESMs), are crucial for simulating future climate change based on projected Shared Socioeconomic Pathways (SSP) greenhouse gas emissions scenarios. While ESMs are sophisticated and invaluable, machine learning-based emulators trained on existing simulation data can project additional climate scenarios much faster and are computationally efficient. Howeve…

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  7. arXiv:2311.03721  [pdf, other]

    cs.LG cs.AI cs.CE physics.ao-ph

    ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning

    Authors: Julia Kaltenborn, Charlotte E. E. Lange, Venkatesh Ramesh, Philippe Brouillard, Yaniv Gurwicz, Chandni Nagda, Jakob Runge, Peer Nowack, David Rolnick

    Abstract: Climate models have been key for assessing the impact of climate change and simulating future climate scenarios. The machine learning (ML) community has taken an increased interest in supporting climate scientists' efforts on various tasks such as climate model emulation, downscaling, and prediction tasks. Many of those tasks have been addressed on datasets created with single climate models. Howe…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: To be published in the 37th Conference on Neural Information Processing Systems (NeurIPS 2023): Track on Datasets and Benchmarks. Project website: https://climateset.github.io/

  8. arXiv:2311.02061  [pdf, other]

    cs.LG

    Active Learning-Based Species Range Estimation

    Authors: Christian Lange, Elijah Cole, Grant Van Horn, Oisin Mac Aodha

    Abstract: We propose a new active learning approach for efficiently estimating the geographic range of a species from a limited number of on the ground observations. We model the range of an unmapped species of interest as the weighted combination of estimated ranges obtained from a set of different species. We show that it is possible to generate this candidate set of ranges by using models that have been…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  9. arXiv:2306.02564  [pdf, other]

    cs.LG cs.CV

    Spatial Implicit Neural Representations for Global-Scale Species Mapping

    Authors: Elijah Cole, Grant Van Horn, Christian Lange, Alexander Shepard, Patrick Leary, Pietro Perona, Scott Loarie, Oisin Mac Aodha

    Abstract: Estimating the geographical range of a species from sparse observations is a challenging and important geospatial prediction problem. Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location. This problem has a long history in ecology, but traditional methods struggle to take advantage of emerging l…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  10. arXiv:2303.08932  [pdf, other]

    cs.DB cs.DC cs.IT cs.LG

    Enhancing Data Space Semantic Interoperability through Machine Learning: a Visionary Perspective

    Authors: Zeyd Boukhers, Christoph Lange, Oya Beyan

    Abstract: Our vision paper outlines a plan to improve the future of semantic interoperability in data spaces through the application of machine learning. The use of data spaces, where data is exchanged among members in a self-regulated environment, is becoming increasingly popular. However, the current manual practices of managing metadata and vocabularies in these spaces are time-consuming, prone to errors…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted for publication @ The First International Workshop on Semantics in Dataspaces (In conjunction with The Web Conference - WWW 2023)

  11. arXiv:2212.13261  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Explainable AI for Bioinformatics: Methods, Tools, and Applications

    Authors: Md. Rezaul Karim, Tanhim Islam, Oya Beyan, Christoph Lange, Michael Cochez, Dietrich Rebholz-Schuhmann, Stefan Decker

    Abstract: Artificial intelligence (AI) systems utilizing deep neural networks (DNNs) and machine learning (ML) algorithms are widely used for solving important problems in bioinformatics, biomedical informatics, and precision medicine. However, complex DNNs or ML models, which are often perceived as opaque and black-box, can make it difficult to understand the reasoning behind their decisions. This lack of…

    Submitted 23 February, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

  12. arXiv:2201.05070  [pdf, other]

    cs.LG cs.CY

    Applying Machine Learning and AI Explanations to Analyze Vaccine Hesitancy

    Authors: Carsten Lange, Jian Lange

    Abstract: The paper quantifies the impact of race, poverty, politics, and age on COVID-19 vaccination rates in counties in the continental US. Both OLS regression analysis and Random Forest machine learning algorithms are applied to quantify factors for county-level vaccination hesitancy. The machine learning model considers joint effects of variables (race/ethnicity, partisanship, age, etc.) simultaneousl…

    Submitted 7 January, 2022; originally announced January 2022.

  13. Software Sustainability & High Energy Physics

    Authors: Daniel S. Katz, Sudhir Malik, Mark S. Neubauer, Graeme A. Stewart, Kétévi A. Assamagan, Erin A. Becker, Neil P. Chue Hong, Ian A. Cosden, Samuel Meehan, Edward J. W. Moyse, Adrian M. Price-Whelan, Elizabeth Sexton-Kennedy, Meirin Oan Evans, Matthew Feickert, Clemens Lange, Kilian Lieret, Rob Quick, Arturo Sánchez Pineda, Christopher Tunnell

    Abstract: New facilities of the 2020s, such as the High Luminosity Large Hadron Collider (HL-LHC), will be relevant through at least the 2030s. This means that their software efforts and those that are used to analyze their data need to consider sustainability to enable their adaptability to new challenges, longevity, and efficiency, over at least this period. This will help ensure that this software will b…

    Submitted 16 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: A report from the "Sustainable Software in HEP" IRIS-HEP blueprint workshop: https://indico.cern.ch/event/930127/

  14. Comparing Pedestrian Navigation Methods in Virtual Reality and Real Life

    Authors: Gian-Luca Savino, Niklas Emanuel, Steven Kowalzik, Felix A. Kroll, Marvin C. Lange, Matthis Laudan, Rieke Leder, Zhanhua Liang, Dayana Markhabayeva, Martin Schmeißer, Nicolai Schütz, Carolin Stellmacher, Zihe Xu, Kerstin Bub, Thorsten Kluss, Jaime Maldonado, Ernst Kruijff, Johannes Schöning

    Abstract: Mobile navigation apps are among the most used mobile applications and are often used as a baseline to evaluate new mobile navigation technologies in field studies. As field studies often introduce external factors that are hard to control for, we investigate how pedestrian navigation methods can be evaluated in virtual reality (VR). We present a study comparing navigation methods in real life (RL…

    Submitted 6 October, 2020; originally announced October 2020.

  15. arXiv:2007.07060  [pdf, other]

    cs.IR cs.DB

    Template-Based Question Answering over Linked Geospatial Data

    Authors: Dharmen Punjani, Markos Iliakis, Theodoros Stefou, Kuldeep Singh, Andreas Both, Manolis Koubarakis, Iosif Angelidis, Konstantina Bereta, Themis Beris, Dimitris Bilidas, Theofilos Ioannidis, Nikolaos Karalis, Christoph Lange, Despina-Athanasia Pantazi, Christos Papaloukas, Georgios Stamoulis

    Abstract: Large amounts of geospatial data have been made available recently on the linked open data cloud and the portals of many national cartographic agencies (e.g., OpenStreetMap data, administrative geographies of various countries, or land cover/land use data sets). These datasets use various geospatial vocabularies and can be queried using SPARQL or its OGC-standardized extension GeoSPARQL. In this p…

    Submitted 29 April, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 27 pages, 2 figures

  16. arXiv:1909.04169  [pdf, other]

    q-bio.QM cs.LG q-bio.GN

    OncoNetExplainer: Explainable Predictions of Cancer Types Based on Gene Expression Data

    Authors: Md. Rezaul Karim, Michael Cochez, Oya Beyan, Stefan Decker, Christoph Lange

    Abstract: The discovery of important biomarkers is a significant step towards understanding the molecular mechanisms of carcinogenesis; enabling accurate diagnosis for, and prognosis of, a certain cancer type. Before recommending any diagnosis, genomics data such as gene expressions (GE) and clinical outcomes need to be analyzed. However, complex nature, high dimensionality, and heterogeneity in genomics dat…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: In proc. of 19th IEEE International Conference on Bioinformatics and Bioengineering (IEEE BIBE 2019)

    Journal ref: IEEE International Conference on Bioinformatics and Bioengineering (IEEE BIBE 2019)

  17. arXiv:1812.01027  [pdf]

    cs.IR cs.DL

    Automatically Annotating Articles Towards Opening and Reusing Transparent Peer Reviews

    Authors: Afshin Sadeghi, Sarven Capadisli, Johannes Wilm, Christoph Lange, Philipp Mayr

    Abstract: An increasing number of scientific publications are created in open and transparent peer review models: a submission is published first, and then reviewers are invited, or a submission is reviewed in a closed environment but then these reviews are published with the final article, or combinations of these. Reasons for open peer review include giving better credit to reviewers and enabling readers…

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: Submitted for review to "Publications"

  18. arXiv:1807.06816  [pdf, other]

    cs.DL physics.soc-ph

    Unveiling Scholarly Communities over Knowledge Graphs

    Authors: Sahar Vahdati, Guillermo Palma, Rahul Jyoti Nath, Christoph Lange, Sören Auer, Maria-Esther Vidal

    Abstract: Knowledge graphs represent the meaning of properties of real-world entities and relationships among them in a natural way. Exploiting semantics encoded in knowledge graphs enables the implementation of knowledge-driven tasks such as semantic retrieval, query processing, and question answering, as well as solutions to knowledge discovery tasks including pattern discovery and link prediction. In thi…

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: 12 pages. Paper accepted in the 22nd International Conference on Theory and Practice of Digital Libraries, 2018

  19. arXiv:1804.07708  [pdf, other]

    cs.DL

    A Survey of User Expectations and Tool Limitations in Collaborative Scientific Authoring and Reviewing

    Authors: Afshin Sadeghi, Mahdi Jaberzadeh Ansari, Johannes Wilm, Christoph Lange

    Abstract: Collaborative scientific authoring is increasingly being supported by software tools. Traditionally, desktop-based authoring tools had the most advanced editing features, allowed for more formatting options, and included more import/export filters. Web-based tools have excelled in their collaboration support. Recently, developers on both sides have been trying to close this gap by extending deskto…

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: This is the version of the article before review and it is currently under review. This version is subject to updates

  20. arXiv:1711.04548  [pdf, other]

    cs.DL

    Towards a Cloud-Based Service for Maintaining and Analyzing Data About Scientific Events

    Authors: Andreas Behrend, Sahar Vahdati, Christoph Lange, Christiane Engels

    Abstract: We propose the new cloud-based service OpenResearch for managing and analyzing data about scientific events such as conferences and workshops in a persistent and reliable way. This includes data about scientific articles, participants, acceptance rates, submission numbers, impact values as well as organizational details such as program committees, chairs, fees and sponsors. OpenResearch is a centr…

    Submitted 28 November, 2017; v1 submitted 13 November, 2017; originally announced November 2017.

    Comments: A completed version of this paper had been accepted in SAVE-SD workshop 2017 at WWW conference

  21. arXiv:1706.04185  [pdf]

    cs.DL

    Decentralized creation of academic documents using a Network Attached Storage (NAS) server

    Authors: Johannes Wilm, Afshin Sadeghi, Christoph Lange, Philipp Mayr

    Abstract: Scholarly document creation continues to face various obstacles. Scholarly text production requires more complex word processors than other forms of texts because of the complex structures of citations, formulas and figures. The need for peer review, often single-blind or double-blind, creates needs for document management that other texts do not require. Additionally, the need for collaborative e…

    Submitted 11 June, 2017; originally announced June 2017.

    Comments: 15 pages, paper presented at the Enabling Decentralised Scholarly Communication workshop co-located with the Extended Semantic Web Conference (ESWC 2017)

  22. arXiv:1703.04428  [pdf]

    cs.DL

    Opening Scholarly Communication in Social Sciences by Connecting Collaborative Authoring to Peer Review

    Authors: Afshin Sadeghi, Johannes Wilm, Philipp Mayr, Christoph Lange

    Abstract: The objective of the OSCOSS research project on "Opening Scholarly Communication in the Social Sciences" is to build a coherent collaboration environment that facilitates scholarly communication workflows of social scientists in the roles of authors, reviewers, editors and readers. This paper presents the implementation of the core of this environment: the integration of the Fidus Writer academic…

    Submitted 13 March, 2017; originally announced March 2017.

    Comments: 11 pages, 2 figures, submitted to the journal IWP

  23. arXiv:1611.04760  [pdf]

    cs.DL cs.CY

    The Opening Scholarly Communication in Social Sciences project OSCOSS

    Authors: Philipp Mayr, Christoph Lange

    Abstract: The OSCOSS project (Opening Scholarly Communication in Social Sciences), which will be outlined, aims at providing integrated support for all steps of the scholarly communication process. Incl. collaborative writing of a scientific paper, collecting data related to existing publications, interpreting and including data in a paper, submitting the paper for peer review, reviewing the paper, publishi…

    Submitted 17 October, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

    Comments: 9 pages, 1 figure, Book chapter in the Festschrift for Konrad Umlauf. "Bibliothek. Forschung für die Praxis"

  24. arXiv:1611.01820  [pdf, other]

    cs.DL

    A Semi-Automatic Approach for Detecting Dataset References in Social Science Texts

    Authors: Behnam Ghavimi, Philipp Mayr, Christoph Lange, Sahar Vahdati, Sören Auer

    Abstract: Today, full-texts of scientific articles are often stored in different locations than the used datasets. Dataset registries aim at a closer integration by making datasets citable but authors typically refer to datasets using inconsistent abbreviations and heterogeneous metadata (e.g. title, publication year). It is thus hard to reproduce research results, to access datasets for further analysis, a…

    Submitted 6 November, 2016; originally announced November 2016.

    Comments: Pre-print IS&U journal. arXiv admin note: substantial text overlap with arXiv:1603.01774

  25. arXiv:1608.02800  [pdf, other]

    cs.PF cs.DB

    LITMUS: An Open Extensible Framework for Benchmarking RDF Data Management Solutions

    Authors: Harsh Thakkar, Mohnish Dubey, Gezim Sejdiu, Axel-Cyrille Ngonga Ngomo, Jeremy Debattista, Christoph Lange, Jens Lehmann, Sören Auer, Maria-Esther Vidal

    Abstract: Developments in the context of Open, Big, and Linked Data have led to an enormous growth of structured data on the Web. To keep up with the pace of efficient consumption and management of the data at this rate, many data management solutions have been developed for specific tasks and applications. We present LITMUS, a framework for benchmarking data management solutions. LITMUS goes beyond classic…

    Submitted 9 August, 2016; originally announced August 2016.

    Comments: 8 pages, 1 figure, position paper

  26. An Introduction to Mechanized Reasoning

    Authors: Manfred Kerber, Christoph Lange, Colin Rowat

    Abstract: Mechanized reasoning uses computers to verify proofs and to help discover new theorems. Computer scientists have applied mechanized reasoning to economic problems but -- to date -- this work has not yet been properly presented in economics journals. We introduce mechanized reasoning to economists in three ways. First, we introduce mechanized reasoning in general, describing both the techniques and…

    Submitted 10 August, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

    MSC Class: 62P20; 91B26; 91B14; 68T15; 03B15; 03B10 ACM Class: F.4.1; I.2.3; J.4

    Journal ref: Mathematical Economics 66, pp. 26-39. Elsevier, October 2016

  27. arXiv:1603.01774  [pdf, other]

    cs.DL cs.IR

    Identifying and Improving Dataset References in Social Sciences Full Texts

    Authors: Behnam Ghavimi, Philipp Mayr, Sahar Vahdati, Christoph Lange

    Abstract: Scientific full text papers are usually stored in separate places than their underlying research datasets. Authors typically make references to datasets by mentioning them for example by using their titles and the year of publication. However, in most cases explicit links that would provide readers with direct access to referenced datasets are missing. Manually detecting references to datasets in…

    Submitted 29 March, 2016; v1 submitted 5 March, 2016; originally announced March 2016.

  28. arXiv:1601.03541  [pdf, ps, other]

    cs.IR

    Question Answering on Linked Data: Challenges and Future Directions

    Authors: Saeedeh Shekarpour, Denis Lukovnikov, Ashwini Jaya Kumar, Kemele Endris, Kuldeep Singh, Harsh Thakkar, Christoph Lange

    Abstract: Question Answering (QA) systems are becoming the inspiring model for the future of search engines. While recently, underlying datasets for QA systems have been promoted from unstructured datasets to structured datasets with highly semantic-enriched metadata, but still question answering systems involve serious challenges which cause to be far beyond desired expectations. In this paper, we raise th…

    Submitted 16 February, 2016; v1 submitted 14 January, 2016; originally announced January 2016.

    Comments: Submitted to Question Answering And Activity Analysis in Participatory Sites (Q4APS) 2016

  29. arXiv:1601.02927  [pdf]

    cs.DL

    Opening Scholarly Communication in Social Sciences: Supporting Open Peer Review with Fidus Writer

    Authors: Philipp Mayr, Fakhri Momeni, Christoph Lange

    Abstract: Our system will initially provide readers, authors and reviewers with an alternative, thus having the potential to gain wider acceptance and gradually replace the old, incoherent publication process of our journals and of others in related fields. It will make journals more "open" (in terms of reusability) that are open access already, and it has the potential to serve as an incentive for turning…

    Submitted 12 January, 2016; originally announced January 2016.

    Comments: 4 pages, 1 figure, poster paper accepted at the 2016 Annual EA Conference: "Innovating the Gutenberg Galaxis. The role of peer review and open access in university knowledge dissemination and evaluation"

  30. arXiv:1508.06206  [pdf, other]

    cs.DL

    Semantic Publishing Challenge - Assessing the Quality of Scientific Output by Information Extraction and Interlinking

    Authors: Angelo Di Iorio, Christoph Lange, Anastasia Dimou, Sahar Vahdati

    Abstract: The Semantic Publishing Challenge series aims at investigating novel approaches for improving scholarly publishing using Linked Data technology. In 2014 we had bootstrapped this effort with a focus on extracting information from non-semantic publications - computer science workshop proceedings volumes and their papers - to assess their quality. The objective of this second edition was to improve i…

    Submitted 25 August, 2015; originally announced August 2015.

    Comments: To appear in: E. Cabrio and M. Stankovic and M. Dragoni and A. Gangemi and R. Navigli and V. Presutti and D. Garigliotti and A. L. Gentile and A. Nuzzolese and A. Di Iorio and A. Dimou and C. Lange and S. Vahdati and A. Freitas and C. Unger and D. Reforgiato Recupero (eds.). Semantic Web Evaluation Challenges 2015. Communications in Computer and Information Science, Springer, 2015. arXiv admin note: text overlap with arXiv:1408.3863

    ACM Class: H.3.7; I.7.4; H.3.3

  31. arXiv:1506.04094  [pdf, ps, other]

    cs.IR

    The WDAqua ITN: Answering Questions using Web Data

    Authors: Christoph Lange, Saeedeh Shekarpour, Soren Auer

    Abstract: WDAqua is a Marie Curie Innovative Training Network (ITN) and is funded under EU grant number 642795 and runs from January 2015 to December 2018. WDAqua aims at advancing the state of the art by intertwining training, research and innovation efforts, centered around one service: data-driven question answering. Question answering is immediately useful to a wide audience of end users, and we will de…

    Submitted 10 June, 2015; originally announced June 2015.

  32. arXiv:1506.04006  [pdf, other]

    cs.DB cs.DL cs.PF

    Mapping Large Scale Research Metadata to Linked Data: A Performance Comparison of HBase, CSV and XML

    Authors: Sahar Vahdati, Farah Karim, Jyun-Yao Huang, Christoph Lange

    Abstract: OpenAIRE, the Open Access Infrastructure for Research in Europe, comprises a database of all EC FP7 and H2020 funded research projects, including metadata of their results (publications and datasets). These data are stored in an HBase NoSQL database, post-processed, and exposed as HTML for human consumption, and as XML through a web service interface. As an intermediate format to facilitate statis…

    Submitted 6 July, 2015; v1 submitted 12 June, 2015; originally announced June 2015.

    Comments: Accepted at the 9th Metadata and Semantics Research Conference

  33. arXiv:1504.07758  [pdf, other]

    cs.DB

    Luzzu Quality Metric Language -- A DSL for Linked Data Quality Assessment

    Authors: Jeremy Debattista, Christoph Lange, Sören Auer

    Abstract: The steadily growing number of linked open datasets brought about a number of reservations amongst data consumers with regard to the datasets' quality. Quality assessment requires significant effort and consideration, including the definition of data quality metrics and a process to assess datasets based on these definitions. Luzzu is a quality assessment framework for linked data that allows doma…

    Submitted 29 April, 2015; originally announced April 2015.

    Comments: arXiv admin note: text overlap with arXiv:1412.3750

  34. arXiv:1503.05157  [pdf, other]

    cs.DB cs.DS

    Quality Assessment of Linked Datasets using Probabilistic Approximation

    Authors: Jeremy Debattista, Santiago Londoño, Christoph Lange, Sören Auer

    Abstract: With the increasing application of Linked Open Data, assessing the quality of datasets by computing quality metrics becomes an issue of crucial importance. For large and evolving datasets, an exact, deterministic computation of the quality metrics is too time consuming or expensive. We employ probabilistic techniques such as Reservoir Sampling, Bloom Filters and Clustering Coefficient estimation f…

    Submitted 17 March, 2015; originally announced March 2015.

    Comments: 15 pages, 2 figures, To appear in ESWC 2015 proceedings

  35. arXiv:1412.3750  [pdf, other]

    cs.DB cs.SE

    Luzzu - A Framework for Linked Data Quality Assessment

    Authors: Jeremy Debattista, Christoph Lange, Sören Auer

    Abstract: With the increasing adoption and growth of the Linked Open Data cloud [9], with RDFa, Microformats and other ways of embedding data into ordinary Web pages, and with initiatives such as schema.org, the Web is currently being complemented with a Web of Data. Thus, the Web of Data shares many characteristics with the original Web of Documents, which also varies in quality. This heterogeneity makes i…

    Submitted 7 January, 2016; v1 submitted 11 December, 2014; originally announced December 2014.

  36. OpenCourseWare Observatory -- Does the Quality of OpenCourseWare Live up to its Promise?

    Authors: Sahar Vahdati, Christoph Lange, Sören Auer

    Abstract: A vast amount of OpenCourseWare (OCW) is meanwhile being published online to make educational content accessible to larger audiences. The awareness of such courses among users and the popularity of systems providing such courses are increasing. However, from a subjective experience, OCW is frequently cursory, outdated or non-reusable. In order to obtain a better understanding of the quality of OCW…

    Submitted 14 April, 2015; v1 submitted 21 October, 2014; originally announced October 2014.

    Comments: A later version of this paper was presented in the proceedings of the Fifth International Conference on Learning Analytics And Knowledge (2015), pages 73-82. http://dl.acm.org/citation.cfm?id=2723605 On Zenodo: https://zenodo.org/deposit/21264/

    ACM Class: K.3

  37. arXiv:1408.3863  [pdf, other]

    cs.DL cs.IR

    Semantic Publishing Challenge -- Assessing the Quality of Scientific Output

    Authors: Christoph Lange, Angelo Di Iorio

    Abstract: Linked Open Datasets about scholarly publications enable the development and integration of sophisticated end-user services; however, richer datasets are still needed. The first goal of this Challenge was to investigate novel approaches to obtain such semantic data. In particular, we were seeking methods and tools to extract information from scholarly publications, to publish it as LOD, and to use…

    Submitted 20 August, 2014; v1 submitted 17 August, 2014; originally announced August 2014.

    Comments: To appear in: Valentina Presutti and Milan Stankovic and Erik Cambria and Reforgiato Recupero, Diego and Di Iorio, Angelo and Christoph Lange and Di Noia, Tommaso and Ivan Cantador (eds.). Semantic Web Evaluation Challenges 2014. Number 457 in Communications in Computer and Information Science, Springer, 2014

    ACM Class: H.3.7; I.7.4; H.3.3

  38. arXiv:1408.2468  [pdf, other]

    cs.DB cs.DL

    Representing Dataset Quality Metadata using Multi-Dimensional Views

    Authors: Jeremy Debattista, Christoph Lange, Sören Auer

    Abstract: Data quality is commonly defined as fitness for use. The problem of identifying quality of data is faced by many data consumers. Data publishers often do not have the means to identify quality problems in their data. To make the task for both stakeholders easier, we have developed the Dataset Quality Ontology (daQ). daQ is a core vocabulary for representing the results of quality benchmarking of a…

    Submitted 11 August, 2014; originally announced August 2014.

    Comments: Preprint of a paper submitted to the forthcoming SEMANTiCS 2014, 4-5 September 2014, Leipzig, Germany

    ACM Class: H.2.7

  39. arXiv:1406.0774  [pdf, other]

    cs.LO

    Set Theory or Higher Order Logic to Represent Auction Concepts in Isabelle?

    Authors: Marco B. Caminati, Manfred Kerber, Christoph Lange, Colin Rowat

    Abstract: When faced with the question of how to represent properties in a formal proof system any user has to make design decisions. We have proved three of the theorems from Maskin's 2004 survey article on Auction Theory using the Isabelle/HOL system, and we have produced verified code for combinatorial Vickrey auctions. A fundamental question in this was how to represent some basic concepts: since set th…

    Submitted 1 June, 2014; originally announced June 2014.

    Comments: Preprint of a paper accepted for the forthcoming CICM 2014 conference (cicm-conference.org/2014): S.M. Watt et al. (Eds.): CICM 2014, LNAI 8543, Springer International Publishing Switzerland 2014. 16 pages, 1 figure

    MSC Class: 03E02; 03B35 (primary); 68W05 (secondary) ACM Class: F.4.1; D.2.4; F.3.1

  40. arXiv:1308.1779  [pdf, other]

    cs.GT cs.CE cs.LO

    Proving soundness of combinatorial Vickrey auctions and generating verified executable code

    Authors: Marco B. Caminati, Manfred Kerber, Christoph Lange, Colin Rowat

    Abstract: Using mechanised reasoning we prove that combinatorial Vickrey auctions are soundly specified in that they associate a unique outcome (allocation and transfers) to any valid input (bids). Having done so, we auto-generate verified executable code from the formally defined auction. This removes a source of error in implementing the auction design. We intend to use formal methods to verify new auctio…

    Submitted 2 September, 2013; v1 submitted 8 August, 2013; originally announced August 2013.

    MSC Class: 91-04; 68T15; 03B35; 91B26; 03B70; 03B15 ACM Class: J.4; D.2.4; F.4.1; I.2.4

  41. arXiv:1303.4194  [pdf, ps, other]

    cs.CE cs.LO

    The ForMaRE Project - Formal Mathematical Reasoning in Economics

    Authors: Christoph Lange, Colin Rowat, Manfred Kerber

    Abstract: The ForMaRE project applies formal mathematical reasoning to economics. We seek to increase confidence in economics' theoretical results, to aid in discovering new results, and to foster interest in formal methods, i.e. computer-aided reasoning, within economics. To formal methods, we seek to contribute user experience feedback from new audiences, as well as new challenge problems. In the first pr…

    Submitted 18 May, 2013; v1 submitted 18 March, 2013; originally announced March 2013.

    Comments: Conference on Intelligent Computer Mathematics, 8-12 July, Bath, UK. Published as number 7961 in Lecture Notes in Artificial Intelligence, Springer

    MSC Class: 91B26; 68T15 ACM Class: J.4; I.2.3; K.6.1

  42. arXiv:1303.4193  [pdf, ps, other]

    cs.LO cs.GT cs.MS

    A Qualitative Comparison of the Suitability of Four Theorem Provers for Basic Auction Theory

    Authors: Christoph Lange, Marco B. Caminati, Manfred Kerber, Till Mossakowski, Colin Rowat, Makarius Wenzel, Wolfgang Windsteiger

    Abstract: Novel auction schemes are constantly being designed. Their design has significant consequences for the allocation of goods and the revenues generated. But how to tell whether a new design has the desired properties, such as efficiency, i.e. allocating goods to those bidders who value them most? We say: by formal, machine-checked proofs. We investigated the suitability of the Isabelle, Theorema, Mi…

    Submitted 23 May, 2013; v1 submitted 18 March, 2013; originally announced March 2013.

    Comments: Conference on Intelligent Computer Mathematics, 8-12 July, Bath, UK. Published as number 7961 in Lecture Notes in Artificial Intelligence, Springer

    MSC Class: 68T15; 03B35; 68T35; 91B26; 03B70; 03B10; 03B15 ACM Class: I.2.3; I.2.4; F.4.1; H.1.2; J.4

  43. arXiv:1208.0293  [pdf, other]

    cs.AI cs.DL cs.LO

    The Distributed Ontology Language (DOL): Use Cases, Syntax, and Extensibility

    Authors: Christoph Lange, Till Mossakowski, Oliver Kutz, Christian Galinski, Michael Grüninger, Daniel Couto Vale

    Abstract: The Distributed Ontology Language (DOL) is currently being standardized within the OntoIOp (Ontology Integration and Interoperability) activity of ISO/TC 37/SC 3. It aims at providing a unified framework for (1) ontologies formalized in heterogeneous logics, (2) modular ontologies, (3) links between ontologies, and (4) annotation of ontologies. This paper presents the current state of DOL's standa…

    Submitted 1 August, 2012; originally announced August 2012.

    Comments: Terminology and Knowledge Engineering Conference (TKE) 2012-06-20 to 2012-06-21 Madrid, Spain

    MSC Class: 68T30; 68T35 ACM Class: I.2.4

  44. arXiv:1204.5094  [pdf, other]

    cs.MS cs.DL

    Point-and-write --- Documenting Formal Mathematics by Reference

    Authors: Carst Tankink, Christoph Lange, Josef Urban

    Abstract: This paper describes the design and implementation of mechanisms for light-weight inclusion of formal mathematics in informal mathematical writings, particularly in a Web-based setting. This is conceptually done in three stages: (i) by choosing a suitable representation layer (based on RDF) for encoding the information about available resources of formal mathematics, (ii) by exporting this informa…

    Submitted 10 July, 2012; v1 submitted 23 April, 2012; originally announced April 2012.

    Comments: Conference on Intelligent Computer Mathematics, July 8-13, Bremen, Germany. Published as number 7362 in Lecture Notes in Artificial Intelligence, Springer

    MSC Class: 68T30; 68T35; 03B35 ACM Class: F.4.1; F.4.m; H.3.5; H.5.3; H.5.4; I.2.4; I.7.1; I.7.2

  45. arXiv:1204.5093  [pdf, other]

    cs.LO

    The Distributed Ontology Language (DOL): Ontology Integration and Interoperability Applied to Mathematical Formalization

    Authors: Christoph Lange, Oliver Kutz, Till Mossakowski, Michael Grüninger

    Abstract: The Distributed Ontology Language (DOL) is currently being standardized within the OntoIOp (Ontology Integration and Interoperability) activity of ISO/TC 37/SC 3. It aims at providing a unified framework for (1) ontologies formalized in heterogeneous logics, (2) modular ontologies, (3) links between ontologies, and (4) annotation of ontologies. This paper focuses on an application of DOL's meta-…

    Submitted 23 April, 2012; originally announced April 2012.

    Comments: Conference on Intelligent Computer Mathematics, July 9-14, Bremen, Germany. Published as number 7362 in Lecture Notes in Artificial Intelligence, Springer

    MSC Class: 68T30; 03B10; 03B70; 16B50 ACM Class: F.4.1; F.4.m; H.3.5; H.5.3; H.5.4; I.2.4; I.7.1; I.7.2

  46. arXiv:1204.5086  [pdf, ps, other]

    cs.DL cs.MS

    Reimplementing the Mathematical Subject Classification (MSC) as a Linked Open Dataset

    Authors: Christoph Lange, Patrick Ion, Anastasia Dimou, Charalampos Bratsas, Joseph Corneli, Wolfram Sperber, Michael Kohlhase, Ioannis Antoniou

    Abstract: The Mathematics Subject Classification (MSC) is a widely used scheme for classifying documents in mathematics by subject. Its traditional, idiosyncratic conceptualization and representation makes the scheme hard to maintain and requires custom implementations of search, query and annotation support. This limits uptake e.g. in semantic web technologies in general and the creation and exploration of…

    Submitted 23 April, 2012; originally announced April 2012.

    Comments: Conference on Intelligent Computer Mathematics, July 9-14, Bremen, Germany. Published as number 7362 in Lecture Notes in Artificial Intelligence, Springer

    MSC Class: 68T30 ACM Class: H.3.5; I.2.4; J.2

  47. arXiv:1103.1482  [pdf, other]

    cs.DL cs.MS

    The Planetary System: Executable Science, Technology, Engineering and Math Papers

    Authors: Christoph Lange, Michael Kohlhase, Catalin David, Deyan Ginev, Andrea Kohlhase, Bogdan Matican, Stefan Mirea, Vyacheslav Zholudev

    Abstract: Executable scientific papers contain not just layouted text for reading. They contain, or link to, machine-comprehensible representations of the scientific findings or experiments they describe. Client-side players can thus enable readers to "check, manipulate and explore the result space". We have realized executable papers in the STEM domain with the Planetary system. Semantic annotations associ…

    Submitted 8 March, 2011; originally announced March 2011.

    Comments: Extended Semantic Web Conference (ESWC 2011), Demo Track. To be published in the Springer LNCS series

    MSC Class: 68T35 ACM Class: H.3.5; I.2.1; I.2.6; I.7.1; I.7.2; J.2; K.4.3

  48. arXiv:1006.4474  [pdf, other]

    cs.SE cs.AI

    sTeX+ - a System for Flexible Formalization of Linked Data

    Authors: Andrea Kohlhase, Michael Kohlhase, Christoph Lange

    Abstract: We present the sTeX+ system, a user-driven advancement of sTeX - a semantic extension of LaTeX that allows for producing high-quality PDF documents for (proof)reading and printing, as well as semantic XML/OMDoc documents for the Web or further processing. Originally sTeX had been created as an invasive, semantic frontend for authoring XML documents. Here, we used sTeX in a Software Engineering cas…

    Submitted 23 June, 2010; originally announced June 2010.

    Comments: I-SEMANTICS 2010, September 1-3, 2010, Graz, Austria

    MSC Class: 68T35; 68T30 ACM Class: H.5.3; H.5.4; I.7.2; F.4.m; H.3.5; D.2.1; D.2.7; I.2.4; K.6.3

  49. arXiv:1006.4057  [pdf, other]

    cs.DL cs.MS

    Towards OpenMath Content Dictionaries as Linked Data

    Authors: Christoph Lange

    Abstract: "The term 'Linked Data' refers to a set of best practices for publishing and connecting structured data on the web". Linked Data make the Semantic Web work practically, which means that information can be retrieved without complicated lookup mechanisms, that a lightweight semantics enables scalable reasoning, and that the decentral nature of the Web is respected. OpenMath Content Dictionaries (CDs…

    Submitted 21 June, 2010; originally announced June 2010.

    Comments: Presented at the OpenMath Workshop 2010, http://cicm2010.cnam.fr/om/

    MSC Class: 68T35; 68T30; 68W30 ACM Class: H.3.5; F.4.m; G.4; H.5.4; I.2.4; J.2; J.1

  50. arXiv:1004.5071  [pdf, other]

    cs.DL cs.AI cs.SE

    Dimensions of Formality: A Case Study for MKM in Software Engineering

    Authors: Andrea Kohlhase, Michael Kohlhase, Christoph Lange

    Abstract: We study the formalization of a collection of documents created for a Software Engineering project from an MKM perspective. We analyze how document and collection markup formats can cope with an open-ended, multi-dimensional space of primary and secondary classifications and relationships. We show that RDFa-based extensions of MKM formats, employing flexible "metadata" relationships referencing sp…

    Submitted 28 April, 2010; originally announced April 2010.

    Comments: To appear in The 9th International Conference on Mathematical Knowledge Management: MKM 2010

    MSC Class: 68T35; 68T30 ACM Class: H.5.3; H.5.4; I.7.2; F.4.m; H.3.5; D.2.7; I.2.4; K.6.3