Showing 1–48 of 48 results for author: Bauckhage, C

  1. arXiv:2406.04156  [pdf, other]

    cs.CL cs.AI cs.LG

    Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness

    Authors: Lars Hillebrand, Prabhupad Pradhan, Christian Bauckhage, Rafet Sifa

    Abstract: We introduce "pointer-guided segment ordering" (SO), a novel pre-training technique aimed at enhancing the contextual understanding of paragraph-level text representations in large language models. Our methodology leverages a self-attention-driven pointer network to restore the original sequence of shuffled text segments, addressing the challenge of capturing the structural coherence and contextua…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 17 pages, 3 figures, 5 tables, accepted at ECML-PKDD 2024

  2. arXiv:2311.15679  [pdf, other]

    cs.CV

    Model-agnostic Body Part Relevance Assessment for Pedestrian Detection

    Authors: Maurice Günder, Sneha Banerjee, Rafet Sifa, Christian Bauckhage

    Abstract: Model-agnostic explanation methods for deep learning models are flexible regarding usability and availability. However, due to the fact that they can only manipulate input to see changes in output, they suffer from weak performance when used with complex model architectures. For models with large inputs as, for instance, in object detection, sampling-based methods like KernelSHAP are inefficient d…

    Submitted 1 February, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2311.03076  [pdf, other]

    cs.CV cs.AI

    SugarViT -- Multi-objective Regression of UAV Images with Vision Transformers and Deep Label Distribution Learning Demonstrated on Disease Severity Prediction in Sugar Beet

    Authors: Maurice Günder, Facundo Ramón Ispizua Yamati, Abel Andree Barreto Alcántara, Anne-Katrin Mahlein, Rafet Sifa, Christian Bauckhage

    Abstract: Remote sensing and artificial intelligence are pivotal technologies of precision agriculture nowadays. The efficient retrieval of large-scale field imagery combined with machine learning techniques shows success in various tasks like phenotyping, weeding, cropping, and disease control. This work will introduce a machine learning framework for automatized large-scale plant-specific trait annotation…

    Submitted 1 February, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: submitted to Computers and Electronics in Agriculture

  4. arXiv:2310.13526  [pdf, ps, other]

    cs.CL cs.LG

    Controlled Randomness Improves the Performance of Transformer Models

    Authors: Tobias Deußer, Cong Zhao, Wolfgang Krämer, David Leonhard, Christian Bauckhage, Rafet Sifa

    Abstract: During the pre-training step of natural language models, the main objective is to learn a general representation of the pre-training dataset, usually requiring large amounts of textual data to capture the complexity and diversity of natural language. Contrasting this, in most cases, the size of the data available to solve the specific downstream task is often dwarfed by the aforementioned pre-trai…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at ICMLA 2023, 10 pages, 2 tables

  5. arXiv:2308.07791  [pdf, other]

    cs.CL cs.AI cs.LG

    Informed Named Entity Recognition Decoding for Generative Language Models

    Authors: Tobias Deußer, Lars Hillebrand, Christian Bauckhage, Rafet Sifa

    Abstract: Ever-larger language models with ever-increasing capabilities are by now well-established text processing tools. Alas, information extraction tasks such as named entity recognition are still largely unaffected by this progress as they are primarily based on the previous generation of encoder-only transformer models. Here, we propose a simple yet effective approach, Informed Named Entity Recognitio…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 12 pages, 2 figures, 4 tables

  6. arXiv:2308.06111  [pdf, other]

    cs.CL cs.AI

    Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models

    Authors: Lars Hillebrand, Armin Berger, Tobias Deußer, Tim Dilmaghani, Mohamed Khaled, Bernd Kliem, Rüdiger Loitz, Maren Pielka, David Leonhard, Christian Bauckhage, Rafet Sifa

    Abstract: Auditing financial documents is a very tedious and time-consuming process. As of today, it can already be simplified by employing AI-based solutions to recommend relevant text passages from a report for each legal requirement of rigorous accounting standards. However, these methods need to be fine-tuned regularly, and they require abundant annotated data, which is often lacking in industrial envir…

    Submitted 14 August, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted at DocEng 2023, 4 pages, 1 figure, 2 tables

  7. arXiv:2306.15786  [pdf, other]

    cs.LG cs.AI

    An Empirical Evaluation of the Rashomon Effect in Explainable Machine Learning

    Authors: Sebastian Müller, Vanessa Toborek, Katharina Beckh, Matthias Jakobs, Christian Bauckhage, Pascal Welke

    Abstract: The Rashomon Effect describes the following phenomenon: for a given dataset there may exist many models with equally good performance but with different solution strategies. The Rashomon Effect has implications for Explainable Machine Learning, especially for the comparability of explanations. We provide a unified view on three different comparison scenarios and conduct a quantitative evaluation a…

    Submitted 29 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  8. arXiv:2211.06112  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards automating Numerical Consistency Checks in Financial Reports

    Authors: Lars Hillebrand, Tobias Deußer, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Christian Bauckhage, Rafet Sifa

    Abstract: We introduce KPI-Check, a novel system that automatically identifies and cross-checks semantically equivalent key performance indicators (KPIs), e.g. "revenue" or "total costs", in real-world German financial reports. It combines a financial named entity and relation extraction module with a BERT-based filtering and text pair classification component to extract KPIs from unstructured sentences bef…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted at BigData 2022, 10 pages, 3 figures, 5 tables

  9. KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents

    Authors: Tobias Deußer, Syed Musharraf Ali, Lars Hillebrand, Desiana Nurchalifah, Basil Jacob, Christian Bauckhage, Rafet Sifa

    Abstract: We introduce KPI-EDGAR, a novel dataset for Joint Named Entity Recognition and Relation Extraction building on financial reports uploaded to the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system, where the main objective is to extract Key Performance Indicators (KPIs) from financial documents and link them to their numerical values and other attributes. We further provide four acco…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted at ICMLA 2022, 6 pages, 5 tables

  10. arXiv:2210.01241  [pdf, other]

    cs.CL cs.LG

    Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

    Authors: Rajkumar Ramamurthy, Prithviraj Ammanabrolu, Kianté Brantley, Jack Hessel, Rafet Sifa, Christian Bauckhage, Hannaneh Hajishirzi, Yejin Choi

    Abstract: We tackle the problem of aligning pre-trained large language models (LMs) with human preferences. If we view text generation as a sequential decision-making problem, reinforcement learning (RL) appears to be a natural conceptual framework. However, using RL for LM-based generation faces empirical challenges, including training instability due to the combinatorial action space, as well as a lack of…

    Submitted 28 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: In Proceedings of ICLR 2023. Code found at https://github.com/allenai/rl4lms and Project website at https://rl4lms.apps.allenai.org/

  11. arXiv:2209.02055  [pdf, other]

    cs.LG

    Full Kullback-Leibler-Divergence Loss for Hyperparameter-free Label Distribution Learning

    Authors: Maurice Günder, Nico Piatkowski, Christian Bauckhage

    Abstract: The concept of Label Distribution Learning (LDL) is a technique to stabilize classification and regression problems with ambiguous and/or imbalanced labels. A prototypical use-case of LDL is human age estimation based on profile images. Regarding this regression problem, a so called Deep Label Distribution Learning (DLDL) method has been developed. The main idea is the joint regression of the labe…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: 8 pages, 4 figures

  12. arXiv:2209.01106  [pdf, other]

    cs.CL

    A New Aligned Simple German Corpus

    Authors: Vanessa Toborek, Moritz Busch, Malte Boßert, Christian Bauckhage, Pascal Welke

    Abstract: "Leichte Sprache", the German counterpart to Simple English, is a regulated language aiming to facilitate complex written language that would otherwise stay inaccessible to different groups of people. We present a new sentence-aligned monolingual corpus for Simple German -- German. It contains multiple document-aligned sources which we have aligned using automatic sentence-alignment methods. We ev…

    Submitted 26 May, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at ACL 2023

  13. arXiv:2208.04365  [pdf, other]

    cs.LG

    Gradient Flows for L2 Support Vector Machine Training

    Authors: Christian Bauckhage, Helen Schneider, Benjamin Wulff, Rafet Sifa

    Abstract: We explore the merits of training of support vector machines for binary classification by means of solving systems of ordinary differential equations. We thus assume a continuous time perspective on a machine learning problem which may be of interest for implementations on (re)emerging hardware platforms such as analog- or quantum computers.

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Peer-reviewed and presented as part of the workshop on Continuous Time Methods for Machine Learning at the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, 2022

  14. arXiv:2208.02140  [pdf, other]

    cs.CL cs.AI cs.LG

    KPI-BERT: A Joint Named Entity Recognition and Relation Extraction Model for Financial Reports

    Authors: Lars Hillebrand, Tobias Deußer, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Christian Bauckhage, Rafet Sifa

    Abstract: We present KPI-BERT, a system which employs novel methods of named entity recognition (NER) and relation extraction (RE) to extract and link key performance indicators (KPIs), e.g. "revenue" or "interest expenses", of companies from real-world German financial documents. Specifically, we introduce an end-to-end trainable architecture that is based on Bidirectional Encoder Representations from Tran…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at ICPR 2022, 8 pages, 1 figure, 6 tables

  15. arXiv:2206.03960  [pdf, other]

    quant-ph cs.LG

    Predict better with less training data using a QNN

    Authors: Barry D. Reese, Marek Kowalik, Christian Metzl, Christian Bauckhage, Eldar Sultanow

    Abstract: Over the past decade, machine learning revolutionized vision-based quality assessment for which convolutional neural networks (CNNs) have now become the standard. In this paper, we consider a potential next step in this development and describe a quanvolutional neural network (QNN) algorithm that efficiently maps classical image data to quantum states and allows for reliable image analysis. We pra…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: 23 pages, 15 figures

    MSC Class: 81P68; ACM Class: I.5.1

  16. arXiv:2205.11433  [pdf, other]

    cs.LG cs.AI

    Informed Pre-Training on Prior Knowledge

    Authors: Laura von Rueden, Sebastian Houben, Kostadin Cvejoski, Christian Bauckhage, Nico Piatkowski

    Abstract: When training data is scarce, the incorporation of additional prior knowledge can assist the learning process. While it is common to initialize neural networks with weights that have been pre-trained on other large data sets, pre-training on more concise forms of knowledge has rather been overlooked. In this paper, we propose a novel informed machine learning approach and suggest to pre-train on p…

    Submitted 23 May, 2022; originally announced May 2022.

  17. arXiv:2204.11133  [pdf, other]

    quant-ph cs.CV stat.ML

    Towards Bundle Adjustment for Satellite Imaging via Quantum Machine Learning

    Authors: Nico Piatkowski, Thore Gerlach, Romain Hugues, Rafet Sifa, Christian Bauckhage, Frederic Barbaresco

    Abstract: Given is a set of images, where all images show views of the same area at different points in time and from different viewpoints. The task is the alignment of all images such that relevant information, e.g., poses, changes, and terrain, can be extracted from the fused image. In this work, we focus on quantum methods for keypoint extraction and feature matching, due to the demanding computational c…

    Submitted 23 April, 2022; originally announced April 2022.

    ACM Class: C.3; I.2; I.4

  18. arXiv:2203.08815  [pdf, other]

    cs.DS cs.LG quant-ph

    QUBOs for Sorting Lists and Building Trees

    Authors: Christian Bauckhage, Thore Gerlach, Nico Piatkowski

    Abstract: We show that the fundamental tasks of sorting lists and building search trees or heaps can be modeled as quadratic unconstrained binary optimization problems (QUBOs). The idea is to understand these tasks as permutation problems and to devise QUBOs whose solutions represent appropriate permutation matrices. We discuss how to construct such QUBOs and how to solve them using Hopfield nets or adiabat…

    Submitted 15 March, 2022; originally announced March 2022.

  19. Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision

    Authors: Maurice Günder, Facundo R. Ispizua Yamati, Jana Kierdorf, Ribana Roscher, Anne-Katrin Mahlein, Christian Bauckhage

    Abstract: UAV-based image retrieval in modern agriculture enables gathering large amounts of spatially referenced crop image data. In large-scale experiments, however, UAV images suffer from containing a multitudinous amount of crops in a complex canopy architecture. Especially for the observation of temporal effects, this complicates the recognition of individual plants over several images and the extracti…

    Submitted 11 January, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: Preprint submitted to GigaScience

    Journal ref: GigaScience, Volume 11, 2022

  20. arXiv:2112.10712  [pdf, other]

    cs.CY cs.CE

    Evolutionary Hierarchical Harvest Schedule Optimization for Food Waste Prevention

    Authors: Maurice Günder, Nico Piatkowski, Laura von Rueden, Rafet Sifa, Christian Bauckhage

    Abstract: In order to avoid disadvantages of monocropping for soil and environment, it is advisable to practice intercropping of various plant species whenever possible. However, intercropping is challenging as it requires a balanced planting schedule due to individual cultivation time frames. Maintaining a continuous harvest reduces logistical costs and related greenhouse gas emissions, and contributes to…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 4 pages, AAAI-2022 Workshop AI for Agriculture and Food Systems (AIAFS)

  21. arXiv:2110.14747  [pdf, other]

    cs.IR cs.CL cs.LG

    Dynamic Review-based Recommenders

    Authors: Kostadin Cvejoski, Ramses J. Sanchez, Christian Bauckhage, Cesar Ojeda

    Abstract: Just as user preferences change with time, item reviews also reflect those same preference changes. In a nutshell, if one is to sequentially incorporate review content knowledge into recommender systems, one is naturally led to dynamical models of text. In the present work we leverage the known power of reviews to enhance rating predictions in a way that (i) respects the causality of review genera…

    Submitted 22 March, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 6 pages, Published at International Data Science Conference 2021 (iDSC21)

  22. arXiv:2104.07538  [pdf, other]

    cs.CV cs.AI

    Street-Map Based Validation of Semantic Segmentation in Autonomous Driving

    Authors: Laura von Rueden, Tim Wirtz, Fabian Hueger, Jan David Schneider, Nico Piatkowski, Christian Bauckhage

    Abstract: Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness, which motivates the thorough validation of learned models. However, current validation approaches mostly require ground truth data and are thus both cost-intensive and limited in their applicability. We propose to overcome these limitations by a model agnostic validation using a-priori knowledge…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: Final version accepted at the International Conference on Pattern Recognition (ICPR). arXiv admin note: substantial text overlap with arXiv:2011.08008

  23. arXiv:2012.13453  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum Circuit Evolution on NISQ Devices

    Authors: Lukas Franken, Bogdan Georgiev, Sascha Mücke, Moritz Wolter, Raoul Heese, Christian Bauckhage, Nico Piatkowski

    Abstract: Variational quantum circuits build the foundation for various classes of quantum algorithms. In a nutshell, the weights of a parametrized quantum circuit are varied until the empirical sampling distribution of the circuit is sufficiently close to a desired outcome. Numerical first-order methods are applied frequently to fit the parameters of the circuit, but most of the time, the circuit itself, t…

    Submitted 23 May, 2022; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: 8 pages, 7 figures. To appear in the proceedings of IEEE Congress on Evolutionary Computation (CEC) 2022

    Journal ref: 2022 IEEE Congress on Evolutionary Computation (CEC), pp. 1-8

  24. Recurrent Point Review Models

    Authors: Kostadin Cvejoski, Ramses J. Sanchez, Bogdan Georgiev, Christian Bauckhage, Cesar Ojeda

    Abstract: Deep neural network models represent the state-of-the-art methodologies for natural language processing. Here we build on top of these methodologies to incorporate temporal information and model how review data changes with time. Specifically, we use the dynamic representations of recurrent point process models, which encode the history of how business or service reviews are received in time, t…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: 8 pages, 6 figures, Published in: 2020 International Joint Conference on Neural Networks (IJCNN)

    Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, United Kingdom, 2020, pp. 1-8

  25. arXiv:2011.08272  [pdf, other]

    cs.CL cs.AI

    NLPGym -- A toolkit for evaluating RL agents on Natural Language Processing Tasks

    Authors: Rajkumar Ramamurthy, Rafet Sifa, Christian Bauckhage

    Abstract: Reinforcement learning (RL) has recently shown impressive performance in complex game AI and robotics tasks. To a large extent, this is thanks to the availability of simulated environments such as OpenAI Gym, Atari Learning Environment, or Malmo which allow agents to learn complex tasks through interaction with virtual environments. While RL is also increasingly applied to natural language process…

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: Accepted at Wordplay: When Language Meets Games Workshop @ NeurIPS 2020

  26. arXiv:2011.08008  [pdf, other]

    cs.CV cs.AI

    Towards Map-Based Validation of Semantic Segmentation Masks

    Authors: Laura von Rueden, Tim Wirtz, Fabian Hueger, Jan David Schneider, Christian Bauckhage

    Abstract: Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness. We propose to validate machine learning models for self-driving vehicles not only with given ground truth labels, but also with additional a-priori knowledge. In particular, we suggest to validate the drivable area in semantic segmentation masks using given street map data. We present first resul…

    Submitted 26 November, 2020; v1 submitted 3 November, 2020; originally announced November 2020.

  27. arXiv:2007.07320  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Syllogism with Euler Neural-Networks

    Authors: Tiansi Dong, Chengjiang Li, Christian Bauckhage, Juanzi Li, Stefan Wrobel, Armin B. Cremers

    Abstract: Traditional neural networks represent everything as a vector, and are able to approximate a subset of logical reasoning to a certain degree. As basic logic relations are better represented by topological relations between regions, we propose a novel neural network that represents everything as a ball and is able to learn topological configuration as an Euler diagram. So comes the name Euler Neural…

    Submitted 20 July, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 16 pages, 6 figures

  28. arXiv:1912.04132  [pdf, other]

    cs.LG stat.ML

    Recurrent Point Processes for Dynamic Review Models

    Authors: Kostadin Cvejoski, Ramses J. Sanchez, Bogdan Georgiev, Jannis Schuecker, Christian Bauckhage, Cesar Ojeda

    Abstract: Recent progress in recommender system research has shown the importance of including temporal representations to improve interpretability and performance. Here, we incorporate temporal representations in continuous time via recurrent point process for a dynamical model of reviews. Our goal is to characterize how changes in perception, user interest and seasonal effects affect review text.

    Submitted 15 January, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: Presented at the AAAI 2020 Workshop on Interactive and Conversational Recommendation Systems

  29. arXiv:1911.06121  [pdf, other]

    cs.CL

    Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification

    Authors: Eduardo Brito, Max Lübbering, David Biesner, Lars Patrick Hillebrand, Christian Bauckhage

    Abstract: This article briefly explains our submitted approach to the DocEng'19 competition on extractive summarization. We implemented a recurrent neural network based model that learns to classify whether an article's sentence belongs to the corresponding extractive summary or not. We bypass the lack of large annotated news corpora for extractive summarization by generating extractive summaries from abstr…

    Submitted 13 November, 2019; originally announced November 2019.

  30. arXiv:1906.09808  [pdf, ps, other]

    stat.ML cs.LG

    Recurrent Adversarial Service Times

    Authors: César Ojeda, Kostadin Cvejosky, Ramsés J. Sánchez, Jannis Schuecker, Bogdan Georgiev, Christian Bauckhage

    Abstract: Service system dynamics occur at the interplay between customer behaviour and a service provider's response. This kind of dynamics can effectively be modeled within the framework of queuing theory where customers' arrivals are described by point process models. However, these approaches are limited by parametric assumptions as to, for example, inter-event time distributions. In this paper, we addr…

    Submitted 24 June, 2019; originally announced June 2019.

  31. arXiv:1903.12394  [pdf, other]

    stat.ML cs.AI cs.LG

    Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems

    Authors: Laura von Rueden, Sebastian Mayer, Katharina Beckh, Bogdan Georgiev, Sven Giesselbach, Raoul Heese, Birgit Kirsch, Julius Pfrommer, Annika Pick, Rajkumar Ramamurthy, Michal Walczak, Jochen Garcke, Christian Bauckhage, Jannis Schuecker

    Abstract: Despite its great success, machine learning can have its limits when dealing with insufficient training data. A potential solution is the additional integration of prior knowledge into the training process which leads to the notion of informed machine learning. In this paper, we present a structured overview of various approaches in this field. We provide a definition and propose a concept for inf…

    Submitted 28 May, 2021; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: Accepted at IEEE Transactions on Knowledge and Data Engineering: https://ieeexplore.ieee.org/document/9429985

  32. arXiv:1803.04300  [pdf, other]

    cs.LG stat.ML

    Neural Conditional Gradients

    Authors: Patrick Schramowski, Christian Bauckhage, Kristian Kersting

    Abstract: The move from hand-designed to learned optimizers in machine learning has been quite successful for gradient-based and -free optimizers. When facing a constrained problem, however, maintaining feasibility typically requires a projection step, which might be computationally expensive and not differentiable. We show how the design of projection-free convex optimization algorithms can be cast as a le…

    Submitted 30 July, 2018; v1 submitted 12 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1610.05120 by other authors

  33. arXiv:1710.11395  [pdf, other]

    cs.SI physics.soc-ph

    The Slashdot Zoo: Mining a Social Network with Negative Edges

    Authors: Jérôme Kunegis, Andreas Lommatzsch, Christian Bauckhage

    Abstract: We analyse the corpus of user relationships of the Slashdot technology news site. The data was collected from the Slashdot Zoo feature where users of the website can tag other users as friends and foes, providing positive and negative endorsements. We adapt social network analysis techniques to the problem of negative edge weights. In particular, we consider signed variants of global network chara…

    Submitted 31 October, 2017; originally announced October 2017.

    Comments: 10 pages, color, accepted at WWW 2009

    ACM Class: I.2.6; H.4.0

    Journal ref: Proc. WWW 2009

  34. arXiv:1704.01046  [pdf, other]

    cs.CR cs.NE

    Using Echo State Networks for Cryptography

    Authors: Rajkumar Ramamurthy, Christian Bauckhage, Krisztian Buza, Stefan Wrobel

    Abstract: Echo state networks are simple recurrent neural networks that are easy to implement and train. Despite their simplicity, they show a form of memory and can predict or regenerate sequences of data. We make use of this property to realize a novel neural cryptography scheme. The key idea is to assume that Alice and Bob share a copy of an echo state network. If Alice trains her copy to memorize a mess…

    Submitted 4 April, 2017; originally announced April 2017.

    Comments: 8 pages, ICANN 2017

  35. arXiv:1511.01523  [pdf, other]

    physics.soc-ph cs.SI

    SGPD Volume Maximization for Community Detection

    Authors: Kasra Manshaei, Christian Bauckhage

    Abstract: In this note we briefly study the feasibility of community detection in complex networks using peripheral vertices. Our method suggests a novel direction in axiomizing the problem of clustering in graphs and complex networks by looking at the topological role each vertex plays in the community structure, regardless of the attributes. The promising strength of pseudo-peripheral vertices as a lever…

    Submitted 4 November, 2015; originally announced November 2015.

  36. Exploring Human Vision Driven Features for Pedestrian Detection

    Authors: Shanshan Zhang, Christian Bauckhage, Dominik A. Klein, Armin B. Cremers

    Abstract: Motivated by the center-surround mechanism in the human visual attention system, we propose to use average contrast maps for the challenge of pedestrian detection in street scenes due to the observation that pedestrians indeed exhibit discriminative contrast texture. Our main contributions are first to design a local, statistical multi-channel descriptor in order to incorporate both color and gradi…

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  37. arXiv:1501.04232  [pdf, other]

    cs.SI physics.soc-ph

    Maximum Entropy Models of Shortest Path and Outbreak Distributions in Networks

    Authors: Christian Bauckhage, Kristian Kersting, Fabian Hadiji

    Abstract: Properties of networks are often characterized in terms of features such as node degree distributions, average path lengths, diameters, or clustering coefficients. Here, we study shortest path length distributions. On the one hand, average as well as maximum distances can be determined therefrom; on the other hand, they are closely related to the dynamics of network spreading processes. Because of…

    Submitted 17 January, 2015; originally announced January 2015.

  38. arXiv:1410.3314  [pdf, other]

    stat.ML cs.LG

    Propagation Kernels

    Authors: Marion Neumann, Roman Garnett, Christian Bauckhage, Kristian Kersting

    Abstract: We introduce propagation kernels, a general graph-kernel framework for efficiently measuring the similarity of structured data. Propagation kernels are based on monitoring how information spreads through a set of given graphs. They leverage early-stage distributions from propagation schemes such as random walks to capture structural information encoded in node labels, attributes, and edge informat…

    Submitted 13 October, 2014; originally announced October 2014.

  39. arXiv:1410.0642  [pdf, other]

    math.NA cs.CG

    A Note on Archetypal Analysis and the Approximation of Convex Hulls

    Authors: Christian Bauckhage

    Abstract: We briefly review the basic ideas behind archetypal analysis for matrix factorization and discuss its behavior in approximating the convex hull of a data sample. We then ask how good such approximations can be and consider different cases. Understanding archetypal analysis as the problem of computing a convexity constrained low-rank approximation of the identity matrix provides estimates for arche…

    Submitted 27 September, 2014; originally announced October 2014.

  40. arXiv:1409.0104  [pdf, ps, other]

    cs.IR

    Marginalizing over the PageRank Damping Factor

    Authors: Christian Bauckhage

    Abstract: In this note, we show how to marginalize over the damping parameter of the PageRank equation so as to obtain a parameter-free version known as TotalRank. Our discussion is meant as a reference and intended to provide a guided tour towards an interesting result that has applications in information retrieval and classification.

    Submitted 30 August, 2014; originally announced September 2014.

  41. arXiv:1407.3950  [pdf]

    cs.HC

    A Comparison of Methods for Player Clustering via Behavioral Telemetry

    Authors: Anders Drachen, Christian Thurau, Rafet Sifa, Christian Bauckhage

    Abstract: The analysis of user behavior in digital games has been aided by the introduction of user telemetry in game development, which provides unprecedented access to quantitative data on user behavior from the installed game clients of the entire population of players. Player behavior telemetry datasets can be exceptionally complex, with features recorded for a varying population of users over a tempora…

    Submitted 15 July, 2014; originally announced July 2014.

    Comments: Foundations of Digital Games 2013

    MSC Class: N/A

    ACM Class: H.2.8

  42. arXiv:1406.6529  [pdf, other]

    cs.SI physics.soc-ph

    Strong Regularities in Growth and Decline of Popularity of Social Media Services

    Authors: Christian Bauckhage, Kristian Kersting

    Abstract: We analyze general trends and patterns in time series that characterize the dynamics of collective attention to social media services and Web-based businesses. Our study is based on search frequency data available from Google Trends and considers 175 different services. For each service, we collect data from 45 different countries as well as global averages. This way, we obtain more than 8,000 time…

    Submitted 25 June, 2014; originally announced June 2014.

    ACM Class: G.3; H.3.5

  43. arXiv:1402.3193  [pdf, ps, other]

    cs.IT

    Characterizations and Kullback-Leibler Divergence of Gompertz Distributions

    Authors: Christian Bauckhage

    Abstract: In this note, we characterize the Gompertz distribution in terms of extreme value distributions and point out that it implicitly models the interplay of two antagonistic growth processes. In addition, we derive a closed-form expression for the Kullback-Leibler divergence between two Gompertz distributions. Although the latter is rather easy to obtain, it seems not to have been widely reported bef…

    Submitted 13 February, 2014; originally announced February 2014.

  44. arXiv:1401.6853  [pdf, ps, other]

    cs.IT

    Computing the Kullback-Leibler Divergence between two Generalized Gamma Distributions

    Authors: Christian Bauckhage

    Abstract: We derive a closed form solution for the Kullback-Leibler divergence between two generalized gamma distributions. These notes are meant as a reference and provide a guided tour towards a result of practical interest that is rarely explicated in the literature.

    Submitted 27 January, 2014; originally announced January 2014.

  45. arXiv:1310.7114  [pdf, other]

    cs.CV

    Efficient Information Theoretic Clustering on Discrete Lattices

    Authors: Christian Bauckhage, Kristian Kersting

    Abstract: We consider the problem of clustering data that reside on discrete, low dimensional lattices. Canonical examples for this setting are found in image segmentation and key point extraction. Our solution is based on a recent approach to information theoretic clustering where clusters result from an iterative procedure that minimizes a divergence measure. We replace costly processing steps in the orig…

    Submitted 26 October, 2013; originally announced October 2013.

    Comments: This paper has been presented at the workshop LWA 2012

  46. arXiv:1310.3713  [pdf, ps, other]

    cs.IT

    Computing the Kullback-Leibler Divergence between two Weibull Distributions

    Authors: Christian Bauckhage

    Abstract: We derive a closed form solution for the Kullback-Leibler divergence between two Weibull distributions. These notes are meant as reference material and intended to provide a guided tour towards a result that is often mentioned but seldom made explicit in the literature.

    Submitted 14 October, 2013; originally announced October 2013.

  47. arXiv:1304.7984  [pdf, other]

    cs.SI cs.DL physics.soc-ph

    GeoDBLP: Geo-Tagging DBLP for Mining the Sociology of Computer Science

    Authors: Fabian Hadiji, Kristian Kersting, Christian Bauckhage, Babak Ahmadi

    Abstract: Many collective human activities have been shown to exhibit universal patterns. However, the possibility of universal patterns across timing events of researcher migration has barely been explored at global scale. Here, we show that timing events of migration within different countries exhibit remarkable similarities. Specifically, we look at the distribution governing the data of researcher migra…

    Submitted 30 April, 2013; originally announced April 2013.

  48. arXiv:1210.4919  [pdf]

    cs.LG cs.CE stat.ML

    Latent Dirichlet Allocation Uncovers Spectral Characteristics of Drought Stressed Plants

    Authors: Mirwaes Wahabzada, Kristian Kersting, Christian Bauckhage, Christoph Roemer, Agim Ballvora, Francisco Pinto, Uwe Rascher, Jens Leon, Lutz Ploemer

    Abstract: Understanding the adaptation process of plants to drought stress is essential in improving management practices, breeding strategies as well as engineering viable crops for a sustainable agriculture in the coming decades. Hyper-spectral imaging provides a particularly promising approach to gain such understanding since it allows the non-destructive discovery of spectral characteristics of plants gove…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-852-862