Search | arXiv e-print repository

Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers

Authors: Saahil Islam, Venkatesh N. Murthy, Dominik Neumann, Badhan Kumar Das, Puneet Sharma, Andreas Maier, Dorin Comaniciu, Florin C. Ghesu

Abstract: Accurate detection and tracking of devices such as guiding catheters in live X-ray image acquisitions is an essential prerequisite for endovascular cardiac interventions. This information is leveraged for procedural guidance, e.g., directing stent placements. To ensure procedural safety and efficacy, there is a need for high robustness, with no failures during tracking. To achieve that, one needs to efficiently tackle challenges such as device obscuration by contrast agent or other external devices or wires, changes in field-of-view or acquisition angle, as well as the continuous movement due to cardiac and respiratory motion. To overcome the aforementioned challenges, we propose a novel approach to learn spatio-temporal features from a very large data cohort of over 16 million interventional X-ray frames using self-supervision for image sequence data. Our approach is based on a masked image modeling technique that leverages frame interpolation based reconstruction to learn fine inter-frame temporal correspondences. The features encoded in the resulting model are fine-tuned downstream. Our approach achieves state-of-the-art performance and in particular robustness compared to ultra optimized reference solutions (that use multi-stage feature fusion, multi-task and flow regularization). The experiments show that our method achieves a 66.31% reduction in maximum tracking error against reference solutions (23.20% when flow regularization is used), while achieving a success score of 97.95% at a 3x faster inference speed of 42 frames-per-second (on GPU).
The results encourage the use of our approach in various other tasks within interventional image analytics that require effective understanding of spatio-temporal semantics.

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2312.03751 [pdf, other]

Which linguistic cues make people fall for fake news? A comparison of cognitive and affective processing

Authors: Bernhard Lutz, Marc Adam, Stefan Feuerriegel, Nicolas Pröllochs, Dirk Neumann

Abstract: Fake news on social media has large, negative implications for society. However, little is known about what linguistic cues make people fall for fake news and, hence, how to design effective countermeasures for social media. In this study, we seek to understand which linguistic cues make people fall for fake news. Linguistic cues (e.g., adverbs, personal pronouns, positive emotion words, negative emotion words) are important characteristics of any text and also affect how people process real vs. fake news. Specifically, we compare the role of linguistic cues across both cognitive processing (related to careful thinking) and affective processing (related to unconscious automatic evaluations). To this end, we performed a within-subject experiment where we collected neurophysiological measurements of 42 subjects while they read a sample of 40 real and fake news articles. During our experiment, we measured cognitive processing through eye fixations, and affective processing in situ through heart rate variability. We find that users engage more in cognitive processing for longer fake news articles, while affective processing is more pronounced for fake news written in analytic words. To the best of our knowledge, this is the first work studying the role of linguistic cues in fake news processing. Altogether, our findings have important implications for designing online platforms that encourage users to engage in careful thinking and thus prevent them from falling for fake news.

Submitted 2 December, 2023; originally announced December 2023.

arXiv:2308.08560 [pdf]

3D Analytics: Opportunities and Guidelines for Information Systems Research

Authors: Gunther Gust, Tobias Brandt, Otto Koppius, Markus Rosenfelder, Dirk Neumann

Abstract: Progress in sensor technologies has made three-dimensional (3D) representations of the physical world available at a large scale. Leveraging such 3D representations with analytics has the potential to advance Information Systems (IS) research in several areas. However, this novel data type has rarely been incorporated. To address this shortcoming, this article first presents two showcases of 3D analytics applications together with general modeling guidelines for 3D analytics, in order to support IS researchers in implementing research designs with 3D components. Second, the article presents several promising opportunities for 3D analytics to advance behavioral and design-oriented IS research in several contextual areas, such as healthcare IS, human-computer interaction, mobile commerce, energy informatics and others. Third, we investigate the nature of the benefits resulting from the application of 3D analytics, resulting in a list of common tasks of research projects that 3D analytics can support, regardless of the contextual application area. Based on the given showcases, modeling guidelines, research opportunities and task-related benefits, we encourage IS researchers to start their journey into this largely unexplored third spatial dimension.

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2303.04689 [pdf, other]

doi 10.1145/3634686

A Privacy Preserving System for Movie Recommendations Using Federated Learning

Authors: David Neumann, Andreas Lutz, Karsten Müller, Wojciech Samek

Abstract: Recommender systems have become ubiquitous in the past years. They solve the tyranny of choice problem faced by many users, and are utilized by many online businesses to drive engagement and sales. Besides other criticisms, like creating filter bubbles within social networks, recommender systems are often reproved for collecting considerable amounts of personal data. However, to personalize recommendations, personal information is fundamentally required. A recent distributed learning scheme called federated learning has made it possible to learn from personal user data without its central collection. Consequently, we present a recommender system for movie recommendations, which provides privacy and thus trustworthiness on multiple levels: First and foremost, it is trained using federated learning and thus, by its very nature, privacy-preserving, while still enabling users to benefit from global insights. Furthermore, a novel federated learning scheme, called FedQ, is employed, which not only addresses the problem of non-i.i.d.-ness and small local datasets, but also prevents input data reconstruction attacks by aggregating client updates early. Finally, to reduce the communication overhead, compression is applied, which significantly compresses the exchanged neural network parametrizations to a fraction of their original size. We conjecture that this may also improve data privacy through its lossy quantization stage.

Submitted 16 May, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: Accepted for publication in the ACM Transactions on Recommender Systems (TORS) Special Issue on Trustworthy Recommender Systems

arXiv:2201.01283 [pdf, other]

Self-supervised Learning from 100 Million Medical Images

Authors: Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo, Dominik Neumann, Pragneshkumar Patel, R. S. Vishwanath, James M. Balter, Yue Cao, Sasa Grbic, Dorin Comaniciu

Abstract: Building accurate and robust artificial intelligence systems for medical image assessment requires not only the research and design of advanced deep learning models but also the creation of large and curated sets of annotated training examples. Constructing such datasets, however, is often very costly -- due to the complex nature of annotation tasks and the high level of expertise required for the interpretation of medical images (e.g., expert radiologists). To counter this limitation, we propose a method for self-supervised learning of rich image features based on contrastive learning and online feature clustering. For this purpose we leverage large training datasets of over 100,000,000 medical images of various modalities, including radiography, computed tomography (CT), magnetic resonance (MR) imaging and ultrasonography. We propose to use these features to guide model training in supervised and hybrid self-supervised/supervised regime on various downstream tasks.
We highlight a number of advantages of this strategy on challenging image assessment problems in radiography, CT and MR: 1) Significant increase in accuracy compared to the state-of-the-art (e.g., AUC boost of 3-7% for detection of abnormalities from chest radiography scans and hemorrhage detection on brain CT); 2) Acceleration of model convergence during training by up to 85% compared to using no pretraining (e.g., 83% when training a model for detection of brain metastases in MR scans); 3) Increase in robustness to various image augmentations, such as intensity variations, rotations or scaling reflective of data variation seen in the field.

Submitted 4 January, 2022; originally announced January 2022.

arXiv:2106.13200 [pdf, other]

Software for Dataset-wide XAI: From Local Explanations to Global Insights with Zennit, CoRelAy, and ViRelAy

Authors: Christopher J. Anders, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

Abstract: Deep Neural Networks (DNNs) are known to be strong predictors, but their prediction strategies can rarely be understood. With recent advances in Explainable Artificial Intelligence (XAI), approaches are available to explore the reasoning behind those complex models' predictions. Among post-hoc attribution methods, Layer-wise Relevance Propagation (LRP) shows high performance. For deeper quantitative analysis, manual approaches exist, but without the right tools they are unnecessarily labor intensive. In this software paper, we introduce three software packages targeted at scientists to explore model reasoning using attribution approaches and beyond: (1) Zennit - a highly customizable and intuitive attribution framework implementing LRP and related approaches in PyTorch, (2) CoRelAy - a framework to easily and quickly construct quantitative analysis pipelines for dataset-wide analyses of explanations, and (3) ViRelAy - a web-application to interactively explore data, attributions, and analysis results. With this, we provide a standardized implementation solution for XAI, to contribute towards more reproducibility in our field.

Submitted 28 February, 2023; v1 submitted 24 June, 2021; originally announced June 2021.

Comments: 20 pages, 6 figures, 2 listings, 1 table

arXiv:2012.03690 [pdf, other]

An Enriched Automated PV Registry: Combining Image Recognition and 3D Building Data

Authors: Benjamin Rausch, Kevin Mayer, Marie-Louise Arlt, Gunther Gust, Philipp Staudt, Christof Weinhardt, Dirk Neumann, Ram Rajagopal

Abstract: While photovoltaic (PV) systems are installed at an unprecedented rate, reliable information on an installation level remains scarce. As a result, automatically created PV registries are a timely contribution to optimize grid planning and operations. This paper demonstrates how aerial imagery and three-dimensional building data can be combined to create an address-level PV registry, specifying area, tilt, and orientation angles. We demonstrate the benefits of this approach for PV capacity estimation. In addition, this work presents, for the first time, a comparison between automated and officially-created PV registries. Our results indicate that our enriched automated registry proves to be useful to validate, update, and complement official registries.

Submitted 7 December, 2020; originally announced December 2020.

Comments: Tackling Climate Change with Machine Learning at NeurIPS 2020 (Spotlight talk)

arXiv:2004.11841 [pdf, other]

Risk Estimation of SARS-CoV-2 Transmission from Bluetooth Low Energy Measurements

Authors: Felix Sattler, Jackie Ma, Patrick Wagner, David Neumann, Markus Wenzel, Ralf Schäfer, Wojciech Samek, Klaus-Robert Müller, Thomas Wiegand

Abstract: Digital contact tracing approaches based on Bluetooth low energy (BLE) have the potential to efficiently contain and delay outbreaks of infectious diseases such as the ongoing SARS-CoV-2 pandemic. In this work we propose a novel machine learning based approach to reliably detect subjects that have spent enough time in close proximity to be at risk of being infected. Our study is an important proof of concept that will aid the battery of epidemiological policies aiming to slow down the rapid spread of COVID-19.

Submitted 22 April, 2020; originally announced April 2020.

arXiv:1912.11425 [pdf, other]

Finding and Removing Clever Hans: Using Explanation Methods to Debug and Improve Deep Models

Authors: Christopher J. Anders, Leander Weber, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

Abstract: Contemporary learning models for computer vision are typically trained on very large (benchmark) datasets with millions of samples. These may, however, contain biases, artifacts, or errors that have gone unnoticed and are exploitable by the model. In the worst case, the trained model does not learn a valid and generalizable strategy to solve the problem it was trained for, and becomes a 'Clever-Hans' (CH) predictor that bases its decisions on spurious correlations in the training data, potentially yielding an unrepresentative or unfair, and possibly even hazardous predictor. In this paper, we contribute by providing a comprehensive analysis framework based on a scalable statistical analysis of attributions from explanation methods for large data corpora. Based on a recent technique - Spectral Relevance Analysis - we propose the following technical contributions and resulting findings: (a) a scalable quantification of artifactual and poisoned classes where the machine learning models under study exhibit CH behavior, (b) several approaches denoted as Class Artifact Compensation (ClArC), which are able to effectively and significantly reduce a model's CH behavior. I.e., we are able to un-Hans models trained on (poisoned) datasets, such as the popular ImageNet data corpus.
We demonstrate that ClArC, defined in a simple theoretical framework, may be implemented as part of a Neural Network's training or fine-tuning process, or in a post-hoc manner by injecting additional layers, preventing any further propagation of undesired CH features, into the network architecture. Using our proposed methods, we provide qualitative and quantitative analyses of the biases and artifacts in various datasets. We demonstrate that these insights can give rise to improved, more representative and fairer models operating on implicitly cleaned data corpora.

Submitted 18 December, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

Comments: 47 pages, 21 figures

arXiv:1909.05192 [pdf, other]

The Longer the Better? The Interplay Between Review Length and Line of Argumentation in Online Consumer Reviews

Authors: Bernhard Lutz, Nicolas Pröllochs, Dirk Neumann

Abstract: Review helpfulness serves as a focal point in understanding customers' purchase decision-making process on online retailer platforms. An overwhelming majority of previous works find longer reviews to be more helpful than short reviews. In this paper, we propose that longer reviews should not be assumed to be uniformly more helpful; instead, we argue that the effect depends on the line of argumentation in the review text. To test this idea, we use a large dataset of customer reviews from Amazon in combination with a state-of-the-art approach from natural language processing that allows us to study argumentation lines at sentence level. Our empirical analysis suggests that the frequency of argumentation changes moderates the effect of review length on helpfulness. Altogether, we disprove the prevailing narrative that longer reviews are uniformly perceived as more helpful. Our findings allow retailer platforms to improve their customer feedback systems and to feature more useful product reviews.

Submitted 26 September, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

Comments: arXiv admin note: text overlap with arXiv:1810.10942

arXiv:1907.11900 [pdf, other]

doi 10.1109/JSTSP.2020.2969554

DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks

Authors: Simon Wiedemann, Heiner Kirchhoffer, Stefan Matlage, Paul Haase, Arturo Marban, Talmaj Marinc, David Neumann, Tung Nguyen, Ahmed Osman, Detlev Marpe, Heiko Schwarz, Thomas Wiegand, Wojciech Samek

Abstract: The field of video compression has developed some of the most sophisticated and efficient compression algorithms known in the literature, enabling very high compressibility for little loss of information. Whilst some of these techniques are domain specific, many of their underlying principles are universal in that they can be adapted and applied for compressing different types of data. In this work we present DeepCABAC, a compression algorithm for deep neural networks that is based on one of the state-of-the-art video coding techniques. Concretely, it applies a Context-based Adaptive Binary Arithmetic Coder (CABAC) to the network's parameters, which was originally designed for the H.264/AVC video coding standard and became the state-of-the-art for lossless compression. Moreover, DeepCABAC employs a novel quantization scheme that minimizes the rate-distortion function while simultaneously taking the impact of quantization on the accuracy of the network into account. Experimental results show that DeepCABAC consistently attains higher compression rates than previously proposed coding techniques for neural network compression. For instance, it is able to compress the VGG16 ImageNet model by x63.6 with no loss of accuracy, thus being able to represent the entire network with merely 8.7MB. The source code for encoding and decoding can be found at https://github.com/fraunhoferhhi/DeepCABAC.

Submitted 27 July, 2019; originally announced July 2019.

arXiv:1905.08318 [pdf, other]

DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression

Authors: Simon Wiedemann, Heiner Kirchhoffer, Stefan Matlage, Paul Haase, Arturo Marban, Talmaj Marinc, David Neumann, Ahmed Osman, Detlev Marpe, Heiko Schwarz, Thomas Wiegand, Wojciech Samek

Abstract: We present DeepCABAC, a novel context-adaptive binary arithmetic coder for compressing deep neural networks. It quantizes each weight parameter by minimizing a weighted rate-distortion function, which implicitly takes the impact of quantization on the accuracy of the network into account. Subsequently, it compresses the quantized values into a bitstream representation with minimal redundancies. We show that DeepCABAC is able to reach very high compression ratios across a wide set of different network architectures and datasets. For instance, we are able to compress the VGG16 ImageNet model by x63.6 with no loss of accuracy, thus being able to represent the entire network with merely 8.7MB.

Submitted 15 May, 2019; originally announced May 2019.

Comments: ICML 2019, Joint Workshop on On-Device Machine Learning and Compact Deep Neural Network Representations (ODML-CDNNR)

arXiv:1901.00400 [pdf, other]

Sentence-Level Sentiment Analysis of Financial News Using Distributed Text Representations and Multi-Instance Learning

Authors: Bernhard Lutz, Nicolas Pröllochs, Dirk Neumann

Abstract: Researchers and financial professionals require robust computerized tools that allow users to rapidly operationalize and assess the semantic textual content in financial news. However, existing methods commonly work at the document-level while deeper insights into the actual structure and the sentiment of individual sentences remain blurred. As a result, investors are required to apply the utmost attention and detailed, domain-specific knowledge in order to assess the information on a fine-grained basis. To facilitate this manual process, this paper proposes the use of distributed text representations and multi-instance learning to transfer information from the document-level to the sentence-level. Compared to alternative approaches, this method features superior predictive performance while preserving context and interpretability. Our analysis of a manually-labeled dataset yields a predictive accuracy of up to 69.90%, exceeding the performance of alternative approaches by at least 3.80 percentage points. Accordingly, this study not only benefits investors with regard to their financial decision-making, but also helps companies to communicate their messages as intended.

Submitted 31 December, 2018; originally announced January 2019.

arXiv:1810.10942 [pdf, other]

Understanding the Role of Two-Sided Argumentation in Online Consumer Reviews: A Language-Based Perspective

Authors: Bernhard Lutz, Nicolas Pröllochs, Dirk Neumann

Abstract: This paper examines the effect of two-sided argumentation on the perceived helpfulness of online consumer reviews. In contrast to previous works, our analysis thereby sheds light on the reception of reviews from a language-based perspective. For this purpose, we propose an intriguing text analysis approach based on distributed text representations and multi-instance learning to operationalize the two-sidedness of argumentation in review texts. A subsequent empirical analysis using a large corpus of Amazon reviews suggests that two-sided argumentation in reviews significantly increases their helpfulness. We find this effect to be stronger for positive reviews than for negative reviews, whereas a higher degree of emotional language weakens the effect. Our findings have immediate implications for retailer platforms, which can utilize our results to optimize their customer feedback system and to present more useful product reviews.

Submitted 24 December, 2018; v1 submitted 25 October, 2018; originally announced October 2018.

arXiv:1810.08515 [pdf, ps, other]

Transfer Learning versus Multi-agent Learning regarding Distributed Decision-Making in Highway Traffic

Authors: Mark Schutera, Niklas Goby, Dirk Neumann, Markus Reischl

Abstract: Transportation and traffic are currently undergoing a rapid increase in terms of both scale and complexity. At the same time, an increasing share of traffic participants are being transformed into agents driven or supported by artificial intelligence resulting in mixed-intelligence traffic. This work explores the implications of distributed decision-making in mixed-intelligence traffic. The investigations are carried out on the basis of an online-simulated highway scenario, namely the MIT DeepTraffic simulation. In the first step traffic agents are trained by means of a deep reinforcement learning approach, being deployed inside an elitist evolutionary algorithm for hyperparameter search. The resulting architectures and training parameters are then utilized in order to either train a single autonomous traffic agent and transfer the learned weights onto a multi-agent scenario or else to conduct multi-agent learning directly. Both learning strategies are evaluated on different ratios of mixed-intelligence traffic. The strategies are assessed according to the average speed of all agents driven by artificial intelligence. Traffic patterns that provoke a reduction in traffic flow are analyzed with respect to the different strategies.

Submitted 19 October, 2018; originally announced October 2018.

Comments: Proc. of the 10th International Workshop on Agents in Traffic and Transportation (ATT 2018), co-located with ECAI/IJCAI, AAMAS and ICML 2018 conferences (FAIM 2018)

Report number: CEUR-WS.org/Vol-2129

Journal ref: CEUR Workshop Proceedings 2018

arXiv:1707.09940 [pdf, ps, other]

doi 10.1109/TSP.2018.2838577

A Bilinear Equalizer for Massive MIMO Systems

Authors: David Neumann, Thomas Wiese, Michael Joham, Wolfgang Utschick

Abstract: We present a novel approach for low-complexity equalizer design well-suited for cellular massive MIMO systems. Our design allows exploiting the channel structure in terms of covariance matrices to improve the performance in the face of pilot-contamination, while basically keeping the complexity of a matched filter. This is achieved by restricting the equalizer to functions which are bilinear in the received data signals and the observations from a training phase. The proposed design generalizes several previous approaches to equalizer design for massive MIMO. We show by asymptotic analysis that with the proposed design the achievable rate grows without bound for growing numbers of antennas even in the presence of pilot-contamination. We demonstrate with numerical results that the proposed design is competitive with more complex approaches in a practical cellular setup.

Submitted 3 May, 2018; v1 submitted 31 July, 2017; originally announced July 2017.

arXiv:1707.05674 [pdf, other]

doi 10.1109/TSP.2018.2799164

Learning the MMSE Channel Estimator

Authors: David Neumann, Thomas Wiese, Wolfgang Utschick

Abstract: We present a method for estimating conditionally Gaussian random vectors with random covariance matrices, which uses techniques from the field of machine learning. Such models are typical in communication systems, where the covariance matrix of the channel vector depends on random parameters, e.g., angles of propagation paths. If the covariance matrices exhibit certain Toeplitz and shift-invariance structures, the complexity of the MMSE channel estimator can be reduced to O(M log M) floating point operations, where M is the channel dimension. While in the absence of structure the complexity is much higher, we obtain a similarly efficient (but suboptimal) estimator by using the MMSE estimator of the structured model as a blueprint for the architecture of a neural network. This network learns the MMSE estimator for the unstructured model, but only within the given class of estimators that contains the MMSE estimator for the structured model. Numerical simulations with typical spatial channel models demonstrate the generalization properties of the chosen class of estimators to realistic channel models.

Submitted 6 February, 2018; v1 submitted 18 July, 2017; originally announced July 2017.

Comments: To appear in IEEE Transactions on Signal Processing

arXiv:1706.06996 [pdf, ps, other]

doi 10.1371/journal.pone.0209323

Statistical Inferences for Polarity Identification in Natural Language

Authors: Nicolas Pröllochs, Stefan Feuerriegel, Dirk Neumann

Abstract: Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their everyday lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes a novel method of studying the reception of granular expressions in natural language. The approach utilizes LASSO regularization as a statistical tool to extract decisive words from textual content and draw statistical inferences based on the correspondence between the occurrences of words and an exogenous response variable. Accordingly, the method immediately suggests significant implications for social sciences and Information Systems research: everyone can now identify text segments and word choices that are statistically relevant to authors or readers and, based on this knowledge, test hypotheses from behavioral research. We demonstrate the contribution of our method by examining how authors communicate subjective information through narrative materials. This allows us to answer the question of which words to choose when communicating negative information. On the other hand, we show that investors trade not only upon facts in financial disclosures but are distracted by filler words and non-informative language. Practitioners - for example those in the fields of investor communications or marketing - can exploit our insights to enhance their writings based on the true perception of word choice.

Submitted 5 April, 2018; v1 submitted 21 June, 2017; originally announced June 2017.

arXiv:1705.02895 [pdf, ps, other]

doi 10.1109/LSP.2018.2827323

Covariance Matrix Estimation in Massive MIMO

Authors: David Neumann, Michael Joham, Wolfgang Utschick

Abstract: Interference during the uplink training phase significantly deteriorates the performance of a massive MIMO system. The impact of the interference can be reduced by exploiting second order statistics of the channel vectors, e.g., to obtain minimum mean squared error estimates of the channel. In practice, the channel covariance matrices have to be estimated. The estimation of the covariance matrices is also impeded by the interference during the training phase. However, the coherence interval of the covariance matrices is larger than that of the channel vectors. This allows us to derive methods for accurate covariance matrix estimation by appropriate assignment of pilot sequences to users in consecutive channel coherence intervals.

Submitted 16 February, 2018; v1 submitted 8 May, 2017; originally announced May 2017.

Comments: submitted to IEEE Signal Processing Letters

arXiv:1704.05356 [pdf, other]

Understanding Negations in Information Processing: Learning from Replicating Human Behavior

Authors: Nicolas Pröllochs, Stefan Feuerriegel, Dirk Neumann

Abstract: Information systems experience an ever-growing volume of unstructured data, particularly in the form of textual materials. This represents a rich source of information from which one can create value for people, organizations and businesses. For instance, recommender systems can benefit from automatically understanding preferences based on user reviews or social media. However, it is difficult for computer programs to correctly infer meaning from narrative content. One major challenge is negations that invert the interpretation of words and sentences. As a remedy, this paper proposes a novel learning strategy to detect negations: we apply reinforcement learning to find a policy that replicates the human perception of negations based on an exogenous response, such as a user rating for reviews. Our method yields several benefits, as it eliminates the former need for expensive and subjective manual labeling in an intermediate stage. Moreover, the inferred policy can be used to derive statistical inferences and implications regarding how humans process and act on negations.

Submitted 18 April, 2017; originally announced April 2017.

Comments: 39 pages

arXiv:1605.00303 [pdf, other]

A Self-Taught Artificial Agent for Multi-Physics Computational Model Personalization

Authors: Dominik Neumann, Tommaso Mansi, Lucian Itu, Bogdan Georgescu, Elham Kayvanpour, Farbod Sedaghat-Hamedani, Ali Amr, Jan Haas, Hugo Katus, Benjamin Meder, Stefan Steidl, Joachim Hornegger, Dorin Comaniciu

Abstract: Personalization is the process of fitting a model to patient data, a critical step towards application of multi-physics computational models in clinical practice. Designing robust personalization algorithms is often a tedious, time-consuming, model- and data-specific process. We propose to use artificial intelligence concepts to learn this task, inspired by how human experts manually perform it. The problem is reformulated in terms of reinforcement learning. In an off-line phase, Vito, our self-taught artificial agent, learns a representative decision process model through exploration of the computational model: it learns how the model behaves under change of parameters. The agent then automatically learns an optimal strategy for on-line personalization. The algorithm is model-independent; applying it to a new model requires only adjusting few hyper-parameters of the agent and defining the observations to match. The full knowledge of the model itself is not required. Vito was tested in a synthetic scenario, showing that it could learn how to optimize cost functions generically. Then Vito was applied to the inverse problem of cardiac electrophysiology and the personalization of a whole-body circulation model. The obtained results suggested that Vito could achieve equivalent, if not better goodness of fit than standard methods, while being more robust (up to 11% higher success rates) and with faster (up to seven times) convergence rate. Our artificial intelligence approach could thus make personalization algorithms generalizable and self-adaptable to any patient and any model.

Submitted 1 May, 2016; originally announced May 2016.

Comments: Submitted to Medical Image Analysis, Elsevier

arXiv:1503.08691 [pdf, ps, other]

Channel Estimation in Massive MIMO Systems

Authors: David Neumann, Michael Joham, Wolfgang Utschick

Abstract: We introduce novel blind and semi-blind channel estimation methods for cellular time-division duplexing systems with a large number of antennas at each base station. The methods are based on the maximum a-posteriori principle given a prior for the distribution of the channel vectors and the received signals from the uplink training and data phases. Contrary to the state-of-the-art massive MIMO channel estimators which either perform linear estimation based on the pilot symbols or rely on a blind principle, the proposed semi-blind method efficiently suppresses most of the interference caused by pilot-contamination. The simulative analysis illustrates that the semi-blind estimator outperforms state-of-the-art linear and non-linear approaches to the massive MIMO channel estimation problem.

Submitted 30 March, 2015; originally announced March 2015.

Showing 1–22 of 22 results for author: Neumann, D