Zum Hauptinhalt springen

Showing 1–38 of 38 results for author: Pernkopf, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09127  [pdf, other

    cs.LG

    Robustness of Explainable Artificial Intelligence in Industrial Process Modelling

    Authors: Benedikt Kantz, Clemens Staudinger, Christoph Feilmayr, Johannes Wachlmayr, Alexander Haberl, Stefan Schuster, Franz Pernkopf

    Abstract: eXplainable Artificial Intelligence (XAI) aims at providing understandable explanations of black box models. In this paper, we evaluate current XAI methods by scoring them based on ground truth simulations and sensitivity analysis. To this end, we used an Electric Arc Furnace (EAF) model to better understand the limits and robustness characteristics of XAI methods such as SHapley Additive exPlanat… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures, accepted at the ICML'24 Workshop ML4MS

  2. arXiv:2405.15514  [pdf, other

    stat.ML cs.AI cs.LG

    On the Convexity and Reliability of the Bethe Free Energy Approximation

    Authors: Harald Leisenberger, Christian Knoll, Franz Pernkopf

    Abstract: The Bethe free energy approximation provides an effective way for relaxing NP-hard problems of probabilistic inference. However, its accuracy depends on the model parameters and particularly degrades if a phase transition in the model occurs. In this work, we analyze when the Bethe approximation is reliable and how this can be verified. We argue and show by experiment that it is mostly accurate if… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the Journal of Machine Learning Research (JMLR) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2402.14781  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders

    Authors: Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz

    Abstract: Bayesian causal inference (BCI) naturally incorporates epistemic uncertainty about the true causal model into down-stream causal reasoning tasks by posterior averaging over causal models. However, this poses a tremendously hard computational problem due to the intractable number of causal structures to marginalise over. In this work, we decompose the structure learning problem into inferring (i) a… ▽ More

    Submitted 16 July, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages + references + appendices (19 pages total)

  4. Angle-Equivariant Convolutional Neural Networks for Interference Mitigation in Automotive Radar

    Authors: Christian Oswald, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: In automotive applications, frequency modulated continuous wave (FMCW) radar is an established technology to determine the distance, velocity and angle of objects in the vicinity of the vehicle. The quality of predictions might be seriously impaired if mutual interference between radar sensors occurs. Previous work processes data from the entire receiver array in parallel to increase interference… ▽ More

    Submitted 18 December, 2023; originally announced January 2024.

    Comments: 4 pages, 3 figures

    Journal ref: 2023 20th European Radar Conference (EuRAD) (pp. 135-138). IEEE

  5. arXiv:2312.09790  [pdf, other

    cs.LG eess.SP

    End-to-End Training of Neural Networks for Automotive Radar Interference Mitigation

    Authors: Christian Oswald, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: In this paper we propose a new method for training neural networks (NNs) for frequency modulated continuous wave (FMCW) radar mutual interference mitigation. Instead of training NNs to regress from interfered to clean radar signals as in previous work, we train NNs directly on object detection maps. We do so by performing a continuous relaxation of the cell-averaging constant false alarm rate (CA-… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Radar Conference (RADAR), 6 pages, 4 figures

  6. arXiv:2303.07821  [pdf, ps, other

    cs.IT eess.SP

    Self-attention for Enhanced OAMP Detection in MIMO Systems

    Authors: Alexander Fuchs, Christian Knoll, Nima N. Moghadam, Alexey Pak Jinliang Huang, Erik Leitinger, Franz Pernkopf

    Abstract: Multiple-Input Multiple-Output (MIMO) systems are essential for wireless communications. Sinceclassical algorithms for symbol detection in MIMO setups require large computational resourcesor provide poor results, data-driven algorithms are becoming more popular. Most of the proposedalgorithms, however, introduce approximations leading to degraded performance for realistic MIMOsystems. In this pape… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 8 pages, 2 figures, ICASSP 2023

    ACM Class: I.2.1; H.1.1

  7. arXiv:2206.02063  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Active Bayesian Causal Inference

    Authors: Christian Toth, Lars Lorch, Christian Knoll, Andreas Krause, Franz Pernkopf, Robert Peharz, Julius von Kügelgen

    Abstract: Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a B… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. RP & JvK are shared last authors. 10 pages + Bibliography + Appendix (34 pages total)

  8. Explainable Machine Learning for Breakdown Prediction in High Gradient RF Cavities

    Authors: Christoph Obermair, Thomas Cartier-Michaud, Andrea Apollonio, William Millar, Lukas Felsberger, Lorenz Fischl, Holger Severin Bovbjerg, Daniel Wollmann, Walter Wuensch, Nuria Catalan-Lasheras, Marçà Boronat, Franz Pernkopf, Graeme Burt

    Abstract: The occurrence of vacuum arcs or radio frequency (rf) breakdowns is one of the most prevalent factors limiting the high-gradient performance of normal conducting rf cavities in particle accelerators. In this paper, we search for the existence of previously unrecognized features related to the incidence of rf breakdowns by applying a machine learning strategy to high-gradient cavity data from CERN'… ▽ More

    Submitted 8 December, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  9. Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation

    Authors: Johanna Rock, Wolfgang Roth, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous vehicles. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Algorithms and models operating on radar data are required to run the early processing steps on specialized radar sensor hardware.… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 15 pages; published in IEEE Journal of Selected Topics in Signal Processing, Special Issue on Recent Advances in Automotive Radar Signal Processing, Volume: 15, Issue: 4, June 2021. arXiv admin note: text overlap with arXiv:2011.12706

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 4, pp. 927-940, June 2021

  10. arXiv:2110.01955  [pdf, other

    cs.LG cs.CV

    Distribution Mismatch Correction for Improved Robustness in Deep Neural Networks

    Authors: Alexander Fuchs, Christian Knoll, Franz Pernkopf

    Abstract: Deep neural networks rely heavily on normalization methods to improve their performance and learning behavior. Although normalization methods spurred the development of increasingly deep and efficient architectures, they also increase the vulnerability with respect to noise and input corruptions. In most applications, however, noise is ubiquitous and diverse; this can often lead to complete failur… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    ACM Class: I.2.0; I.4.0

  11. arXiv:2108.01991  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Lung Sound Classification Using Co-tuning and Stochastic Normalization

    Authors: Truc Nguyen, Franz Pernkopf

    Abstract: In this paper, we use pre-trained ResNet models as backbone architectures for classification of adventitious lung sounds and respiratory diseases. The knowledge of the pre-trained model is transferred by using vanilla fine-tuning, co-tuning, stochastic normalization and the combination of the co-tuning and stochastic normalization techniques. Furthermore, data augmentation in both time domain and… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Submitted to IEEE BE Transaction

  12. arXiv:2105.00929  [pdf, other

    eess.SP cs.CV

    Complex-valued Convolutional Neural Networks for Enhanced Radar Signal Denoising and Interference Mitigation

    Authors: Alexander Fuchs, Johanna Rock, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Autonomous driving highly depends on capable sensors to perceive the environment and to deliver reliable information to the vehicles' control systems. To increase its robustness, a diversified set of sensors is used, including radar sensors. Radar is a vital contribution of sensory information, providing high resolution range as well as velocity measurements. The increased use of radar sensors in… ▽ More

    Submitted 29 April, 2021; originally announced May 2021.

    Journal ref: IEEE International Radar Conference 2021

  13. arXiv:2104.14921  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolitional Neural Networks

    Authors: Truc Nguyen, Franz Pernkopf

    Abstract: Large annotated lung sound databases are publicly available and might be used to train algorithms for diagnosis systems. However, it might be a challenge to develop a well-performing algorithm for small non-public data, which have only a few subjects and show differences in recording devices and setup. In this paper, we use transfer learning to tackle the mismatch of the recording setup. This allo… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Under Review in Proceeding of EMBC 2021

  14. arXiv:2104.06666  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    End-to-end Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of end-to-end keyword spotting (KWS) models in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) operating on raw audio waveforms. After a suitable KWS model is found with NAS, we conduct quantization of weights and activations to… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.10138

  15. arXiv:2103.13443  [pdf, other

    cs.SD cs.LG eess.AS

    Blind Speech Separation and Dereverberation using Neural Beamforming

    Authors: Lukas Pfeifenberger, Franz Pernkopf

    Abstract: In this paper, we present the Blind Speech Separation and Dereverberation (BSSD) network, which performs simultaneous speaker separation, dereverberation and speaker identification in a single neural network. Speaker separation is guided by a set of predefined spatial cues. Dereverberation is performed by using neural beamforming, and speaker identification is aided by embedding vectors and triple… ▽ More

    Submitted 4 November, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: 13 pages, 9 figures

  16. arXiv:2012.10138  [pdf, other

    eess.AS cs.LG

    Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtai… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

  17. Deep Interference Mitigation and Denoising of Real-World FMCW Radar Signals

    Authors: Johanna Rock, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous cars. Key performance factors are a fine range resolution and the possibility to directly measure velocity. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Sensors must be capable of detec… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 2020 IEEE International Radar Conference (RADAR)

  18. arXiv:2011.12706  [pdf, other

    eess.SP cs.LG

    Quantized Neural Networks for Radar Interference Mitigation

    Authors: Johanna Rock, Wolfgang Roth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous vehicles. Key performance factors are weather resistance and the possibility to directly measure velocity. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Algorithms and models operating o… ▽ More

    Submitted 1 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: ITEM Workshop at ECML-PKDD 2020

  19. arXiv:2010.11773  [pdf, other

    cs.LG cs.AI stat.ML

    On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks

    Authors: Wolfgang Roth, Günther Schindler, Holger Fröning, Franz Pernkopf

    Abstract: We present two methods to reduce the complexity of Bayesian network (BN) classifiers. First, we introduce quantization-aware training using the straight-through gradient estimator to quantize the parameters of BNs to few bits. Second, we extend a recently proposed differentiable tree-augmented naive Bayes (TAN) structure learning approach by also considering the model size. Both methods are motiva… ▽ More

    Submitted 22 September, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR 2020, fixed Figure 5

  20. arXiv:2008.09566  [pdf, other

    cs.LG cs.AI stat.ML

    Differentiable TAN Structure Learning for Bayesian Network Classifiers

    Authors: Wolfgang Roth, Franz Pernkopf

    Abstract: Learning the structure of Bayesian networks is a difficult combinatorial optimization problem. In this paper, we consider learning of tree-augmented naive Bayes (TAN) structures for Bayesian network classifiers with discrete input features. Instead of performing a combinatorial optimization over the space of possible graph structures, the proposed method learns a distribution over graph structures… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted at PGM 2020

  21. arXiv:2007.11477  [pdf, other

    eess.AS cs.LG cs.SD

    Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement

    Authors: Lukas Pfeifenberger, Matthias Zöhrer, Günther Schindler, Wolfgang Roth, Holger Fröning, Franz Pernkopf

    Abstract: While machine learning techniques are traditionally resource intensive, we are currently witnessing an increased interest in hardware and energy efficient approaches. This need for resource-efficient machine learning is primarily driven by the demand for embedded systems and their usage in ubiquitous computing and IoT applications. In this article, we provide a resource-efficient approach for mult… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

  22. arXiv:2007.11465  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Wasserstein Routed Capsule Networks

    Authors: Alexander Fuchs, Franz Pernkopf

    Abstract: Capsule networks offer interesting properties and provide an alternative to today's deep neural network architectures. However, recent approaches have failed to consistently achieve competitive results across different image datasets. We propose a new parameter efficient capsule architecture, that is able to tackle complex tasks by using neural networks trained with an approximate Wasserstein obje… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 8 pages, 3 figures

    ACM Class: I.2.10

  23. arXiv:2001.03048  [pdf, other

    stat.ML cs.LG

    Resource-Efficient Neural Networks for Embedded Systems

    Authors: Wolfgang Roth, Günther Schindler, Bernhard Klein, Robert Peharz, Sebastian Tschiatschek, Holger Fröning, Franz Pernkopf, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges… ▽ More

    Submitted 7 April, 2024; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1812.02240; accepted at JMLR

  24. arXiv:1910.04536  [pdf, other

    cs.LG stat.ML

    Deep Structured Mixtures of Gaussian Processes

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf, Carl E. Rasmussen

    Abstract: Gaussian Processes (GPs) are powerful non-parametric Bayesian regression models that allow exact posterior inference, but exhibit high computational and memory costs. In order to improve scalability of GPs, approximate posterior inference is frequently employed, where a prominent class of approximation techniques is based on local GP experts. However, local-expert techniques proposed so far are ei… ▽ More

    Submitted 26 April, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: AISTATS 2020

  25. arXiv:1907.04708  [pdf, other

    cs.LG stat.ML

    Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning (Full Version)

    Authors: Bernhard K. Aichernig, Roderick Bloem, Masoud Ebrahimi, Martin Horn, Franz Pernkopf, Wolfgang Roth, Astrid Rupp, Martin Tappler, Markus Tranninger

    Abstract: Models play an essential role in the design process of cyber-physical systems. They form the basis for simulation and analysis and help in identifying design problems as early as possible. However, the construction of models that comprise physical and digital behavior is challenging. Therefore, there is considerable interest in learning such hybrid behavior by means of machine learning which requi… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: This is an extended version of the conference paper "Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning" accepted for presentation at IFIP-ICTSS 2019, the 31st International Conference on Testing Software and Systems in Paris, France

  26. arXiv:1906.10044  [pdf, other

    eess.SP cs.CV

    Complex Signal Denoising and Interference Mitigation for Automotive Radar Using Convolutional Neural Networks

    Authors: Johanna Rock, Mate Toth, Elmar Messner, Paul Meissner, Franz Pernkopf

    Abstract: Driver assistance systems as well as autonomous cars have to rely on sensors to perceive their environment. A heterogeneous set of sensors is used to perform this task robustly. Among them, radar sensors are indispensable because of their range resolution and the possibility to directly measure velocity. Since more and more radar sensors are deployed on the streets, mutual interference must be dea… ▽ More

    Submitted 25 June, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: FUSION 2019; 8 pages

  27. arXiv:1906.05180  [pdf, other

    cs.LG stat.ML

    Parameterized Structured Pruning for Deep Neural Networks

    Authors: Guenther Schindler, Wolfgang Roth, Franz Pernkopf, Holger Froening

    Abstract: As a result of the growing size of Deep Neural Networks (DNNs), the gap to hardware capabilities in terms of memory and compute increases. To effectively compress DNNs, quantization and connection pruning are usually considered. However, unconstrained pruning usually leads to unstructured parallelism, which maps poorly to massively parallel processors, and substantially reduces the efficiency of g… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

  28. arXiv:1905.10884  [pdf, other

    cs.LG stat.ML

    Bayesian Learning of Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Hong Ge, Franz Pernkopf, Zoubin Ghahramani

    Abstract: Sum-product networks (SPNs) are flexible density estimators and have received significant attention due to their attractive inference properties. While parameter learning in SPNs is well developed, structure learning leaves something to be desired: Even though there is a plethora of SPN structure learners, most of them are somewhat ad-hoc and based on intuition rather than a clear learning princip… ▽ More

    Submitted 4 November, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019; See conference page for supplement

  29. arXiv:1905.08196  [pdf, other

    cs.LG stat.ML

    Optimisation of Overparametrized Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf

    Abstract: It seems to be a pearl of conventional wisdom that parameter learning in deep sum-product networks is surprisingly fast compared to shallow mixture models. This paper examines the effects of overparameterization in sum-product networks on the speed of parameter optimisation. Using theoretical analysis and empirical experiments, we show that deep sum-product networks exhibit an implicit acceleratio… ▽ More

    Submitted 29 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: Workshop on Tractable Probabilistic Models (TPM) at ICML 2019

  30. arXiv:1812.02240  [pdf, other

    cs.LG stat.ML

    Efficient and Robust Machine Learning for Real-World Systems

    Authors: Franz Pernkopf, Wolfgang Roth, Matthias Zoehrer, Lukas Pfeifenberger, Guenther Schindler, Holger Froening, Sebastian Tschiatschek, Robert Peharz, Matthew Mattina, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation and the vision of the Internet-of-Things fuel the interest in resource efficient approaches. These approaches require a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. On top of this, it is crucial to treat uncertainty in a consisten… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

  31. arXiv:1812.01339  [pdf, other

    stat.ML cs.LG

    Self-Guided Belief Propagation -- A Homotopy Continuation Method

    Authors: Christian Knoll, Adrian Weller, Franz Pernkopf

    Abstract: Belief propagation (BP) is a popular method for performing probabilistic inference on graphical models. In this work, we enhance BP and propose self-guided belief propagation (SBP) that incorporates the pairwise potentials only gradually. This homotopy continuation method converges to a unique solution and increases the accuracy without increasing the computational burden. We provide a formal anal… ▽ More

    Submitted 19 March, 2021; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  32. arXiv:1810.06897  [pdf, other

    cs.SD eess.AS

    Sound event detection using weakly-labeled semi-supervised data with GCRNNS, VAT and Self-Adaptive Label Refinement

    Authors: Robert Harb, Franz Pernkopf

    Abstract: In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear units and a temporal attention layer are used to predict the onset and offset of sound events in 10s long audio clips. Whereby for training only weakly-labelled… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted at DCASE 2018 Workshop for oral presentation

  33. arXiv:1809.04400  [pdf, other

    cs.LG stat.ML

    Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Carl E. Rasmussen, Franz Pernkopf

    Abstract: While Gaussian processes (GPs) are the method of choice for regression tasks, they also come with practical difficulties, as inference cost scales cubic in time and quadratic in memory. In this paper, we introduce a natural and expressive way to tackle these problems, by incorporating GPs in sum-product networks (SPNs), a recently proposed tractable probabilistic model allowing exact and efficient… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Presented at the Workshop on Tractable Probabilistic Models (TPM 2018), ICML 2018

  34. arXiv:1807.02324  [pdf, other

    cs.LG stat.ML

    Sum-Product Networks for Sequence Labeling

    Authors: Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf

    Abstract: We consider higher-order linear-chain conditional random fields (HO-LC-CRFs) for sequence modelling, and use sum-product networks (SPNs) for representing higher-order input- and output-dependent factors. SPNs are a recently introduced class of deep models for which exact and efficient inference can be performed. By combining HO-LC-CRFs with SPNs, expressive models over both the output labels and t… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

  35. arXiv:1806.00981  [pdf, other

    cs.LG cs.CR stat.ML

    Automatic Clustering of a Network Protocol with Weakly-Supervised Clustering

    Authors: Tobias Schrank, Franz Pernkopf

    Abstract: Abstraction is a fundamental part when learning behavioral models of systems. Usually the process of abstraction is manually defined by domain experts. This paper presents a method to perform automatic abstraction for network protocols. In particular a weakly supervised clustering algorithm is used to build an abstraction with a small vocabulary size for the widely used TLS protocol. To show the e… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

  36. arXiv:1710.03444  [pdf, other

    stat.ML cs.LG

    Safe Semi-Supervised Learning of Sum-Product Networks

    Authors: Martin Trapp, Tamas Madl, Robert Peharz, Franz Pernkopf, Robert Trappl

    Abstract: In several domains obtaining class annotations is expensive while at the same time unlabelled data are abundant. While most semi-supervised approaches enforce restrictive assumptions on the data distribution, recent work has managed to learn semi-supervised models in a non-restrictive regime. However, so far such approaches have only been proposed for linear models. In this work, we introduce semi… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI), 2017

  37. arXiv:1601.06180  [pdf, ps, other

    cs.AI cs.LG

    On the Latent Variable Interpretation in Sum-Product Networks

    Authors: Robert Peharz, Robert Gens, Franz Pernkopf, Pedro Domingos

    Abstract: One of the central themes in Sum-Product networks (SPNs) is the interpretation of sum nodes as marginalized latent variables (LVs). This interpretation yields an increased syntactic or semantic structure, allows the application of the EM algorithm and to efficiently perform MPE inference. In literature, the LV interpretation was justified by explicitly introducing the indicator variables correspon… ▽ More

    Submitted 28 October, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

    Comments: Revised version, accepted for publication in IEEE Transactions on Machine Intelligence and Pattern Analysis (TPAMI). Shortened and revised Section 4: Thanks to our reviewers, pointing out that Theorem 2 holds for selective SPNs. Added paragraph in Section 2.1, relating sizes of original/augmented SPNs. Fixed typos, rephrased sentences, revised references

    MSC Class: 62

  38. arXiv:1206.6431  [pdf

    cs.LG stat.ML

    Exact Maximum Margin Structure Learning of Bayesian Networks

    Authors: Robert Peharz, Franz Pernkopf

    Abstract: Recently, there has been much interest in finding globally optimal Bayesian network structures. These techniques were developed for generative scores and can not be directly extended to discriminative scores, as desired for classification. In this paper, we propose an exact method for finding network structures maximizing the probabilistic soft margin, a successfully applied discriminative score.… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: ICML