Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Host-Madsen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17023  [pdf, other

    cs.IT cs.LG

    Out-of-Distribution Detection using Maximum Entropy Coding

    Authors: Mojtaba Abolfazli, Mohammad Zaeri Amirani, Anders Høst-Madsen, June Zhang, Andras Bratincsak

    Abstract: Given a default distribution $P$ and a set of test data $x^M=\{x_1,x_2,\ldots,x_M\}$ this paper seeks to answer the question if it was likely that $x^M$ was generated by $P$. For discrete distributions, the definitive answer is in principle given by Kolmogorov-Martin-Löf randomness. In this paper we seek to generalize this to continuous distributions. We consider a set of statistics… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  2. arXiv:2301.03128  [pdf, ps, other

    cs.IT

    Compress-and-Forward via Multilevel Coding and Trellis Coded Quantization

    Authors: Heping Wan, Anders Host-Madsen, Aria Nosratinia

    Abstract: Compress-forward (CF) relays can improve communication rates even when the relay cannot decode the source signal. Efficient implementation of CF is a topic of contemporary interest, in part because of its potential impact on wireless technologies such as cloud-RAN. There exists a gap between the performance of CF implementations in the high spectral efficiency regime and the corresponding informat… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  3. arXiv:2206.01851  [pdf, other

    cs.LG cs.IT

    Out-of-Distribution Detection using BiGAN and MDL

    Authors: Mojtaba Abolfazli, Mohammad Zaeri Arimani, Anders Host-Madsen, June Zhang, Andras Bratincsak

    Abstract: We consider the following problem: we have a large dataset of normal data available. We are now given a new, possibly quite small, set of data, and we are to decide if these are normal data, or if they are indicating a new phenomenon. This is a novelty detection or out-of-distribution detection problem. An example is in medicine, where the normal data is for people with no known disease, and the n… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  4. arXiv:2110.00701  [pdf, other

    cs.IT

    Graph Compression with Application to Model Selection

    Authors: Mojtaba Abolfazli, Anders Host-Madsen, June Zhang, Andras Bratincsak

    Abstract: Many multivariate data such as social and biological data exhibit complex dependencies that are best characterized by graphs. Unlike sequential data, graphs are, in general, unordered structures. This means we can no longer use classic, sequential-based compression methods on these graph-based data. Therefore, it is necessary to develop new methods for graph compression. In this paper, we present… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: Submitted to IEEE Transactions on Signal Processing

  5. Graph Coding for Model Selection and Anomaly Detection in Gaussian Graphical Models

    Authors: Mojtaba Abolfazli, Anders Host-Madsen, June Zhang, Andras Bratincsak

    Abstract: A classic application of description length is for model selection with the minimum description length (MDL) principle. The focus of this paper is to extend description length for data analysis beyond simple model selection and sequences of scalars. More specifically, we extend the description length for data analysis in Gaussian graphical models. These are powerful tools to model interactions amo… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: Submitted to ISIT 2021

  6. arXiv:2009.08562  [pdf, other

    cs.IT cs.LG

    Bounds for Learning Lossless Source Coding

    Authors: Anders Host-Madsen

    Abstract: This paper asks a basic question: how much training is required to beat a universal source coder? Traditionally, there have been two types of source coders: fixed, optimum coders such as Huffman coders; and universal source coders, such as Lempel-Ziv The paper considers a third type of source coders: learned coders. These are coders that are trained on data of a particular type, and then used to e… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: Submitted to IEEE Transactions on Information Theory

  7. arXiv:1902.04699  [pdf, other

    cs.LG cs.IT stat.ML

    Differential Description Length for Hyperparameter Selection in Machine Learning

    Authors: Mojtaba Abolfazli, Anders Host-Madsen, June Zhang

    Abstract: This paper introduces a new method for model selection and more generally hyperparameter selection in machine learning. Minimum description length (MDL) is an established method for model selection, which is however not directly aimed at minimizing generalization error, which is often the primary goal in machine learning. The paper demonstrates a relationship between generalization error and a dif… ▽ More

    Submitted 22 May, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: Submitted to NeurIPS 2019

  8. arXiv:1804.02469  [pdf, other

    cs.IT

    Coding of Graphs with Application to Graph Anomaly Detection

    Authors: Anders Host-Madsen, June Zhang

    Abstract: This paper has dual aims. First is to develop practical universal coding methods for unlabeled graphs. Second is to use these for graph anomaly detection. The paper develops two coding methods for unlabeled graphs: one based on the degree distribution, the second based on the triangle distribution. It is shown that these are efficient for different types of random graphs, and on real-world graphs.… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: To be presented at ISIT'18

  9. arXiv:1710.07319  [pdf, other

    cs.LG cs.IT

    Atypicality for Heart Rate Variability Using a Pattern-Tree Weighting Method

    Authors: Elyas Sabeti, Anders Høst-Madsen

    Abstract: Heart rate variability (HRV) is a vital measure of the autonomic nervous system functionality and a key indicator of cardiovascular condition. This paper proposes a novel method, called pattern tree which is an extension of Willem's context tree to real-valued data, to investigate HRV via an atypicality framework. In a previous paper atypicality was developed as method for mining and discovery in… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 5 pages

  10. arXiv:1709.03191  [pdf, other

    eess.SP cs.IT

    Data Discovery and Anomaly Detection Using Atypicality: Signal Processing Methods

    Authors: Elyas Sabeti, Anders Høst-Madsen

    Abstract: The aim of atypicality is to extract small, rare, unusual and interesting pieces out of big data. This complements statistics about typical data to give insight into data. In order to find such "interesting" parts of data, universal approaches are required, since it is not known in advance what we are looking for. We therefore base the atypicality criterion on codelength. In a prior paper we devel… ▽ More

    Submitted 10 September, 2017; originally announced September 2017.

    Comments: 13 pages, two columns

  11. arXiv:1709.03189  [pdf, other

    cs.IT

    Data Discovery and Anomaly Detection Using Atypicality: Theory

    Authors: Anders Høst-Madsen, Elyas Sabeti, Chad Walton

    Abstract: A central question in the era of 'big data' is what to do with the enormous amount of information. One possibility is to characterize it through statistics, e.g., averages, or classify it using machine learning, in order to understand the general structure of the overall data. The perspective in this paper is the opposite, namely that most of the value in the information in some applications is in… ▽ More

    Submitted 10 September, 2017; originally announced September 2017.

    Comments: 40 pages

  12. arXiv:1706.03436  [pdf, other

    cs.IT

    Repair of Multiple Descriptions on Distributed Storage

    Authors: Anders Host-Madsen, Heechoel Yang, Minchul Kim, Jungwoo Lee

    Abstract: In multiple descriptions on distributed storage, a source is stored in a shared fashion on multiple servers. When a subset of servers are contacted, the source should be estimated with a certain maximum distortion depending on the number of servers. The problem considered in this paper is how to restore the system operation when one of the servers fail and a new server replaces it, that is, repair… ▽ More

    Submitted 9 January, 2018; v1 submitted 11 June, 2017; originally announced June 2017.

    Comments: Preliminary journal version of ISIT'18 submission. Includes formal proofs

  13. arXiv:1301.1061   

    cs.IT

    On the Minimum Energy of Sending Gaussian Multiterminal Sources over the Gaussian MAC

    Authors: Nan Jiang, Yang Yang, Anders Høst-Madsen, Zixiang Xiong

    Abstract: In this work, we investigate the minimum energy of transmitting correlated sources over the Gaussian multiple-access channel (MAC). Compared to other works on joint source-channel coding, we consider the general scenario where the source and channel bandwidths are not naturally matched. In particular, we proposed the use of hybrid digital-analog coding over to improve the transmission energy effic… ▽ More

    Submitted 18 January, 2013; v1 submitted 6 January, 2013; originally announced January 2013.

    Comments: Under revision

  14. arXiv:1207.4252  [pdf, ps, other

    cs.IT

    The Wideband Slope of Interference Channels: The Small Bandwidth Case

    Authors: Minqi Shen, Anders Høst-Madsen

    Abstract: This paper studies the low-SNR regime performance of a scalar complex K -user interference channel with Gaussian noise. The finite bandwidth case is considered, where the low-SNR regime is approached by letting the input power go to zero while bandwidth is small and fixed. We show that for all δ>0 there exists a set with non-zero measure (probability) in which the wideband slope per user satisfies… ▽ More

    Submitted 17 July, 2012; originally announced July 2012.

    Comments: submitted to Information Theory, IEEE Transactions on

  15. arXiv:1010.5661  [pdf, ps, other

    cs.IT

    The Wideband Slope of Interference Channels: The Large Bandwidth Case

    Authors: Minqi Shen, Anders Høst-Madsen

    Abstract: It is well known that minimum received energy per bit in the interference channel is -1.59dB as if there were no interference. Thus, the best way to mitigate interference is to operate the interference channel in the low-SNR regime. However, when the SNR is small but non-zero, minimum energy per bit alone does not characterize performance. Verdu introduced the wideband slope S_0 to characterize th… ▽ More

    Submitted 11 November, 2011; v1 submitted 27 October, 2010; originally announced October 2010.

  16. Deterministic Capacity of MIMO Relay Networks

    Authors: Anders Host-Madsen

    Abstract: The deterministic capacity of a relay network is the capacity of a network when relays are restricted to transmitting \emph{reliable} information, that is, (asymptotically) deterministic function of the source message. In this paper it is shown that the deterministic capacity of a number of MIMO relay networks can be found in the low power regime where $\SNR\to0$. This is accomplished through de… ▽ More

    Submitted 31 March, 2009; originally announced April 2009.

    Comments: Submitted to IEEE Transactions on Information Theory