Showing 1–12 of 12 results for author: Heutte, L

  1. arXiv:2310.14710  [pdf, other]

    stat.ML cs.AI cs.LG

    Random Forest Kernel for High-Dimension Low Sample Size Classification

    Authors: Lucca Portes Cavalheiro, Simon Bernard, Jean Paul Barddal, Laurent Heutte

    Abstract: High dimension, low sample size (HDLSS) problems are numerous among real-world applications of machine learning. From medical images to text processing, traditional machine learning algorithms are usually unsuccessful in learning the best possible concept from such data. In a previous work, we proposed a dissimilarity-based approach for multi-view classification, the Random Forest Dissimilarity (R…

    Submitted 17 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 23 pages. To be published in Statistics and Computing (accepted September 26, 2023)

    Journal ref: Stat Comput 34, 9 (2024)

  2. arXiv:2301.12873  [pdf, other]

    cs.LG cs.AI

    Approximating DTW with a convolutional neural network on EEG data

    Authors: Hugo Lerogeron, Romain Picot-Clemente, Alain Rakotomamonjy, Laurent Heutte

    Abstract: Dynamic Time Warping (DTW) is a widely used algorithm for measuring similarities between two time series. It is especially valuable in a wide variety of applications, such as clustering, anomaly detection, classification, or video segmentation, where the time series have different timescales, are irregularly sampled, or are shifted. However, it is not well suited to serve as a loss function in…

    Submitted 30 January, 2023; originally announced January 2023.

    ACM Class: I.5

  3. arXiv:2208.02397  [pdf, other]

    cs.CV cs.IR cs.LG

    Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing

    Authors: Caio da S. Dias, Alceu de S. Britto Jr., Jean P. Barddal, Laurent Heutte, Alessandro L. Koerich

    Abstract: This paper presents a deep learning approach for image retrieval and pattern spotting in digital collections of historical documents. First, a region proposal algorithm detects object candidates in the document page images. Next, deep learning models are used for feature extraction, considering two distinct variants, which provide either real-valued or binary code representations. Finally, candida…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 7 pages

  4. arXiv:2008.08920  [pdf, ps, other]

    cs.LG

    scikit-dyn2sel -- A Dynamic Selection Framework for Data Streams

    Authors: Lucca Portes Cavalheiro, Jean Paul Barddal, Alceu de Souza Britto Jr, Laurent Heutte

    Abstract: Mining data streams is a challenge per se. It must be ready to deal with an enormous amount of data and with problems not present in batch machine learning, such as concept drift. Therefore, applying a batch-designed technique, such as dynamic selection of classifiers (DCS), also presents a challenge. The dynamic characteristic of ensembles that deal with streams presents barriers to the applicatio…

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Paper introducing scikit-dyn2sel, a dynamic selection framework for data streams

  5. Random Forest for Dissimilarity-based Multi-view Learning

    Authors: Simon Bernard, Hongliu Cao, Robert Sabourin, Laurent Heutte

    Abstract: Many classification problems are naturally multi-view in the sense that their data are described through multiple heterogeneous descriptions. For such tasks, dissimilarity strategies are effective ways to make the different descriptions comparable and to easily merge them, by (i) building intermediate dissimilarity representations for each view and (ii) fusing these representations by averaging the dis…

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Published in Handbook of Pattern Recognition and Computer Vision, 2020 (preprint)

  6. arXiv:2007.02572  [pdf, other]

    cs.LG stat.ML

    A Novel Random Forest Dissimilarity Measure for Multi-View Learning

    Authors: Hongliu Cao, Simon Bernard, Robert Sabourin, Laurent Heutte

    Abstract: Multi-view learning is a learning task in which data is described by several concurrent representations. Its main challenge is most often to exploit the complementarities between these representations to help solve a classification/regression task. This is a challenge that can be met nowadays if there is a large amount of data available for learning. However, this is not necessarily true for all r…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: Accepted to ICPR 2020 (22/06/2020)

  7. arXiv:1907.09404  [pdf, other]

    cs.CV cs.LG cs.MM

    Deep Learning Approaches for Image Retrieval and Pattern Spotting in Ancient Documents

    Authors: Kelly Lais Wiggers, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich, Laurent Heutte, Luiz Eduardo Soares de Oliveira

    Abstract: This paper describes two approaches for content-based image retrieval and pattern spotting in document images using deep learning. The first approach uses a pre-trained CNN model to cope with the lack of training data, which is fine-tuned to achieve a compact yet discriminant representation of queries and image candidates. The second approach uses a Siamese Convolutional Neural Network trained on a…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: The paper is under consideration at Pattern Recognition Letters

  8. arXiv:1906.09513  [pdf]

    cs.CV

    Image Retrieval and Pattern Spotting using Siamese Neural Network

    Authors: Kelly L. Wiggers, Alceu S. Britto Jr., Laurent Heutte, Alessandro L. Koerich, Luiz S. Oliveira

    Abstract: This paper presents a novel approach for image retrieval and pattern spotting in document image collections. Manual feature engineering is avoided by learning a similarity-based representation using a Siamese Neural Network trained on a previously prepared subset of image pairs from the ImageNet dataset. The learned representation is used to provide the similarity-based feature maps used to fi…

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: Accepted for IJCNN 2019

  9. arXiv:1906.08580  [pdf, other]

    cs.CV

    Pattern Spotting in Historical Documents Using Convolutional Models

    Authors: Ignacio Úbeda, Jose M. Saavedra, Stéphane Nicolas, Caroline Petitjean, Laurent Heutte

    Abstract: Pattern spotting consists of searching in a collection of historical document images for occurrences of a graphical object using an image query. Contrary to object detection, neither prior information nor a predefined class is given about the query, so training a model of the object is not feasible. In this paper, a convolutional neural network approach is proposed to tackle this problem. We use RetinaNet…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 6 pages, 9 figures

  10. arXiv:1806.07686  [pdf, other]

    cs.CV

    Dynamic voting in multi-view learning for radiomics applications

    Authors: Hongliu Cao, Simon Bernard, Laurent Heutte, Robert Sabourin

    Abstract: Cancer diagnosis and treatment nowadays often require a personalized analysis for each patient, due to the heterogeneity among the different types of tumor and among patients. Radiomics is a recent medical imaging field that has proved promising over the past few years for achieving this personalization. However, a recent study shows that most of the state-of-the-art works in Radiomics fail…

    Submitted 26 June, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: 10 pages

  11. arXiv:1803.11241  [pdf, ps, other]

    cs.CV

    Improve the performance of transfer learning without fine-tuning using dissimilarity-based multi-view learning for breast cancer histology images

    Authors: Hongliu Cao, Simon Bernard, Laurent Heutte, Robert Sabourin

    Abstract: Breast cancer is one of the most common types of cancer and a leading cause of cancer-related death among women. In the context of the ICIAR 2018 Grand Challenge on Breast Cancer Histology Images, we compare one handcrafted feature extractor and five transfer learning feature extractors based on deep learning. We find that the deep learning networks pretrained on ImageNet have better performance than th…

    Submitted 29 March, 2018; originally announced March 2018.

  12. arXiv:1803.04460  [pdf, other]

    cs.CV

    Dissimilarity-based representation for radiomics applications

    Authors: Hongliu Cao, Simon Bernard, Laurent Heutte, Robert Sabourin

    Abstract: Radiomics is a term that refers to the analysis of the large number of quantitative tumor features extracted from medical images to find useful predictive, diagnostic or prognostic information. Many recent studies have shown that radiomics can offer a lot of useful information that physicians cannot extract from the medical images and can be associated with other information like gene or protein…

    Submitted 12 March, 2018; originally announced March 2018.

    Comments: conference, 6 pages, 2 figures