Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Vandewiele, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.07832  [pdf, ps, other

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros… ▽ More

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  2. arXiv:2211.05597  [pdf, other

    cs.LG stat.ME

    Perfectly predicting ICU length of stay: too good to be true

    Authors: Sandeep Ramachandra, Gilles Vandewiele, David Vander Mijnsbrugge, Femke Ongenae, Sofie Van Hoecke

    Abstract: A paper of Alsinglawi et al was recently accepted and published in Scientific Reports. In this paper, the authors aim to predict length of stay (LOS), discretized into either long (> 7 days) or short stays (< 7 days), of lung cancer patients in an ICU department using various machine learning techniques. The authors claim to achieve perfect results with an Area Under the Receiver Operating Charact… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 3 pages, 1 figure, 2 tables

  3. arXiv:2207.07753  [pdf, other

    stat.ML cs.AI cs.LG eess.SP

    Do Not Sleep on Traditional Machine Learning: Simple and Interpretable Techniques Are Competitive to Deep Learning for Sleep Scoring

    Authors: Jeroen Van Der Donckt, Jonas Van Der Donckt, Emiel Deprost, Nicolas Vandenbussche, Michael Rademaker, Gilles Vandewiele, Sofie Van Hoecke

    Abstract: Over the last few years, research in automatic sleep scoring has mainly focused on developing increasingly complex deep learning architectures. However, recently these approaches achieved only marginal improvements, often at the expense of requiring more data and more expensive training procedures. Despite all these efforts and their satisfactory performance, automatic sleep staging solutions are… ▽ More

    Submitted 14 December, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: The first two authors contributed equally. Accepted to Biomedical Signal Processing and Control

  4. arXiv:2205.02283  [pdf, other

    cs.LG

    pyRDF2Vec: A Python Implementation and Extension of RDF2Vec

    Authors: Gilles Vandewiele, Bram Steenwinckel, Terencio Agozzino, Femke Ongenae

    Abstract: This paper introduces pyRDF2Vec, a Python software package that reimplements the well-known RDF2Vec algorithm along with several of its extensions. By making the algorithm available in the most popular data science language, and by bundling all extensions into a single place, the use of RDF2Vec is simplified for data scientists. The package is released under a MIT license and structured in such a… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  5. arXiv:2203.02424  [pdf, other

    cs.LG cs.AI stat.ML

    R-GCN: The R Could Stand for Random

    Authors: Vic Degraeve, Gilles Vandewiele, Femke Ongenae, Sofie Van Hoecke

    Abstract: The inception of the Relational Graph Convolutional Network (R-GCN) marked a milestone in the Semantic Web domain as a widely cited method that generalises end-to-end hierarchical representation learning to Knowledge Graphs (KGs). R-GCNs generate representations for nodes of interest by repeatedly aggregating parameterised, relation-specific transformations of their neighbours. However, in this pa… ▽ More

    Submitted 6 May, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

  6. arXiv:2110.07531  [pdf

    stat.ML cs.LG physics.bio-ph q-bio.BM

    Deep learning models for predicting RNA degradation via dual crowdsourcing

    Authors: Hannah K. Wayment-Steele, Wipapat Kladwang, Andrew M. Watkins, Do Soon Kim, Bojan Tunguz, Walter Reade, Maggie Demkin, Jonathan Romano, Roger Wellington-Oguri, John J. Nicol, Jiayang Gao, Kazuki Onodera, Kazuki Fujikawa, Hanfei Mao, Gilles Vandewiele, Michele Tinti, Bram Steenwinckel, Takuya Ito, Taiga Noumi, Shujun He, Keiichiro Ishi, Youhan Lee, Fatih Öztürk, Anthony Chiu, Emin Öztürk , et al. (4 additional authors not shown)

    Abstract: Messenger RNA-based medicines hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a ke… ▽ More

    Submitted 22 April, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  7. arXiv:2009.04404  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Walk Extraction Strategies for Node Embeddings with RDF2Vec in Knowledge Graphs

    Authors: Gilles Vandewiele, Bram Steenwinckel, Pieter Bonte, Michael Weyns, Heiko Paulheim, Petar Ristoski, Filip De Turck, Femke Ongenae

    Abstract: As KGs are symbolic constructs, specialized techniques have to be applied in order to make them compatible with data mining techniques. RDF2Vec is an unsupervised technique that can create task-agnostic numerical representations of the nodes in a KG by extending successful language modelling techniques. The original work proposed the Weisfeiler-Lehman (WL) kernel to improve the quality of the repr… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

  8. arXiv:2001.06296  [pdf, other

    eess.SP cs.LG stat.ML

    Overly Optimistic Prediction Results on Imbalanced Data: a Case Study of Flaws and Benefits when Applying Over-sampling

    Authors: Gilles Vandewiele, Isabelle Dehaene, György Kovács, Lucas Sterckx, Olivier Janssens, Femke Ongenae, Femke De Backere, Filip De Turck, Kristien Roelens, Johan Decruyenaere, Sofie Van Hoecke, Thomas Demeester

    Abstract: Information extracted from electrohysterography recordings could potentially prove to be an interesting additional source of information to estimate the risk on preterm birth. Recently, a large number of studies have reported near-perfect results to distinguish between recordings of patients that will deliver term or preterm using a public resource, called the Term/Preterm Electrohysterogram datab… ▽ More

    Submitted 28 November, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Journal ref: Artificial Intelligence in Medicine. 111 (2021). 101987

  9. arXiv:1910.12948  [pdf, ps, other

    cs.NE cs.LG stat.ML

    GENDIS: GENetic DIscovery of Shapelets

    Authors: Gilles Vandewiele, Femke Ongenae, Filip De Turck

    Abstract: In the time series classification domain, shapelets are small time series that are discriminative for a certain class. It has been shown that classifiers are able to achieve state-of-the-art results on a plethora of datasets by taking as input distances from the input time series to different discriminative shapelets. Additionally, these shapelets can easily be visualized and thus possess an inter… ▽ More

    Submitted 7 January, 2021; v1 submitted 13 September, 2019; originally announced October 2019.

  10. arXiv:1611.05722  [pdf, other

    stat.ML cs.LG

    GENESIM: genetic extraction of a single, interpretable model

    Authors: Gilles Vandewiele, Olivier Janssens, Femke Ongenae, Filip De Turck, Sofie Van Hoecke

    Abstract: Models obtained by decision tree induction techniques excel in being interpretable.However, they can be prone to overfitting, which results in a low predictive performance. Ensemble techniques are able to achieve a higher accuracy. However, this comes at a cost of losing interpretability of the resulting model. This makes ensemble techniques impractical in applications where decision support, inst… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems