Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Klikowski, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10807  [pdf, other

    cs.CL cs.LG

    Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain

    Authors: Paweł Zyblewski, Jakub Klikowski, Weronika Borek-Marciniec, Paweł Ksieniewicz

    Abstract: Tabular data is considered the last unconquered castle of deep learning, yet the task of data stream classification is stated to be an equally important and demanding research area. Due to the temporal constraints, it is assumed that deep learning methods are not the optimal solution for application in this field. However, excluding the entire -- and prevalent -- group of methods seems rather rash… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures

  2. arXiv:2406.10255  [pdf, other

    cs.CL cs.SI

    WarCov -- Large multilabel and multimodal dataset from social platform

    Authors: Weronika Borek-Marciniec, Pawel Zyblewski, Jakub Klikowski, Pawel Ksieniewicz

    Abstract: In the classification tasks, from raw data acquisition to the curation of a dataset suitable for use in evaluating machine learning models, a series of steps - often associated with high costs - are necessary. In the case of Natural Language Processing, initial cleaning and conversion can be performed automatically, but obtaining labels still requires the rationalized input of human experts. As a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

  3. arXiv:2102.00266  [pdf, ps, other

    cs.CV

    Hellinger Distance Weighted Ensemble for Imbalanced Data Stream Classification

    Authors: Joanna Grzyb, Jakub Klikowski, Michał Woźniak

    Abstract: The imbalanced data classification remains a vital problem. The key is to find such methods that classify both the minority and majority class correctly. The paper presents the classifier ensemble for classifying binary, non-stationary and imbalanced data streams where the Hellinger Distance is used to prune the ensemble. The paper includes an experimental evaluation of the method based on the con… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.