Showing 1–9 of 9 results for author: Goethals, B

Searching in archive cs.
  1. arXiv:2305.05538  [pdf, other]

    cs.SI cs.AI cs.LG cs.NI

    Efficient pattern-based anomaly detection in a network of multivariate devices

    Authors: Len Feremans, Boris Cule, Bart Goethals

    Abstract: Many organisations manage service quality and monitor a large set of devices and servers where each entity is associated with telemetry or physical sensor data series. Recently, various methods have been proposed to detect behavioural anomalies; however, existing approaches focus on multivariate time series and ignore communication between entities. Moreover, we aim to support end-users in not only in…

    Submitted 7 May, 2023; originally announced May 2023.

  2. arXiv:2207.00350  [pdf, other]

    cs.IR

    Modelling Users with Item Metadata for Explainable and Interactive Recommendation

    Authors: Joey De Pauw, Koen Ruymbeek, Bart Goethals

    Abstract: Recommender systems are used in many different applications and contexts; however, their main goal can always be summarised as "connecting relevant content to interested users". Personalized recommendation algorithms achieve this goal by first building a profile of the user, either implicitly or explicitly, and then matching items with this profile to find relevant content. The more interpretable t…

    Submitted 8 July, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: - Correct author affiliation - Place appendix after references - Update link to source code

  3. Proximity Forest: An effective and scalable distance-based classifier for time series

    Authors: Benjamin Lucas, Ahmed Shifaz, Charlotte Pelletier, Lachlan O'Neill, Nayyar Zaidi, Bart Goethals, François Petitjean, Geoffrey I. Webb

    Abstract: Research into the classification of time series has made enormous progress in the last decade. The UCR time series archive has played a significant role in challenging and guiding the development of new learners for time series classification. The largest dataset in the UCR archive holds only 10 thousand time series, which may explain why the primary research focus has been in creating algorithms…

    Submitted 12 December, 2018; v1 submitted 31 August, 2018; originally announced August 2018.

    Comments: 30 pages, 12 figures

  4. arXiv:1704.00362  [pdf, other]

    cs.LG

    Understanding Concept Drift

    Authors: Geoffrey I. Webb, Loong Kuan Lee, François Petitjean, Bart Goethals

    Abstract: Concept drift is a major issue that greatly affects the accuracy and reliability of many real-world applications of machine learning. We argue that to tackle concept drift it is important to develop the capacity to describe and analyze it. We propose tools for this purpose, arguing for the importance of quantitative descriptions of drift in marginal distributions. We present quantitative drift ana…

    Submitted 2 April, 2017; originally announced April 2017.

  5. arXiv:cs/0206023  [pdf, ps, other]

    cs.DB cs.AI

    Relational Association Rules: getting WARMeR

    Authors: Bart Goethals, Jan Van den Bussche

    Abstract: In recent years, the problem of association rule mining in transactional data has been well studied. We propose to extend the discovery of classical association rules to the discovery of association rules of conjunctive queries in arbitrary relational data, inspired by the WARMR algorithm, developed by Dehaspe and Toivonen, that discovers association rules over a limited set of conjunctive queri…

    Submitted 15 June, 2002; originally announced June 2002.

    ACM Class: H.2.8

  6. arXiv:cs/0206004  [pdf, ps, other]

    cs.DB cs.AI

    Mining All Non-Derivable Frequent Itemsets

    Authors: Toon Calders, Bart Goethals

    Abstract: Recent studies on frequent itemset mining algorithms resulted in significant performance improvements. However, if the minimal support threshold is set too low, or the data is highly correlated, the number of frequent itemsets itself can be prohibitively large. To overcome this problem, recently several proposals have been made to construct a concise representation of the frequent itemsets, inst…

    Submitted 3 June, 2002; originally announced June 2002.

    Comments: 3 figures

    ACM Class: H.2.8

  7. arXiv:cs/0112013  [pdf, ps, other]

    cs.DB cs.AI

    A Data Mining Framework for Optimal Product Selection in Retail Supermarket Data: The Generalized PROFSET Model

    Authors: Tom Brijs, Bart Goethals, Gilbert Swinnen, Koen Vanhoof, Geert Wets

    Abstract: In recent years, data mining researchers have developed efficient association rule algorithms for retail market basket analysis. Still, retailers often complain about how to adopt association rules to optimize concrete retail marketing-mix decisions. It is in this context that, in a previous paper, the authors have introduced a product selection model called PROFSET. This model selects the most…

    Submitted 11 December, 2001; originally announced December 2001.

    ACM Class: H.2.8

  8. arXiv:cs/0112011  [pdf, ps, other]

    cs.DB cs.AI

    Interactive Constrained Association Rule Mining

    Authors: Bart Goethals, Jan Van den Bussche

    Abstract: We investigate ways to support interactive mining sessions, in the setting of association rule mining. In such sessions, users specify conditions (queries) on the associations to be generated. Our approach is a combination of the integration of querying conditions inside the mining phase, and the incremental querying of already generated associations. We present several concrete algorithms and c…

    Submitted 5 February, 2003; v1 submitted 10 December, 2001; originally announced December 2001.

    Comments: A preliminary report on this work was presented at the Second International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2000)

    ACM Class: H.2.8

  9. arXiv:cs/0112007  [pdf, ps, other]

    cs.DB cs.AI

    A Tight Upper Bound on the Number of Candidate Patterns

    Authors: Floris Geerts, Bart Goethals, Jan Van den Bussche

    Abstract: In the context of mining for frequent patterns using the standard levelwise algorithm, the following question arises: given the current level and the current set of frequent patterns, what is the maximal number of candidate patterns that can be generated on the next level? We answer this question by providing a tight upper bound, derived from a combinatorial result from the sixties by Kruskal an…

    Submitted 29 November, 2002; v1 submitted 7 December, 2001; originally announced December 2001.

    ACM Class: H.2.8