-
Extracting Rules from Event Data for Study Planning
Authors:
Majid Rafiei,
Duygu Bayrak,
Mahsa Pourbafrani,
Gyunam Park,
Hayyan Helal,
Gerhard Lakemeyer,
Wil M. P. van der Aalst
Abstract:
In this study, we examine how event data from campus management systems can be used to analyze the study paths of higher education students. The main goal is to offer valuable guidance for their study planning. We employ process and data mining techniques to explore the impact of sequences of taken courses on academic success. Through the use of decision tree models, we generate data-driven recomm…
▽ More
In this study, we examine how event data from campus management systems can be used to analyze the study paths of higher education students. The main goal is to offer valuable guidance for their study planning. We employ process and data mining techniques to explore the impact of sequences of taken courses on academic success. Through the use of decision tree models, we generate data-driven recommendations in the form of rules for study planning and compare them to the recommended study plan. The evaluation focuses on RWTH Aachen University computer science bachelor program students and demonstrates that the proposed course sequence features effectively explain academic performance measures. Furthermore, the findings suggest avenues for developing more adaptable study plans.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
TraVaG: Differentially Private Trace Variant Generation Using GANs
Authors:
Majid Rafiei,
Frederik Wangelik,
Mahsa Pourbafrani,
Wil M. P. van der Aalst
Abstract:
Process mining is rapidly growing in the industry. Consequently, privacy concerns regarding sensitive and private information included in event data, used by process mining algorithms, are becoming increasingly relevant. State-of-the-art research mainly focuses on providing privacy guarantees, e.g., differential privacy, for trace variants that are used by the main process mining techniques, e.g.,…
▽ More
Process mining is rapidly growing in the industry. Consequently, privacy concerns regarding sensitive and private information included in event data, used by process mining algorithms, are becoming increasingly relevant. State-of-the-art research mainly focuses on providing privacy guarantees, e.g., differential privacy, for trace variants that are used by the main process mining techniques, e.g., process discovery. However, privacy preservation techniques for releasing trace variants still do not fulfill all the requirements of industry-scale usage. Moreover, providing privacy guarantees when there exists a high rate of infrequent trace variants is still a challenge. In this paper, we introduce TraVaG as a new approach for releasing differentially private trace variants based on \text{Generative Adversarial Networks} (GANs) that provides industry-scale benefits and enhances the level of privacy guarantees when there exists a high ratio of infrequent variants. Moreover, TraVaG overcomes shortcomings of conventional privacy preservation techniques such as bounding the length of variants and introducing fake variants. Experimental results on real-life event data show that our approach outperforms state-of-the-art techniques in terms of privacy guarantees, plain data utility preservation, and result utility preservation.
△ Less
Submitted 29 March, 2023;
originally announced March 2023.
-
Interactive Process Improvement using Simulation of Enriched Process Trees
Authors:
Mahsa Pourbafrani,
Wil M. P. van der Aalst
Abstract:
Event data provide the main source of information for analyzing and improving processes in organizations. Process mining techniques capture the state of running processes w.r.t. various aspects, such as activity-flow and performance metrics. The next step for process owners is to take the provided insights and turn them into actions in order to improve their processes. These actions may be taken i…
▽ More
Event data provide the main source of information for analyzing and improving processes in organizations. Process mining techniques capture the state of running processes w.r.t. various aspects, such as activity-flow and performance metrics. The next step for process owners is to take the provided insights and turn them into actions in order to improve their processes. These actions may be taken in different aspects of a process. However, simply being aware of the process aspects that need to be improved as well as potential actions is insufficient. The key step in between is to assess the outcomes of the decisions and improvements. In this paper, we propose a framework to systematically compare event data and the simulated event data of organizations, as well as comparing the results of modified processes in different settings. The proposed framework could be provided as an analytic service to enable organizations in easily accessing event data analytics. The framework is supported with a simulation tool that enables applying changes to the processes and re-running the process in various scenarios. The simulation step includes different perspectives of a process that can be captured automatically and modified by the user. Then, we apply a state-of-the-art comparison approach for processes using their event data which visually reflects the effects of these changes in the process, i.e., evaluating the process improvement. Our framework also includes the implementation of the change measurement module as a tool.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
SIMPT: Process Improvement Using Interactive Simulation of Time-aware Process Trees
Authors:
Mahsa Pourbafrani,
Shuai Jiao,
Wil M. P. van der Aalst
Abstract:
Process mining techniques including process discovery, conformance checking, and process enhancement provide extensive knowledge about processes. Discovering running processes and deviations as well as detecting performance problems and bottlenecks are well-supported by process mining tools. However, all the provided techniques represent the past/current state of the process. The improvement in a…
▽ More
Process mining techniques including process discovery, conformance checking, and process enhancement provide extensive knowledge about processes. Discovering running processes and deviations as well as detecting performance problems and bottlenecks are well-supported by process mining tools. However, all the provided techniques represent the past/current state of the process. The improvement in a process requires insights into the future states of the process w.r.t. the possible actions/changes. In this paper, we present a new tool that enables process owners to extract all the process aspects from their historical event data automatically, change these aspects, and re-run the process automatically using an interface. The combination of process mining and simulation techniques provides new evidence-driven ways to explore "what-if" questions. Therefore, assessing the effects of changes in process improvement is also possible. Our Python-based web-application provides a complete interactive platform to improve the flow of activities, i.e., process tree, along with possible changes in all the derived activity, resource, and process parameters. These parameters are derived directly from an event log without user-background knowledge.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
Removing Operational Friction Using Process Mining: Challenges Provided by the Internet of Production (IoP)
Authors:
Wil van der Aalst,
Tobias Brockhoff,
Anahita Farhang Ghahfarokhi,
Mahsa Pourbafrani,
Merih Seran Uysal,
Sebastiaan van Zelst
Abstract:
Operational processes in production, logistics, material handling, maintenance, etc., are supported by cyber-physical systems combining hardware and software components. As a result, the digital and the physical world are closely aligned, and it is possible to track operational processes in detail (e.g., using sensors). The abundance of event data generated by today's operational processes provide…
▽ More
Operational processes in production, logistics, material handling, maintenance, etc., are supported by cyber-physical systems combining hardware and software components. As a result, the digital and the physical world are closely aligned, and it is possible to track operational processes in detail (e.g., using sensors). The abundance of event data generated by today's operational processes provides opportunities and challenges for process mining techniques supporting process discovery, performance analysis, and conformance checking. Using existing process mining tools, it is already possible to automatically discover process models and uncover performance and compliance problems. In the DFG-funded Cluster of Excellence "Internet of Production" (IoP), process mining is used to create "digital shadows" to improve a wide variety of operational processes. However, operational processes are dynamic, distributed, and complex. Driven by the challenges identified in the IoP cluster, we work on novel techniques for comparative process mining (comparing process variants for different products at different locations at different times), object-centric process mining (to handle processes involving different types of objects that interact), and forward-looking process mining (to explore "What if?" questions). By addressing these challenges, we aim to develop valuable "digital shadows" that can be used to remove operational friction.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
A Python Extension to Simulate Petri nets in Process Mining
Authors:
M. Pourbafrani,
Sandhya Vasudevan,
Faizan Zafar,
Yuan Xingran,
Ravikumar Singh,
Wil M. P. van der Aalst
Abstract:
The capability of process mining techniques in providing extensive knowledge and insights into business processes has been widely acknowledged. Process mining techniques support discovering process models as well as analyzing process performance and bottlenecks in the past executions of processes. However, process mining tends to be "backward-looking" rather than "forward-looking" techniques like…
▽ More
The capability of process mining techniques in providing extensive knowledge and insights into business processes has been widely acknowledged. Process mining techniques support discovering process models as well as analyzing process performance and bottlenecks in the past executions of processes. However, process mining tends to be "backward-looking" rather than "forward-looking" techniques like simulation. For example, process improvement also requires "what-if" analyses. In this paper, we present a Python library that uses an event log to directly generate a simulated event log, with additional options for end-users to specify the duration of activities and the arrival rate. Since the generated simulation model is supported by historical data (event data)and it is based on the Discrete Event Simulation (DES) technique, the generated event data is similar to the behavior of the real process.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
PMSD: Data-Driven Simulation Using System Dynamics and Process Mining
Authors:
Mahsa Pourbafrani,
Wil M. P. van der Aalst
Abstract:
Process mining extends far beyond process discovery and conformance checking, and also provides techniques for bottleneck analysis and organizational mining. However, these techniques are mostly backward-looking. PMSD is a web application tool that supports forward-looking simulation techniques. It transforms the event data and process mining results into a simulation model which can be executed a…
▽ More
Process mining extends far beyond process discovery and conformance checking, and also provides techniques for bottleneck analysis and organizational mining. However, these techniques are mostly backward-looking. PMSD is a web application tool that supports forward-looking simulation techniques. It transforms the event data and process mining results into a simulation model which can be executed and validated. PMSD includes log transformation, time window selection, relation detection, interactive model generation, simulating and validating the models in the form of system dynamics, i.e., a technique for an aggregated simulation. The results of the modules are visualized in the tool for a better interpretation
△ Less
Submitted 24 September, 2020;
originally announced October 2020.