Using Sampling Strategy to Assist Consensus Sequence Analysis

08/19/2020
by   Zhichao Xu, et al.
0

Consensus Sequences of event logs are often used in process mining to quickly grasp the core sequence of events to be performed in a process, or to represent the backbone of the process for doing other analyses. However, it is still not clear how many traces are enough to properly represent the underlying process. In this paper, we propose a novel sampling strategy to determine the number of traces necessary to produce a representative consensus sequence. We show how to estimate the difference between the predefined Expert Model and the real processes carried out. This difference level can be used as reference for domain experts to adjust the Expert Model. In addition, we apply this strategy to several real-world workflow activity datasets as a case study. We show a sample curve fitting task to help readers better understand our proposed methodology.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2019

Expert Sample Consensus Applied to Camera Re-Localization

Fitting model parameters to a set of noisy data points is a common probl...
research
02/17/2022

A Deep Learning Approach for Repairing Missing Activity Labels in Event Logs for Process Mining

Process mining is a relatively new subject that builds a bridge between ...
research
11/17/2020

Visual Drift Detection for Event Sequence Data of Business Processes

Event sequence data is increasingly available in various application dom...
research
08/08/2023

Event Abstraction for Enterprise Collaboration Systems to Support Social Process Mining

One aim of Process Mining (PM) is the discovery of process models from e...
research
04/20/2012

Automatic Sampling of Geographic objects

Today, one's disposes of large datasets composed of thousands of geograp...
research
08/24/2020

Infrastructure Recovery Curve Estimation Using Gaussian Process Regression on Expert Elicited Data

Infrastructure recovery time estimation is critical to disaster manageme...
research
07/16/2021

Estimation from Partially Sampled Distributed Traces

Sampling is often a necessary evil to reduce the processing and storage ...

Please sign up or login with your details

Forgot password? Click here to reset