Case Attribute Discovery

🚨 This repository is now a subpackage of pix-framework.discovery, https://github.com/AutomatedProcessImprovement/pix-framework/tree/main/src/pix_framework/discovery/case_attribute_discovery.

Case Attribute Discovery

Python package to discover case attributes from an event log and their value distribution (stochastic if discrete, probability distribution if continuous).

Example of use

import pandas as pd

from case_attribute_discovery.config import DEFAULT_CSV_IDS
from case_attribute_discovery.discovery import discover_case_attributes

# Read event log
event_log = pd.read_csv("path_to_event_log.csv")

# Simple call
case_attributes = discover_case_attributes(
    event_log=event_log,
    log_ids=DEFAULT_CSV_IDS
)

# Call specifying the columns to not take into account for case attribute analysis
case_attributes = discover_case_attributes(
    event_log=event_log,
    log_ids=DEFAULT_CSV_IDS,
    avoid_columns=[
        DEFAULT_CSV_IDS.case, DEFAULT_CSV_IDS.activity,
        DEFAULT_CSV_IDS.start_time, DEFAULT_CSV_IDS.end_time
    ]
)

# Call specifying a confidence (or noise) threshold to allow a certain noise 
# in the variability of the attribute along the trace: 
#  - For each trace, the confidence of the most frequent value is computed (i.e. 
#  the % of activity instances from that trace with that same value). For example, 
#  a trace with 8 activity instances with 'amount'=100 and 2 with 'amount'=150 
#  will have a confidence of 0.8.
#  - The average confidence in all traces must be higher or equal to the specified
#  one to consider that column a case attribute.

case_attributes = discover_case_attributes(
    event_log=event_log,
    log_ids=DEFAULT_CSV_IDS,
    confidence_threshold=0.9
)

To see a more detailed example of use, and the format of the output, you can check this test file.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
src/case_attribute_discovery		src/case_attribute_discovery
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Case Attribute Discovery

Example of use

About

Releases 8

Packages

Contributors 2

Languages

License

AutomatedProcessImprovement/case-attribute-discovery

Folders and files

Latest commit

History

Repository files navigation

Case Attribute Discovery

Example of use

About

Resources

License

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 2

Languages

Packages