amsterdata/arguseyes

ArgusEyes

ArgusEyes is a system that lets data scientists declaratively specify the pipeline issues they are concerned about. ArgusEyes then instruments and executes the pipeline and screens it for the configured issues, as part of continuous integration processes. It detects complex issues by tracking record-level provenance and understanding the semantics of operations in ML pipelines. ArgusEyes was presented as an abstract at CIDR'22.
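To make the idea of record-level provenance more concrete, here is a minimal conceptual sketch in plain Python. It is not ArgusEyes code and does not use its API: the `Record` class, the `source_id` field, and the `split` and `leaked_ids` helpers are all hypothetical names chosen for illustration. The sketch assumes each record carries the index of the raw input row it came from, so that an overlap between train and test data can be detected after the fact.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    # Hypothetical provenance identifier: index of the row in the raw input.
    source_id: int
    features: tuple


def split(records, test_every=3):
    """Send every `test_every`-th record to the test set, the rest to train."""
    train = [r for i, r in enumerate(records) if i % test_every != 0]
    test = [r for i, r in enumerate(records) if i % test_every == 0]
    return train, test


def leaked_ids(train, test):
    """Return provenance ids of raw rows that appear in both train and test."""
    return {r.source_id for r in train} & {r.source_id for r in test}


raw = [Record(i, (i * 1.5,)) for i in range(6)]
train, test = split(raw)

# A deliberately buggy step: an extra record is drawn from the raw data
# instead of from the train split, so a test row ends up in the train set.
buggy_train = train + [raw[0]]

print(leaked_ids(train, test))
print(leaked_ids(buggy_train, test))
```

Because the provenance identifier travels with each record, the check works no matter how many intermediate operations sit between the raw input and the final train and test sets. That is the property this sketch is meant to illustrate.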

We provide three example scenarios (note that you have to install ArgusEyes locally first to execute them). In each, ArgusEyes executes the pipeline and screens it for a particular issue:

  • Detecting mislabeled images in a computer vision pipeline:
    ./eyes arguseyes/example_pipelines/mlinspect-computervision-sneakers-labelerrors.yaml

  • Spotting data leakage in a price prediction pipeline:
    ./eyes arguseyes/example_pipelines/mlflow-regression-nyctaxifare-dataleakage.yaml

  • Addressing fairness violations in a credit scoring pipeline:
    ./eyes arguseyes/example_pipelines/openml-classification-incomelevel-fairness.yaml
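For a continuous integration job, the three commands above could be wrapped in a small script. The following is a sketch under stated assumptions, not part of ArgusEyes: `EXAMPLE_CONFIGS` and `screen_all` are hypothetical names, and the sketch assumes that `./eyes` signals a detected issue or a failed run through a non-zero exit code, which this README does not confirm.

```python
import subprocess

# The three example configurations listed above.
EXAMPLE_CONFIGS = [
    "arguseyes/example_pipelines/mlinspect-computervision-sneakers-labelerrors.yaml",
    "arguseyes/example_pipelines/mlflow-regression-nyctaxifare-dataleakage.yaml",
    "arguseyes/example_pipelines/openml-classification-incomelevel-fairness.yaml",
]


def screen_all(configs=EXAMPLE_CONFIGS, run=subprocess.run):
    """Run ./eyes once per config and return the configs that exited non-zero.

    Assumption: a non-zero exit code means the screening did not pass.
    The `run` parameter exists only so the function can be tried out
    without invoking the real command.
    """
    failed = []
    for config in configs:
        result = run(["./eyes", config])
        if result.returncode != 0:
            failed.append(config)
    return failed
```

Calling `screen_all()` from the repository root, with the virtual environment activated, runs the three scenarios in order. A CI step could then fail the build if the returned list is non-empty.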

Local setup

Prerequisite: Python 3.9

  1. Clone this repository

  2. Set up the environment

    cd arguseyes
    python3.9 -m venv venv
    source venv/bin/activate

  3. Install graphviz

    Linux: apt-get install graphviz
    macOS: brew install graphviz

  4. Install pip dependencies

    pip install -r requirements.txt
