Distributed CI Error Classifier

Objective

In the current setup when a job fails we don’t know if it failed because of DCI issue or issue is at partners end. As test cases are executed at partners end, error can be specifically caused because of issues in partners systems or networks. All these jobs have to be first checked by an RH developer and based on the error it is decided if it is an RH issue or partner issue.

To save time, we build an error log classification model using machine learning and NLP techniques . This model reads the job data and automatically classifies the failed log into DCI or non DCI. Once the job is classified, RH resources have to invest time only for jobs which are marked as DCI error type and all other jobs are redirected to corresponding partners.

Installation

clone this repository
use the package manager pip to install dciclient:

$ pip install python-dciclient

The package provides the API: a python module one can use to interact with a control server (dciclient.v1.api.*)

Configuration

Remoteci creation

DCI is connected to the Red Hat SSO. You need to log in https://www.distributed-ci.io with your redhat.com SSO account. Your user account will be created in our database the first time you connect.

After the first connection you can create a remoteci. Go to https://www.distributed-ci.io/remotecis and click Create a new remoteci button. Once your remoteci is created, you can retrieve the connection information in the Authentication column. Save this information in remoteci.rc file.

At this point, you can validate your credentials with the following commands:

$ source remoteci.rc

If you see your remoteci in the list, everything is working great so far.

Classifier

The classifier is built using a rule based system in NLP. Rules are stored in the elasticsearch database. Below is the pipeline for the model development.

To run the classifier: dci-classifier job-labelling --product="<product_name>"

Rules table Schema:

'Error_Type', type=str, default="None",choices=['non DCI','DCI'],help='Error label'
'Job_ID', type=str, default="0", help='Test job id'
'Stage_of_Failure', type=str, default="0",help='Task name at which job failed'
'Error_Message', type=str, default="0",help='Error content'
'Is_user_text', type=int,choices=[0,1],default=0, help='user_text.yml in failed bucket'
'Is_SUT', type=int,choices=[0,1],default=0, help='SUT.yml in failed bucket'
'Is_install', type=int,choices=[0,1],default=0, help='install.yml in failed bucket'
'Is_logs', type=int,choices=[0,1],default=0, help='logs.yml in failed bucket'
'Is_dci_rhel_cki', type=int,choices=[0,1],default=0, help='Failed task dci-rhel-cki'

New Rule creation

Flask API is created to create new rule. Entry point for the new rule creation is app.py @app.route('/rules', methods=['POST'])
To create new rule, run the API : http POST http://0.0.0.0:1234/rules <parameter_1="value>-----<parameter_n="value">
Sample for parameters in above command:
Stage_of_Failure = "Run the pre-run hook"
Error_Type = "non DCI"
Is_SUT = "1"

Testing new rule

Entry point for the rule testing is app.py @app.route('/rules/test', methods=['POST'])
Command to test the rule : http POST http://0.0.0.0:1234/rules/test <parameter_1="value>-----<parameter_n="value">
Parameters should be corresponding to the job id passed for testing the rule.

Search existing rules

Entry point for the getting list of all rules present in database is app.py @app.route('/rules', methods=['GET'])
Command to search the rule : http GET http://0.0.0.0:1234/rules

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
api		api
classifier		classifier
tests		tests
.gitignore		.gitignore
DCI_Classifier_Model.png		DCI_Classifier_Model.png
README.md		README.md
__init__.py		__init__.py
requirements.txt		requirements.txt
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed CI Error Classifier

Table of Contents

Objective

Installation

Configuration

Remoteci creation

Classifier

Rules table Schema:

New Rule creation

Testing new rule

Search existing rules

About

Releases

Packages

Languages

asergienk/dci-error-log-classification

Folders and files

Latest commit

History

Repository files navigation

Distributed CI Error Classifier

Table of Contents

Objective

Installation

Configuration

Remoteci creation

Classifier

Rules table Schema:

New Rule creation

Testing new rule

Search existing rules

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages