🧪 Behavioural Testing on a Sentiment Classifier 🔬

Check how robust a sentiment classifier is to random typos in our dataset using GitHub Actions and invariance testing.

🗃️ Table of contents

  • 🗺️ A bit of context about this project
  • 📝 Description
  • 🛠️ Set up
  • 🌱 How to use GitHub Actions to test this and your own project
  • 👥 Acknowledgements

🗺️ A bit of context about this project

Back to Top

This project accompanies the talk "Testing Schrödinger's Box, AKA ML Systems" at CommitConf 2024 and Codemotion 2024.

Abstract of the talk:

Just like quantum mechanics and Schrödinger's cat experiment, in AI we have our own mysteries, which, curiously, are also related to boxes. We have come to create systems of such complexity that we call them "black box models" because we are unable to understand what goes on inside them. We only know what goes in and what comes out.

In this talk, we will discuss how we can test these black boxes to shed some light on what goes on inside them, or at least to ensure that they behave in a predictable way, which is not trivial at all. We will also walk through a hands-on example of how to run automatic tests on a sentiment analysis project.

📝 Description

Back to Top

In this project we present the code to predict an individual's belief about climate change based on their Twitter activity.

  • Information about the dataset and data processing performed: data/README.md and docs/processed/data_processing.ipynb
  • Benchmarking of the model: docs/processed/sentiment_analysis_guide.ipynb
  • Invariance test: run_invariance_test.py will assess the robustness of our classifier to typos in our dataset (see the sketch after this list).
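
For illustration, here is a minimal sketch of the kind of invariance test such a script can run. The helper names and the toy classifier below are hypothetical stand-ins, not the actual code in run_invariance_test.py:

```python
import random

def add_random_typo(text, seed=None):
    """Swap two adjacent characters at a random position to simulate a typo."""
    rng = random.Random(seed)
    if len(text) < 2:
        return text
    i = rng.randrange(len(text) - 1)
    return text[:i] + text[i + 1] + text[i] + text[i + 2:]

def invariance_failure_rate(predict, texts, seed=42):
    """Fraction of texts whose predicted label flips after a single typo."""
    flips = sum(
        predict(t) != predict(add_random_typo(t, seed + k))
        for k, t in enumerate(texts)
    )
    return flips / len(texts)

if __name__ == "__main__":
    # Toy stand-in for the real sentiment model's predict function.
    toy_predict = lambda t: "believer" if "warming" in t.lower() else "neutral"
    tweets = ["Global warming is real", "Lovely weather in Madrid today"]
    rate = invariance_failure_rate(toy_predict, tweets)
    # In CI you would assert the rate stays below a tolerance, e.g. rate <= 0.1.
    print(f"Invariance failure rate: {rate:.0%}")
```

The invariance idea: a small perturbation that does not change the meaning of the input (here, a typo) should not change the predicted label, so the failure rate is a direct measure of the classifier's robustness.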

🛠️ Set up

Back to Top

  1. This project requires Python >=3.7 and <=3.11. Check your Python version by running:
python --version
  2. Clone the repository:
git clone https://github.com/LoboaTeresa/Behavioural-Testing-on-a-Sentiment-Clasiffier.git
  3. Install the required packages:
pip install -r requirements.txt
  4. You are ready to go! Don't forget to check the notebooks in the docs folder to understand the data processing and the benchmarking of the model.

🌱 How to use GitHub Actions to test this and your own project

Back to Top

Learning journey on Github Actions: click here

  1. Create your own GitHub repository or fork this one. GitHub Actions is integrated into your project as soon as you create the repository.

  2. Click on Actions in the top bar of your repository to browse pre-built workflows. As you can see, GitHub Actions offers easy integrations with different tools, something indispensable for a CI/CD tool.

  3. Your automated workflows must be defined in a GitHub Actions configuration file in the .github/workflows directory. It must be a YAML file (see the example workflow after this list). Go check out mine.

  4. You can modify the name of the workflow, the name of the job, the Python version, the test to run, etc. You can also add more jobs to the workflow.

  5. Once you have created the YAML file, push it to your repository. This will trigger the workflow, and you will be able to see the results in the Actions tab of your repository.
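
For reference, here is a minimal sketch of what such a workflow file could look like. The file name, action versions, Python version, and step names below are illustrative placeholders, not the exact contents of this repository's workflow:

```yaml
# .github/workflows/invariance-test.yml -- illustrative file name
name: Invariance tests

on: [push, pull_request]               # run on every push and pull request

jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4      # fetch the repository code
      - uses: actions/setup-python@v5  # set up a Python interpreter
        with:
          python-version: "3.10"
      - name: Install dependencies
        run: pip install -r requirements.txt
      - name: Run the invariance test
        run: python run_invariance_test.py  # a failing test fails the job
```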

👥 Acknowledgements

Back to Top

I would like to thank the organizers of CommitConf and Codemotion for giving me the opportunity to share my knowledge with the community. I would also like to thank the community for their support and feedback.

The code in this repository is based on the work of Max Stocker. Go give his repository a star and his Medium article some claps.
