Evaluating quality of Amazon Translate's translations with BERTscore

With this example, you will deploy the resources required in AWS for setting up an evaluator of translations from Spanish to Italian done by Amazon Translate, based on the open-source library BERT score (https://github.com/Tiiiger/bert_score).

Architecture

The evaluator will follow this architecture in high-level:

where we have:

Text files uploaded to an Amazon S3 bucket, containing the original text (in Spanish) and the correct/reference translation to compare (in Italian). Note you must use "|" as the field separator, in example:

Hola|Ciao

Bienvenido a Italia, amigo|Benvenutto a la Italia, amico

Hola chico|Ciao ragazzo

An AWS Lambda function gets automatically triggered for reading the rows in this file, and sending those as messages to an Amazon SQS queue
A container task in Fargate that will be constantly running (until manually stopped) for reading the SQS queue and processing the messages
Each message will be translated with Amazon Translate, and evaluated with BERT Score accordingly
The results of the processing are stored in another location of the Amazon S3 bucket as CSV files, also with "|" separator, including in example:

Original text	Reference text	Translated text	BERT score P	BERT score R	BERT score F1
Hola	Ciao	Ciao	1.0000	1.0000	1.0000
Bienvenido a Italia, amigo	Benvenutto a la Italia, amico	Benvenuti in Italia, amico	0.8611	0.8259	0.8431
Hola chico	Ciao ragazzo	Ciao ragazzo.	0.8643	0.8876	0.8758

where P: Precision, R: Recall, and F1: F1 score.

Finally, an Amazon QuickSight dashboard is created for monitoring the BERTscore obtained for the sample texts. Optionally, you can use Amazon Athena for collecting the output S3 files' information into QuickSight.

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
images		images
translate-bert		translate-bert
README.md		README.md
bert-score20210527110242.csv		bert-score20210527110242.csv
text.txt		text.txt
translate-bert-input.py		translate-bert-input.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluating quality of Amazon Translate's translations with BERTscore

Architecture

Instructions

Pre-requisites

Deploying the AWS Cloud Formation template

Fine tuning and testing the deployment

About

Releases

Packages

Languages

angelalb/translate-evaluate

Folders and files

Latest commit

History

Repository files navigation

Evaluating quality of Amazon Translate's translations with BERTscore

Architecture

Instructions

Pre-requisites

Deploying the AWS Cloud Formation template

Fine tuning and testing the deployment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages