You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
This is a new feature request and not related to a bug or issue within the current code. However, English is the only supported language and we would like to enable multilingual support.
Describe the solution you'd like
Add multiple language support for text via the Google Cloud Translation API. The proposed solution will process data in the following order:
Capture raw text input from the user. [this step already exists as part of Clean Chat]
Use the Google Cloud Translation API to auto-detect the language.
If language is not equal to English, then convert the non-English language to English using the Cloud Translation API.
Score the translated text for toxicity as part of the Clean Chat pipeline. [this step already exists as part of Clean Chat]
Describe alternatives you've considered
I've considered the tradeoffs between using the Translation API versus analyzing the native language(s) directly. While analyzing the native language directly may produce more accurate results, it does require additional model training and complexity in maintaining multiple language models. Native language analysis may be a feature that we add in the future, but our results with the Google Cloud Translation API look promising and the API is stable, reliable, and scalable.
Additional context
Need to add a flag for users to enable or disable multiple language support. This feature only applies to text input.
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem? Please describe.
This is a new feature request and not related to a bug or issue within the current code. However, English is the only supported language and we would like to enable multilingual support.
Describe the solution you'd like
Add multiple language support for text via the Google Cloud Translation API. The proposed solution will process data in the following order:
Describe alternatives you've considered
I've considered the tradeoffs between using the Translation API versus analyzing the native language(s) directly. While analyzing the native language directly may produce more accurate results, it does require additional model training and complexity in maintaining multiple language models. Native language analysis may be a feature that we add in the future, but our results with the Google Cloud Translation API look promising and the API is stable, reliable, and scalable.
Additional context
Need to add a flag for users to enable or disable multiple language support. This feature only applies to text input.
The text was updated successfully, but these errors were encountered: