[FEATURE] Basic guardrails for remote model input and output #2227

ylwu-amzn · 2024-03-19T17:18:41Z

Is your feature request related to a problem?
LLMs may generate biased, impolite, or hateful output. We need to build guardrails to block such results.

What solution would you like?
Add a basic guardrail based on stop words and regular expressions. If the model input or output contains a stop word or matches a regex, an exception is thrown. This will be released in 2.13.
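To make the proposal concrete, here is a minimal Python sketch of a stop-word and regex check applied to both the model input and the model output. It is an illustration only, not the ml-commons implementation: the names `GuardrailViolation`, `BasicGuardrail`, and `guarded_predict` are hypothetical, and case-insensitive whole-word matching for stop words is an assumption that the issue does not specify.

```python
import re
from typing import Callable, Iterable, List, Pattern


class GuardrailViolation(Exception):
    """Raised when text trips a stop-word or regex rule (hypothetical name)."""


class BasicGuardrail:
    """Hypothetical guardrail: rejects text containing a stop word or matching a regex."""

    def __init__(self, stop_words: Iterable[str] = (), regexes: Iterable[str] = ()):
        # Assumption: stop words are matched case-insensitively as whole words.
        self.stop_words: List[str] = [w.lower() for w in stop_words if w]
        self.patterns: List[Pattern[str]] = [re.compile(r) for r in regexes]

    def validate(self, text: str) -> None:
        lowered = text.lower()
        for word in self.stop_words:
            # Lookarounds require a non-word character (or string edge) on both sides.
            if re.search(r"(?<!\w)" + re.escape(word) + r"(?!\w)", lowered):
                raise GuardrailViolation(f"stop word matched: {word!r}")
        for pattern in self.patterns:
            if pattern.search(text):
                raise GuardrailViolation(f"regex matched: {pattern.pattern!r}")


def guarded_predict(
    model: Callable[[str], str],
    prompt: str,
    input_guardrail: BasicGuardrail,
    output_guardrail: BasicGuardrail,
) -> str:
    """Validate the prompt, call the model, then validate the response."""
    input_guardrail.validate(prompt)
    response = model(prompt)
    output_guardrail.validate(response)
    return response
```

With this sketch, a blocked prompt never reaches the remote model, and a blocked response never reaches the caller; in both cases the caller sees `GuardrailViolation`. Keeping separate input and output guardrail objects lets the two sides use different rule sets.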

What alternatives have you considered?
No

Do you have any additional context?
We plan to add guardrails based on other guardrail services and on semantic checks after 2.13.

chishui · 2024-03-26T01:55:33Z

Can this guardrail be turned on and off by users?

jngz-es · 2024-04-01T22:56:35Z

#2209

ylwu-amzn added the enhancement (New feature or request) and untriaged labels, and removed the untriaged label, Mar 19, 2024

github-actions bot added the untriaged label Mar 19, 2024

ylwu-amzn added this to ml-commons projects Mar 19, 2024

ylwu-amzn moved this to In Progress in ml-commons projects Mar 19, 2024

ylwu-amzn mentioned this issue Mar 19, 2024

Agent framework GA/ml-commons 2.13 release tasks tracking #2190

Closed

ylwu-amzn added the v2.13.0 label (Issues targeting release v2.13.0) and removed the untriaged label Mar 19, 2024

ylwu-amzn changed the title ~~[FEATURE] Guardrails for remote model input and output~~ [FEATURE] Basic guardrails for remote model input and output Mar 25, 2024

jngz-es closed this as completed Apr 1, 2024

github-project-automation bot moved this from In Progress to Done in ml-commons projects Apr 1, 2024

github-project-automation bot added this to OpenSearch Project Roadmap Aug 30, 2024

github-project-automation bot moved this to 2.13.0 (Launched) in OpenSearch Project Roadmap Aug 30, 2024
