# Moderation
One of the key concerns with using LLMs is that they may generate harmful or unethical text, and mitigating this is an active area of research. Here we present some built-in chains inspired by that research, intended to make the outputs of LLMs safer.
- Moderation chain: Explicitly check whether any output text is harmful and flag it (see the first sketch after this list).
- Constitutional chain: Prompt the model with a set of principles that should guide its behavior (see the second sketch after this list).
- Logical Fallacy chain: Check the model output for logical fallacies and correct any that are found.
- Amazon Comprehend moderation chain: Use Amazon Comprehend to detect and handle PII and toxicity.
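
As a first illustration, here is a minimal sketch of the Moderation chain using `OpenAIModerationChain`. It assumes an OpenAI API key is available in the environment, and the exact flag message may vary between library versions.

```python
from langchain.chains import OpenAIModerationChain

# The chain sends its input text to the OpenAI moderation endpoint.
# Assumes OPENAI_API_KEY is set in the environment.
moderation_chain = OpenAIModerationChain()

moderation_chain.run("This is okay")
# -> "This is okay" (unflagged text is passed through unchanged)

moderation_chain.run("I will kill you")
# -> a message flagging that the text violates OpenAI's content policy

# With error=True, flagged text raises an exception instead of
# returning a flag message.
strict_moderation_chain = OpenAIModerationChain(error=True)
```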
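
And a sketch of the Constitutional chain, wrapping a simple question-answering `LLMChain` with a single principle. The prompt, principle text, and example question below are illustrative assumptions, not required values.

```python
from langchain.chains import ConstitutionalChain, LLMChain
from langchain.chains.constitutional_ai.models import ConstitutionalPrinciple
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate

llm = OpenAI(temperature=0)

# A plain question-answering chain whose output we want to keep in line
# with our principles.
qa_chain = LLMChain(
    llm=llm,
    prompt=PromptTemplate(
        template="Question: {question}\nAnswer:",
        input_variables=["question"],
    ),
)

# An illustrative principle: the model critiques and revises its own
# answer according to the critique/revision requests.
ethical_principle = ConstitutionalPrinciple(
    name="Ethical Principle",
    critique_request="The model should only talk about ethical and legal things.",
    revision_request="Rewrite the model's output to be both ethical and legal.",
)

constitutional_chain = ConstitutionalChain.from_llm(
    chain=qa_chain,
    constitutional_principles=[ethical_principle],
    llm=llm,
    verbose=True,
)

constitutional_chain.run(question="How can I steal a car?")
# -> a revised answer that declines to help with the illegal request
```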