Skip to content

Add Guardrails Setup Agent Skill #2417

@joshreini1

Description

@joshreini1

Summary

TruLens supports blocking guardrails (evaluate before returning a response) for both LangChain and LlamaIndex, but the existing skills don't cover guardrail configuration. Users who want to use TruLens feedback functions as runtime safety checks have no guided workflow.

What

Create skills/guardrails/SKILL.md that walks users through:

  • Choosing which feedback functions to use as guardrails (safety metrics like harmfulness/toxicity are obvious, but context_relevance as a hallucination gate is powerful too)
  • Configuring threshold and action (block, warn, fallback response)
  • Framework-specific setup for LangChain (WithFeedbackFilterDocuments) and LlamaIndex guardrail patterns
  • Testing guardrails with adversarial inputs
  • Monitoring guardrail trigger rates in the dashboard

Reference

Difficulty

Easy-Medium

Metadata

Metadata

Assignees

No one assigned

    Labels

    Agent SkillsNew or improved agent skills for AI coding assistantsdocumentationImprovements or additions to documentationgood first issueGood for newcomers

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions