ai-testing

Here are 119 public repositories matching this topic...

Giskard-AI / giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

ai-security mlops fairness-ai responsible-ai ml-validation red-team-tools trustworthy-ai ml-testing llm ai-red-team ai-testing llmops llm-security llm-eval llm-evaluation rag-evaluation agent-evaluation

Updated Feb 26, 2026
Python

langwatch / scenario

Agentic testing for agentic codebases

python-library javascript-library ai-testing agent-simulations agent-testing

Updated Feb 26, 2026
TypeScript

Pacific-AI-Corp / langtest

Deliver safe & effective language models

nlp artificial-intelligence benchmarks benchmark-framework model-assessment ai-safety mlops responsible-ai ml-safety trustworthy-ai ethics-in-ai ml-testing large-language-models llm ai-testing llm-test llm-evaluation-toolkit llm-as-evaluator llm-testing

Updated Feb 19, 2026
Python

Addepto / contextcheck

MIT-licensed framework for testing LLMs, RAG systems, and chatbots. Configurable via YAML and integrable into CI pipelines for automated testing.

open-source ci testing-tools chatbot-framework testing-framework chatbot-testing rag ai-chat large-language-models llm ai-testing llm-evaluation llm-evaluation-framework prompt-test llm-testing ai-testing-tool generative-ai-testing rag-testing summarization-testing

Updated Dec 11, 2024
Python

PramodDutta / qaskills

QA Skills Directory: a curated directory of testing-specific skills for AI coding agents (Claude Code, Cursor, Copilot, etc.).

testing qa selenium test-automation cursor cypress sdet playwright ai-testing agent-skills claude-code agent-browser vibium qaskils

Updated Feb 27, 2026
TypeScript

tianshanghong / GPT4Go

GPT4Go: AI-Powered Test Case Generation for Golang 🧪

golang test-automation openai code-generation software-testing test-generation golang-testing test-case-generation go-testing golang-utility openai-api golang-test golang-tests gpt-4 chatgpt chatgpt-go ai-testing ai-powered-testing gpt4go

Updated Apr 5, 2023
Go

srvsngh99 / genai-testing-journey

52-week journey from QA/SDET to GenAI Testing - learning in public with weekly mini-projects, code, and honest documentation of struggles and wins.

python test-automation qa-engineering learning-in-public prompt-engineering ai-testing genai llm-testing 52-week-challenge

Updated Feb 23, 2026
Python

ai-dashboad / flutter-skill

AI-powered E2E testing for 10 platforms. 253 MCP tools. Zero config. Works with Claude, Cursor, Windsurf, Copilot. Test Flutter, React Native, iOS, Android, Web, Electron, Tauri, KMP, .NET MAUI — all from natural language.

Updated Feb 25, 2026
Dart

kdunee / intentguard

A Python library for verifying code properties using natural language assertions.

testing natural-language test-automation pytest unittest code-quality language-models code-verification llm ai-testing

Updated Mar 1, 2025
Python

TommyLemon / CVAuto

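👁 Zero-code, zero-annotation automated testing tool for CV AI 🚀 Removes the need for extensive manual bounding-box drawing and labeling, and quickly automates testing of computer vision (CV) AI image recognition algorithms with zero code: pedestrian detection, animal and plant classification, face recognition, OCR license plate recognition, rotation correction, dance pose estimation, image matting and segmentation, and more. Test reports can be downloaded and training and test datasets exported with one click.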

Updated Feb 23, 2026
JavaScript

greynewell / matchspec

Eval framework. Define correct, test against it, get results.

Updated Feb 17, 2026
Go

onerun-ai / onerun

Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord: https://discord.gg/ssd4S37WNW

security ai simulation chatbot ai-agents ai-testing llm-testing chatbot-simulation

Updated Sep 15, 2025
Python

josharsh / mcp-jest

Automated testing for Model Context Protocol servers. Ship MCP Servers with confidence.

nodejs testing cli automation typescript jest mcp ci-cd test-framework developer-tools ai-testing anthropic model-context-protocol mcp-server

Updated Jan 23, 2026
TypeScript

monkscode / Natural-Language-to-Robot-Framework

Turn plain English into Robot Framework files with AI. No dependencies, no hassle — just validated, ready-to-run tests

python docker open-source natural-language-processing selenium test-automation quality-assurance robotframework automation-framework software-testing fastapi large-language-models generative-ai ai-testing agentic-framework llm-applications nlp-to-code

Updated Feb 16, 2026
Python

alepot55 / agentrial

Statistical evaluation framework for AI agents

python testing ci-cd pytest confidence-intervals quality-assurance non-deterministic ai-agents mlops statistical-testing llm ai-testing llm-evaluation agent-evaluation

Updated Feb 6, 2026
Python

naodeng / awesome-qa-prompt

A professional collection of AI prompts for QA (Quality Assurance) professionals, designed to help test engineers and QA teams work more efficiently throughout the software testing lifecycle.

qa prompts prompt-engineering ai-testing

Updated Feb 26, 2026
TypeScript

greynewell / evaldriven.org

Ship evals before you ship features.

Updated Feb 25, 2026
Nunjucks

KI-Testen / Uebungen

Exercises for the book "Basiswissen KI-Testen"

artificial-intelligence exercises software-testing german-language hands-on ai-testing

Updated Dec 20, 2024
Jupyter Notebook

sjnims / cc-plugin-eval

4-stage evaluation framework for testing Claude Code plugin component triggering. Validates skills, agents, and commands activate correctly via programmatic detection and LLM judgment.

cli typescript test-automation developer-tools evaluation-framework testing-framework claude llm ai-testing anthropic claude-code claude-agent-sdk plugin-testing

Updated Feb 23, 2026
TypeScript

hemangjoshi37a / claude-code-frontend-dev

🚀 First multimodal AI-powered visual testing plugin for Claude Code. AI that can SEE your UI! 10x faster frontend development with closed-loop testing, browser automation, and Claude 4.5 Sonnet vision.

Updated Jan 25, 2026
Python
