Pinned Repositories
aiverify
AI Verify
aiverify-developer-tools
api-model-server
backend-testing
Redis Testing
frontend-testing
API & Integration Testing of Frontend
LLM-Evals-Catalogue
This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation frameworks, benchmarks and papers.
moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
moonshot-data
Contains all assets needed to run the Moonshot library (connectors, datasets, and metrics).
moonshot-ui
Web UI for Moonshot
test