Test your prompts, models, RAGs. Evaluate and compare LLM outputs, catch regressions, and improve prompt quality.
Primary LanguageTypeScriptMIT LicenseMIT