braintrustdata/autoevals

AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.

PythonMIT

Issues

Basic ValidJSON Question
#105 opened 5 days ago
5
Supported scores
#101 opened 2 months ago
1
Azure Openai is not well configured
#100 opened 2 months ago
1
Is there a way to access LLM token usage
#96 opened 3 months ago
3
Support of Azure Open AI models and API
#89 opened 4 months ago
4
(`autoevals` JS): Better support for evaluating based on pre-generated answer
#84 opened 5 months ago
2
(`autoevals` JS) Better support and documentation for using context-based evaluators in `Eval` run
#82 opened 5 months ago
3
JS `AnswerRelevancy` bug with model configuration
#81 opened 5 months ago
2
Context Relevancy issue with score not between 0 and 1.
#80 opened 5 months ago
0
Question about deps
#61 opened 9 months ago
1
General Question about the Evaluator LLM
#51 opened a year ago
3
(docs) add examples on how to use autoevals.llm
#30 opened a year ago
3
[Feat] Can you read openai api key from the .env
#29 opened a year ago
2
Factuality Evaluator failing
#28 opened a year ago
7
Add a `make test` github action
#4 opened 9 months ago
0