AI-secure/DecodingTrust

A Comprehensive Assessment of Trustworthiness in GPT Models

PythonCC-BY-SA-4.0

Issues

There is loading issue in the leaderboard..
#56 opened 5 months ago by zhimin-z
0
MissingMandatoryValue
#42 opened 9 months ago by richhh520
5
DecodingTrust/src/dt/perspectives/fairness/fairness_evaluation.py
#54 opened 5 months ago by dongjiancheng77
0
When should we set `example_prefix` to be True? And what is the difference between put ICL examples into system prompt versus multi-turn user-assistant chat?
#49 opened 8 months ago by peter-peng-w
1
Analysis request for blog finding that "GPT-4 is more vulnerable than GPT-3.5"
#50 opened 7 months ago by crizCraig
1
Hydra override error when running evaluations after non-editable installation
#12 opened a year ago by ziyic7
14
privacy_evaluation
#28 opened 9 months ago by richhh520
3
Templates for Advglue
#33 opened 9 months ago by dt-ahmed-touila
0
Aggregate results for ethics
#21 opened 9 months ago by jyhong836
3
num_tokens_from_messages() is not implemented
#30 opened 10 months ago by jinz2014
0
conversation template for GPT-neo
#29 opened a year ago by richhh520
0
Assumption of Privacy Assessment on Llama
#27 opened a year ago by richhh520
2
Fairnsss Scoring Keywords & Max Tokens
#18 opened a year ago by danielz02
3
Broken links in GitHub Pages
#26 opened a year ago by ashwhall
1
Division of zero
#23 opened a year ago by jyhong836
0
How to evaluate toxicity task on local hf-llama2-7B？
#19 opened a year ago by AboveParadise
6
OpenAI API key should not be required
#13 opened a year ago by danielz02
1
Remove Duplicate Code in Fairness
#17 opened a year ago by danielz02
1
Paper benchmark results in machine readable format
#2 opened a year ago by ogencoglu
1
black src scripts
#5 opened a year ago by y12uc231
1
[./pre-commit.sh Error] Found 52 errors in 17 files (checked 365 source files)
#6 opened a year ago by y12uc231
1
Difference to HELM benchmark
#1 opened a year ago by ogencoglu
1