Pinned Repositories
inspect_evals
Collection of evals for Inspect AI
apps-monitor-control-eval
A repo to evaluate the performance of different monitors and attack policies
Civil_resistance
foolbox
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
inspect_ai
Inspect: A framework for large language model evaluations
inspect_evals
Collection of evals for Inspect AI
NN_security
paper_measure_code
Numerical experiments for the stable neural networks paper
thesis_code
Relevant code for my PhD thesis
zhenningdavidliu's Repositories
zhenningdavidliu/apps-monitor-control-eval
A repo to evaluate the performance of different monitors and attack policies
zhenningdavidliu/Civil_resistance
zhenningdavidliu/foolbox
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
zhenningdavidliu/inspect_ai
Inspect: A framework for large language model evaluations
zhenningdavidliu/inspect_evals
Collection of evals for Inspect AI
zhenningdavidliu/NN_security
zhenningdavidliu/paper_measure_code
Numerical experiments for the stable neural networks paper
zhenningdavidliu/thesis_code
Relevant code for my PhD thesis