Pinned Repositories
HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
alpaca_eval
This fork is for evaluations in Safetywashing https://arxiv.org/abs/2407.21792
BritishNationalCipherChallenge
unofficial archive of the British National Cipher Challenge
FastChat
This fork is for evaluations in Safetywashing https://arxiv.org/abs/2407.21792
harmbench_website
jailbreakbench.github.io
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models
llm_forecasting
Forecasting with LLMs
newspaper4k-forecasting-ai
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
openlimit
Efficient rate limiter for the OpenAI API
justinphan3110cais's Repositories
justinphan3110cais/harmbench_website
justinphan3110cais/newspaper4k-forecasting-ai
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
justinphan3110cais/alpaca_eval
This fork is for evaluations in Safetywashing https://arxiv.org/abs/2407.21792
justinphan3110cais/BritishNationalCipherChallenge
unofficial archive of the British National Cipher Challenge
justinphan3110cais/FastChat
This fork is for evaluations in Safetywashing https://arxiv.org/abs/2407.21792
justinphan3110cais/jailbreakbench.github.io
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models
justinphan3110cais/llm_forecasting
Forecasting with LLMs
justinphan3110cais/openlimit
Efficient rate limiter for the OpenAI API
justinphan3110cais/PurpleLlama
This fork is for evaluations in Safetywashing https://arxiv.org/abs/2407.21792
justinphan3110cais/safetywashing_website
justinphan3110cais/transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers.
justinphan3110cais/wmdp