yawen-d
Concordia AI; MPhil in Machine Learning at Cambridge; Former Visiting Research Student at @HumanCompatibleAI, UC Berkeley.
yawen-d's Stars
normster/llm_rules
RuLES: a benchmark for evaluating rule-following in language models
aiverify-foundation/moonshot-data
Contains all assets needed to run with the Moonshot Library (connectors, datasets, and metrics)
aiverify-foundation/moonshot-ui
Web UI for Moonshot
IS2Lab/S-Eval
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
kevinyaobytedance/llm_eval
LLM evaluation.
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
GraySwanAI/nanoGCG
A fast + lightweight implementation of the GCG algorithm in PyTorch
RZFan525/Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
karpathy/LLM101n
LLM101n: Let's build a Storyteller
aiverify-foundation/moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
lakeraai/pint-benchmark
A benchmark for prompt injection detection systems.
MetaGLM/zhipuai-sdk-python-v4
prompt-security/ps-fuzz
Make your GenAI apps safe & secure: test & harden your system prompt
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
haizelabs/redteaming-resistance-benchmark
METR/task-standard
METR Task Standard
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
OpenSafetyLab/SALAD-BENCH
[ACL 2024] SALAD benchmark & MD-Judge
ChiangE/Sophon
The implementation of Sophon
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4 or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
AI-secure/DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
alexandrasouly/strongreject
Repository for the "StrongREJECT for Empty Jailbreaks" paper
open-compass/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, Llama 2, Qwen, GLM, Claude, etc.) over 100+ datasets.
BeenKim/BeenKim.github.io
website
Xianjun-Yang/Awesome_papers_on_LLMs_detection
The latest papers on detection of LLM-generated text and code