leonardtang's Stars
shreyansh26/Red-Teaming-Language-Models-with-Language-Models
A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022
braintrustdata/autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
fiddler-labs/fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
chziakas/redeval
A library for red-teaming LLM applications with LLMs.
mnns/LLMFuzzer
🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed for Large Language Models (LLMs), especially for their integrations in applications via LLM APIs. 🚀💥
traceloop/openllmetry
Open-source observability for your LLM application, based on OpenTelemetry
confident-ai/deepeval
The LLM Evaluation Framework
whylabs/langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
mik0w/pallms
Payloads for Attacking Large Language Models
utkusen/promptmap
automatically tests prompt injection attacks on ChatGPT instances
thestephencasper/explore_establish_exploit_llms
ruixiangcui/AGIEval
centerforaisafety/tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
qiuhuachuan/smile
[EMNLP 2024] 中文领域心理健康对话大模型MeChat
marcotcr/checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
thestephencasper/gpt4_bs
Examples of prompts that cause ChatGPT-4 to hallucinate.
mitre/advmlthreatmatrix
Adversarial Threat Landscape for AI Systems
mlb2251/stitch
A scalable abstraction learning library
ora-io/awesome-uniswap-hooks
A curated list of awesome Uniswap v4 hooks resources.
bcc-research/CFMMRouter.jl
Convex optimization for fun and profit. (Now in Julia!)
gizatechxyz/orion
ONNX Runtime in Cairo 1.0 for verifiable ML inference using STARK
w3f/Grants-Program
Web3 Foundation Grants Program
Consensys/daedaluzz
Benchmark Generator for Smart-Contract Fuzzers
rfeinman/pyBPL
Python implementation of Bayesian Program Learning tools (with PyTorch)
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
glassroom/heinsen_routing
Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All Domains" (Heinsen, 2019), for composing deep neural networks.
booydar/recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
wolflo/evm-opcodes
A quick reference for EVM opcodes
wesleyjtann/Safe-SmartContracts
A Sequence Learning Approach to Detecting Vulnerabilities
cleanunicorn/karl
Monitor smart contracts deployed on blockchain and test against vulnerabilities with Mythril. It was presented at DEFCON 2019.