Pinned Repositories
broad_twitter_corpus
The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)
autoredteam
autoredteam: code for training models that automatically red team other language models
dagw_page
The Danish Gigaword project
emerging_entities_17
Dataset for the Emerging & Novel Entity NER task (WNUT '17)
entity_recognition
framework for doing NER and other types of entity recognition, in Python
generalised-brown
C++ implementation of Generalised Brown clustering and python scripts for feature generation (AAAI 2016)
hatespeechdata
Catalog of abusive language data (PLoS 2020)
lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
twokenize
Python standalone tokenizer
garak
the LLM vulnerability scanner
leondz's Repositories
leondz/hatespeechdata
Catalog of abusive language data (PLoS 2020)
leondz/lm_risk_cards
Risks and targets for assessing LLMs & LLM vulnerabilities
leondz/autoredteam
autoredteam: code for training models that automatically red team other language models
leondz/llmsec-site
leondz/aclsigsec-web
leondz/PyRIT
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.
leondz/garacc
the LLM vulnerability scanner
leondz/garak-test
quality tests for llmsec failure mode detectors
leondz/litellm
Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
leondz/rtd-tutorial-template
Template for the Read the Docs tutorial
leondz/summonademon
leondz/TrustGPT
Can We Trust Large Language Models?: A Benchmark for Responsible Large Language Models via Toxicity, Bias, and Value-alignment Evaluation
leondz/vexillomesse
leondz/www-project-top-10-for-large-language-model-applications
OWASP Foundation Web Respository
leondz/acl-anthology
Data and software for building the ACL Anthology.
leondz/aclrollingreview
ACL Rolling Review website
leondz/advisory-database
Advisory database for Python packages published on pypi.org
leondz/arjun-krishna1
My Github Profile
leondz/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
leondz/CyberAgressionAdo-v1
Dataset of Teen Cyberbullying scenari in French
leondz/dagw-site
leondz/leondz
leondz/llmsecurity
leondz/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
leondz/mole-stance
MoLE: Cross-Domain Label-Adaptive Stance Detection
leondz/nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
leondz/nejlt-kickstart
leondz/Prompt-Engineering-Guide
:octopus: Guide and resources for prompt engineering
leondz/Snowballed_Hallucination
leondz/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.