cassidylaidlaw

CS PhD student at UC Berkeley

Pinned Repositories

boltzmann-policy-distribution
Code and pretrained models for the ICLR 2022 paper "The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models"
Language:Python8 2 11
cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Language:Python20
cs285-homework
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)
Language:Jupyter Notebook32 1 017
effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
Language:Python42 3 36
fast-pytorch-adversarial-training
Language:Python3 2 00
hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
Language:Python25 1 05
orpo
Language:Python3 1 00
perceptual-advex
Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".
Language:Python54 3 1410
python-boilerplate
Language:Python4 2 11
ReColorAdv
ReColorAdv and other attacks from the NeurIPS 2019 paper "Functional Adversarial Attacks"
Language:Python36 3 27

cassidylaidlaw's Repositories

cassidylaidlaw/perceptual-advex
Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".
Language:Python54 3 1410
cassidylaidlaw/effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
Language:Python42 3 36
cassidylaidlaw/ReColorAdv
ReColorAdv and other attacks from the NeurIPS 2019 paper "Functional Adversarial Attacks"
Language:Python36 3 27
cassidylaidlaw/cs285-homework
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)
Language:Jupyter Notebook32 1 017
cassidylaidlaw/hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
Language:Python25 1 05
cassidylaidlaw/boltzmann-policy-distribution
Code and pretrained models for the ICLR 2022 paper "The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models"
Language:Python8 2 11
cassidylaidlaw/python-boilerplate
Language:Python4 2 11
cassidylaidlaw/fast-pytorch-adversarial-training
Language:Python3 2 00
cassidylaidlaw/orpo
Language:Python3 1 00
cassidylaidlaw/cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Language:Python20
cassidylaidlaw/playing-it-safe
Code for the paper "Playing it Safe: Adversarial Robustness with an Abstain Option"
Language:Python2 2 00
cassidylaidlaw/malmo
Project Malmo is a platform for Artificial Intelligence experimentation and research built on top of Minecraft. We aim to inspire a new generation of research into challenging new problems presented by this unique environment. --- For installation instructions, scroll down to *Getting Started* below, or visit the project page for more information:
Language:Java1 1 00
cassidylaidlaw/advex-uar
Code for "Testing Robustness Against Unforeseen Adversaries"
Language:Python0 1 00
cassidylaidlaw/auto-attack
Code relative to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
Language:Python1 0
cassidylaidlaw/brainwave-sculpture
Language:Python3 0
cassidylaidlaw/django-puppeteer-pdf
Django Wrapper to the Chrome puppeteer to pdf
Language:Python1 0
cassidylaidlaw/django-push-notifications
Send push notifications to mobile devices through GCM or APNS in Django.
Language:Python2 0
cassidylaidlaw/eslint-plugin-sentence-case
ESLint plugin to enforce sentence case in string literals
Language:JavaScript2 13
cassidylaidlaw/Fixup
A Re-implementation of Fixed-update Initialization
Language:Python2 0
cassidylaidlaw/human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
1
cassidylaidlaw/iframe-sync
Seamlessly embed dynamic web applications within a static website
Language:JavaScript2 0
cassidylaidlaw/kitsu
🦊 A simple, lightweight & framework agnostic JSON:API client
Language:JavaScript2 0
cassidylaidlaw/llm_optimization
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
Language:Python
cassidylaidlaw/microsoftgraph-python
Microsoft Graph API wrapper written in Python
Language:Python1 0
cassidylaidlaw/overcooked-demo
Web application where humans can play Overcooked with AI agents.
cassidylaidlaw/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Language:Python
cassidylaidlaw/PyAPNs2
Python library for interacting with the Apple Push Notification service (APNs) via HTTP/2 protocol
Language:Python1 0
cassidylaidlaw/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python1 0
cassidylaidlaw/resume
My resume, rendered by React
Language:JavaScript2 01
cassidylaidlaw/universe
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.