This project was conducted as part of the Introduction to Deep Learning course at Carnegie Mellon University (CMU).
The goal of this project is to investigate social bias in self-improvement methods using three different models and five distinct techniques. The dataset used for experiments is BBQ data.
- LLaMA2
- LLaMA3
- Gemini
- Zero-shot reasoning
- CoT (Chain-of-Thought) reasoning
- Self-consistency
- Self-consistency without CoT
- Self-refinement
- RCI
-
Install the required dependencies:
pip install -r requirements.txt
-
make personal.py and write the google api key for Gemini as below.
GEMINI_KEY=""
-
Run the main script to observe results for each method and model:
bash run.sh
This script will execute experiments and display the results for all methods across the three models.
-
To perform specific or customized experiments, modify the
run.sh
script as needed. -
For Self-refinement, use the dedicated script:
bash ./self-refine/run.sh