/SketchGCD

This repo contains the code for Sketch-Guided Constrained Decoding, published at ACL 2024(main)

Primary LanguagePython

Python 3.9 MIT License arXiv

SketchGCD: Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access


Leverage powerful blackbox LLM to generate high quality sketch and correct the sketch with a small open-source LLM using constrained decoding.

1. The Overview of SketchGCD

2. Environment Setup

With the repository cloned, we recommend creating a new conda virtual environment:

conda create -n SketchGCD python=3.9
conda activate SketchGCD

Install the required packages:

pip install -r requirements.txt

Download Knowledge Graph Data

At the root of the repository, run the following command to download the data files

git lfs install
git clone https://huggingface.co/datasets/Saibo-creator/SketchGCD-KG
mv SketchGCD-KG constraints/KG

Constraints

The constraints are provided in the constraints directory. Two types of constraints are used in the paper:

Dataset

The datasets used in the paper are:

In addition, we also provide the sketches generated by the blackbox LLMs for the experiments in the paper.

Each sketch dataset contains 8 splits, corresponding to the 8 different LLMs' unconstrained decoding outputs.

  • GPT4
  • GPT3.5
  • Claude
  • Claude-Instant
  • LLAMA2-70B
  • LLAMA2-13B
  • LLAMA2-7B
  • LLAMA-33B

N.B. The datasets will be automatically downloaded when running the experiments scripts. You don't need to download them manually.

Repo Structure

The repository is structured as follows:

  • assets:
    • prompters: contains the prompter configurations for the experiments
    • output: contains the outputs of the experiments
  • constraints: contains the constraints used in the paper
  • src: contains the source code for the models and experiments
    • constrained_generation: contains the code for the constrained generation
    • models: contains the code for the models
    • utils: contains the utility functions
    • datamodule: contains the data module for the datasets
    • metrics: contains the metrics for the experiments
  • scripts: contains the scripts for running the experiments
    • exp_unconstrained_decoding: contains the scripts for the unconstrained decoding experiments
    • exp_constrained_decoding: contains the scripts for the constrained decoding experiments
    • exp_sketch_constrained_decoding: contains the scripts for the sketch-guided constrained decoding experiments
    • ablation_study_beam_size: contains the scripts for the ablation study on beam size
    • ablation_study_many_decoder_sizes: contains the scripts for the ablation study on decoder model size
    • ablation_study_no_origin_input: contains the scripts for the ablation study without providing the origin input in the sketch [Not included in the final paper, only for debugging]
    • ablation_study_label_as_sketch: contains the scripts for the ablation study by using the ground truth label as the sketch [Not included in the final paper, only for debugging]
    • analyse_results: contains the scripts for analysing the results by computing confidence intervals
    • prepare_datasets: contains the scripts for preparing the datasets
    • prepare_prompts: contains the scripts for building the prompts for the experiments
  • test: contains some basic unit tests for the code

Experiments

THere are two tasks in the paper: closed information triplet extraction and constituency parsing.

The three datasets used in the paper are:

  • Closed Information Triplet Extraction: wikinre and synthie
  • Constituency Parsing: penn treebank

There are three different settings for the experiments:

  • unconstrained decoding
  • constrained decoding
  • sketch-guided constrained decoding

For all flags and options, please refer to the scripts/run_experiment.py

Unconstrained Decoding

This is the vanilla setting where we few-shot prompt an LLM to generate the output.

To run the unconstrained decoding experiments, use the following scripts:

bash scripts/exp_unconstrained_decoding/run_cp_unconstrained.sh
bash scripts/exp_unconstrained_decoding/run_synthie_unconstrained.sh
bash scripts/exp_unconstrained_decoding/run_wikinre_unconstrained.sh

Constrained Decoding

This is the setting where we use the constrained decoding to guide the decoding process of the LLM.

To run the constrained decoding experiments, use the following scripts:

bash scripts/exp_constrained_decoding/run_cp_constrained.sh
bash scripts/exp_constrained_decoding/run_synthie_constrained.sh
bash scripts/exp_constrained_decoding/run_wikinre_constrained.sh

Sketch-Guided Constrained Decoding

Sketch-Guided Constrained Decoding is the main contribution of the paper. It consists of two steps:

  1. Generate a sketch using a powerful blackbox LLM
  2. Correct the sketch using a small open-source LLM with constrained decoding

In order to generate the sketch, just reuse the same code as in the unconstrained decoding setting. The sketch is then used as the input for the small open-source LLM with constrained decoding.

In order to avoid calling api repeatedly, we provide the generated sketches for the experiments in the paper, which can be downloaded from the links provided above.

To run the sketch-guided constrained decoding experiments, use the following scripts:

bash scripts/exp_sketch_constrained_decoding/run_cp_SGCD.sh
bash scripts/exp_sketch_constrained_decoding/run_synthie_SGCD.sh
bash scripts/exp_sketch_constrained_decoding/run_wikinre_SGCD.sh

Citation

This repository contains the code for the models and experiments in Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access

@misc{geng2024sketchguidedconstraineddecodingboosting,
      title={Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access},
      author={Saibo Geng and Berkay Döner and Chris Wendler and Martin Josifoski and Robert West},
      year={2024},
      eprint={2401.09967},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2401.09967},
}

Please consider citing our work, if you found the provided resources useful.

License

This project is licensed under the terms of the MIT license.