This repository provides the evaluation datasets, implementation, and sample demos for the paper CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks (EMNLP 2023, Main Conference).
We provide support for API calls to two Large Language Models (LLMs): PaLM and GPT-3.5-turbo.
Please make sure you have your own valid API keys and place them in the `openai_k.txt` and `google_k.txt` files, respectively, before running the experiments.
Refer to the documentation of PaLM and GPT-3.5-turbo regarding prerequisites.
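As a minimal sketch of the setup step (assuming the scripts expect one key per line in each file, which is not confirmed here; check the loading code if unsure), you can populate the key files from the shell:

```bash
# Hypothetical setup: write each key on a single line in the expected file.
# Replace the placeholders with your actual API keys.
echo "YOUR_OPENAI_API_KEY" > openai_k.txt   # used for GPT-3.5-turbo calls
echo "YOUR_GOOGLE_API_KEY" > google_k.txt   # used for PaLM calls
```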
We conduct evaluations on subsets of the MTOP and MASSIVE datasets. For both evaluation datasets, few-shot in-context learning samples are provided. Please refer to our paper for further details.
Please refer to the manuscript for the detailed rationale behind the experiment design. Individual prompts contain minor updates to account for the generated outputs.
```bash
bash run_query.sh ${dataset} ${model_type} ${add_demo}
```
where the arguments are defined as follows (an example invocation follows this list):

- `dataset`: evaluation dataset (i.e., MASSIVE or MTOP)
- `model_type`: backbone LLM (i.e., palm or gpt)
- `add_demo`: whether to add demonstration few-shot samples (few-shot / in-context learning) or not (zero-shot learning)
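For example, the following would run a few-shot (in-context learning) evaluation on MTOP with GPT-3.5-turbo. The value `true` for `add_demo` is an assumption for illustration; check `run_query.sh` for the exact flag format it expects.

```bash
# Hypothetical example: few-shot evaluation on MTOP with GPT-3.5-turbo
bash run_query.sh MTOP gpt true
```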
If you find our ideas, code, or dataset helpful, please consider citing our work as follows:
```bibtex
@inproceedings{nguyen2023cof,
  title={CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks},
  author={Nguyen, Hoang and Liu, Ye and Zhang, Chenwei and Zhang, Tao and Philip, S Yu},
  booktitle={Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing},
  pages={12109--12119},
  year={2023}
}
```