skywalker023/fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
Python · MIT
Issues
Did you try few-shot prompting GPT-4?
#2 opened by lukasberglund
Dataset/code release?
#1 opened by Jiayi-Pan