Knowledge Injection in the era of LLMs

knowledge injection in the pre-training or fine-tuning process

The approach in this part mainly involves fine-tuning LLMs using domain-specific data, thereby resulting in numerous vertical domain LLMs. The generation of their dataset can be sourced from domain-specific knowledge graphs or online data, and so on. Here, we mainly list some open-source large models in the medical field.

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining, Briefings in Bioinformatics, 2022
DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task, Arxiv, 2023
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge, Arxiv, 2023
PMC-LLaMA: Towards Building Open-source Language Models for Medicine, Arxiv, 2023
HuatuoGPT, Towards Taming Language Models To Be a Doctor, Arxiv, 2023

knowledge injection with KB

The approach in this part is mainly to integrate the domain knowledge base into LLMs, usually involving the graph related algorithms and the retrieval way from the knowledge base.

knowlege injection with external knowledge (document corpus or other types of knowledge that is different from KB)

The method in this part mainly integrates externally available domain knowledge into LLMs. Note that this external knowledge is different from the knowledge graph. It is generally presented in the form of natural text (not a graph), so there is no need to use graphs to retrieve or query.

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework, ACL2023
Unified Demonstration Retriever for In-Context Learning, ACL2023
LLaMAIndex, Github, 2022
LangChain, Github, 2022
ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory, Arxiv, 2023
The CALLA Dataset: Probing LLMs' Interactive Knowledge Acquisition from Chinese Medical Literature, Arxiv, 2023

knowledge injection with model self-driving

This part organizes the methods of model self-driving to obtain domain knowledge. Specifically, this type of method designs prompts to allow the LLM to generate the required domain-related text by itself, and then uses the generated text to perform in-domain tasks.

knowledge probing benchmarks

When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories, ACL2023
Do Large Language Models Know What They Don’t Know?, findings of ACL2023
Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation, ACL2023

lyyang01/awesome-knowledge-injection-in-LLMs