haoyiq114's Stars
SALT-NLP/CultureBank
facebookresearch/ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
haoyiq114/VALOR
Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)
AGI-Edgerunners/LLM-Agents-Papers
A repo that lists papers related to LLM-based agents
TheShadow29/VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
allenai/open-instruct
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks
RManLuo/Awesome-LLM-KG
Awesome papers about unifying LLMs and KGs
shmsw25/FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
yukunZhao/Self-DETECTION
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
bjascob/amrlib
A Python library that makes AMR parsing, generation and visualization simple.
khuangaf/Awesome-Chart-Understanding
A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
salesforce/DiverseSumm
Code and data for the NAACL 2024 paper "Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles"
glgh/awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
khuangaf/CHOCOLATE
Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"
khuangaf/CONCRETE
Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)
khuangaf/CryptocurrencyPrediction
Predict Cryptocurrency Price with Deep Learning
khuangaf/FakingFakeNews
Official implementation of the ACL 2023 paper: "Faking Fake News for Real Fake News Detection: Propaganda-Loaded Training Data Generation"
khuangaf/ZeroFEC
Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"
PlusLabNLP/clipscore-bias
Data for our EMNLP-2023 paper: "Gender Biases in Automatic Evaluation Metrics for Image Captioning"
meta-llama/llama
Inference code for Llama models
wxjiao/Is-ChatGPT-A-Good-Translator
A preliminary evaluation of ChatGPT/GPT-4 for machine translation.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence