Pinned Repositories
GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
HippoRAG
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
LLM4Chem
Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"
MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
TableLlama
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
UGround
Official Repo for UGround
OSU Natural Language Processing's Repositories
OSU-NLP-Group/HippoRAG
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
OSU-NLP-Group/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
OSU-NLP-Group/SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
OSU-NLP-Group/TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
OSU-NLP-Group/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
OSU-NLP-Group/LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
OSU-NLP-Group/TableLlama
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
OSU-NLP-Group/UGround
Official Repo for UGround
OSU-NLP-Group/LLM4Chem
Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"
OSU-NLP-Group/LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
OSU-NLP-Group/llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
OSU-NLP-Group/AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLMs
OSU-NLP-Group/QA4RE
[ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"
OSU-NLP-Group/Middleware
Repository for paper "Tools Are Instrumental for Language Agents in Complex Environments"
OSU-NLP-Group/Auto-SQL-Correction
Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23)
OSU-NLP-Group/ScienceAgentBench
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
OSU-NLP-Group/Deductive-Beam-Search
[COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"
OSU-NLP-Group/AgentAttack
OSU-NLP-Group/In-Context-Reranking
Code for "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
OSU-NLP-Group/AttributionBench
OSU-NLP-Group/EIA_against_webagent
OSU-NLP-Group/MQA
Multimodal Question Answering for Unified Information Extraction
OSU-NLP-Group/COSMO
[CIKM'24] Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs
OSU-NLP-Group/Eval-LLM-Trust
OSU-NLP-Group/GroundCocoa
OSU-NLP-Group/FL4SemanticParsing
OSU-NLP-Group/Text2SQL-Error-Detection
Code for paper "Error Detection for Text-to-SQL Semantic Parsing"
OSU-NLP-Group/UMLS-Vocabulary-Insertion
OSU-NLP-Group/LLM-CN-Eval
[NAACL'24] A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models