AdaCheng's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
SimplifyJobs/Summer2025-Internships
Collection of Summer 2025 tech internships!
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
dair-ai/ML-Papers-of-the-Week
🔥 Highlighting the top ML papers every week.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
HugoBlox/theme-academic-cv
🎓 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
CStanKonrad/long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
tdurieux/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of GitHub repositories for open-science code and data.
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
allenai/ai2thor
An open-source platform for Visual AI.
microsoft/SoM
Set-of-Mark Prompting for LMMs
OptimalScale/DetGPT
askforalfred/alfred
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
alfworld/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
huangwl18/language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
alibaba/conv-llava
THUNLP-MT/StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
2toinf/DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
AdaCheng/EgoThink
[CVPR'24 Highlight] The official code and data for the paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models"
2toinf/IVM
The official implementation of "Instruction-Guided Visual Masking"
AdaCheng/Awesome-Embodied-AI
Paper List of Embodied AI with Foundation Models
AdaCheng/Awesome-Research-Agent
List of papers and projects on research agents, covering review, paper reading, and more.