emigmo

Tsinghua UniversityBeijing

emigmo's Stars

Yxxxb/VoCo-LLaMA
VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
Language:Python734
CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Language:Jupyter Notebook30726
ethz-spylab/agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
Language:Jupyter Notebook486
liugangcode/InfoAlign
The code for "Learning Molecular Representation in a Cell"
Language:Python9
alanaai/EVUD
Egocentric Video Understanding Dataset (EVUD)
Language:Python192
ml-research/LlavaGuard
Language:Python20
YangLing0818/buffer-of-thought-llm
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
Language:Python48551
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Language:Jupyter Notebook17818
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Language:Python2.5k353
yfzhang114/SliME
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
Language:Python1297
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python73750
OpenGVLab/De-focus-Attention-Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
Language:Python28
NL2Code/CodeR
15117
truefoundry/cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Language:Python3.2k255
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python17.8k1.8k
SALT-NLP/demonstrated-feedback
Language:Python10614
kaistAI/Janus
[ACL 2024 NLP4ConvAI Oral] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
Language:Python341
beccabai/Data-centric_multimodal_LLM
Survey on Data-centric Large Language Models
58
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python30.9k3.4k
shenao-zhang/SELM
The official implementation of Self-Exploring Language Models (SELM)
Language:Python556
sahsaeedi/triple-preference-optimization
Language:Python17
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language:Python65039
YueFan1014/VideoAgent
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
Language:Python1065
architsharma97/dpo-rlaif
Language:Jupyter Notebook878
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python50541
RLHF-V/RLAIF-V
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
Language:Python2006
EDiRobotics/GR1-Training
A generalized policy for robotics manipulation
Language:Python703
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python1.4k107
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Language:Python8711
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook14.6k1.3k

emigmo

emigmo's Stars

Yxxxb/VoCo-LLaMA

CleanDiffuserTeam/CleanDiffuser

ethz-spylab/agentdojo

liugangcode/InfoAlign

alanaai/EVUD

ml-research/LlavaGuard

YangLing0818/buffer-of-thought-llm

RL4VLM/RL4VLM

togethercomputer/MoA

yfzhang114/SliME

DAMO-NLP-SG/VideoLLaMA2

OpenGVLab/De-focus-Attention-Networks

NL2Code/CodeR

truefoundry/cognita

infiniflow/ragflow

SALT-NLP/demonstrated-feedback

kaistAI/Janus

beccabai/Data-centric_multimodal_LLM

2noise/ChatTTS

shenao-zhang/SELM

sahsaeedi/triple-preference-optimization

princeton-nlp/SimPO

YueFan1014/VideoAgent

architsharma97/dpo-rlaif

X-LANCE/SLAM-LLM

RLHF-V/RLAIF-V

EDiRobotics/GR1-Training

EvolvingLMMs-Lab/lmms-eval

YifeiZhou02/ArCHer

KindXiaoming/pykan