XiangBaoSong's Stars
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
zhayujie/chatgpt-on-wechat
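A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom applications, Feishu, DingTalk, and more. Selectable models include GPT3.5/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI. It can process text, voice, and images, access the operating system and the internet, and supports building a customized enterprise customer-service assistant on top of your own knowledge base.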
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
google-deepmind/open_x_embodiment
facebookresearch/home-robot
Mobile manipulation research tools for roboticists
rhymes-ai/Aria
Codebase for Aria - an Open Multimodal Native MoE
robodhruv/visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
vlmaps/vlmaps
[ICRA2023] Implementation of Visual Language Maps for Robot Navigation
leostrong8/openai-fill-billing
OpenAI top-up guide
yfeng95/SCARF
ir413/mvp
Masked Visual Pre-training for Robotics
bcmi/SLBR-Visible-Watermark-Removal
[ACM MM 2021] Visible Watermark Removal via Self-calibrated Localization and Background Refinement
OrigamiDream/gato
Unofficial Gato: A Generalist Agent
heyuanYao-pku/Control-VAE
benquick123/C-VTON
C-VTON: Context-Driven Image-Based Virtual Try-On Network
siddhanthaldar/BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
ManifoldRG/NEKO
In-progress implementation of a GATO-style generalist multimodal model capable of image, text, RL, and robotics tasks
YushuoLi/Gato-A-Generalist-Agent
Minimal code for A Generalist Agent
sfd158/SimAndViewCharacter