sunyoe
Body in ME, heart to Algorithm. Dream in sweet, mind in sleep.
Shanghai Jiao Tong University, Shanghai
sunyoe's Stars
wangshuai67/hf-mirror-cli
hf-mirror-cli uses a mirror hosted in China and works out of the box with no configuration, for fast downloads of models from Hugging Face
OpenRL-Lab/Wandb_Tutorial
How to use wandb?
comfyanonymous/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
evilsocket/cake
Distributed LLM inference for mobile, desktop and server.
nerfies/nerfies.github.io
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
TRI-ML/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
baaivision/EVE
EVE: Encoder-Free Vision-Language Models from BAAI
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
bobo0810/LearnDeepSpeed
DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
MILVLG/imp
A family of highly capable yet efficient large multimodal models
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
google/imageinwords
Data release for the ImageInWords (IIW) paper.
AviSoori1x/seemore
From scratch implementation of a vision language model in pure PyTorch
dyhBUPT/iKUN
[CVPR 2024] iKUN: Speak to Trackers without Retraining
binary-husky/gpt_academic
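Provides a practical interactive interface for GPT, GLM, and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation features for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.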
domeccleston/sharegpt
Easily share permanent links to ChatGPT conversations with your friends
YangSun22/TC-MoA
Task-Customized Mixture of Adapters for General Image Fusion (CVPR 2024)
zhilizju/Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and repositories.
FudanNLPLAB/MouSi
Leymore/ruozhiba
OpenGVLab/LAMM
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
tjubiit/TJU-DHD
A newly built high-resolution dataset for object detection and pedestrian detection (IEEE TIP 2020)
dataabc/weiboSpider
Sina Weibo crawler: scrapes Sina Weibo data with Python
cv-cat/Spider_XHS
Xiaohongshu crawler: scrapes Xiaohongshu notes, profile pages, and search results
Evil0ctal/Douyin_TikTok_Download_API
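🚀 Douyin_TikTok_Download_API is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, with support for API calls and online batch parsing and downloading.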
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.