shisantaibao's Stars
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
kaixindelele/ChatPaper
Use ChatGPT to summarize arXiv papers. Accelerate the entire research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and drafting review responses.
eseckel/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
liguodongiot/llm-action
This project shares the technical principles behind large language models along with hands-on, practical experience.
andimarafioti/florence2-finetuning
Quick exploration into fine-tuning Florence-2
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Raguggg/quillbot-premium-for-free
Quillbot Unlock: lets users paraphrase an unlimited number of words, with access to seven writing modes and four synonym options. The summarizer supports up to 6,000 words and can process up to 15 sentences at once. Users can also freeze an unlimited number of words and phrases.
AntonioTepsich/Convolutional-KANs
This project extends the Kolmogorov-Arnold Network (KAN) architecture to convolutional layers, replacing the classic linear transformation of the convolution with learnable non-linear activations at each pixel.
Ablustrund/LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
FreedomIntelligence/ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
v2rayA/v2rayA
A web GUI client of Project V which supports VMess, VLESS, SS, SSR, Trojan, Tuic and Juicity protocols. 🚀
Yuliang-Liu/Monkey
[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Coobiw/MPP-LLaVA
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-style MLLM on an RTX 3090/4090 with 24 GB.
nwpu-zxr/VadCLIP
Official PyTorch implementation of VadCLIP
rese1f/MovieChat
[CVPR 2024] 🎬💭 chat with over 10K frames of video!
PKU-YuanGroup/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
lzw-lzw/GroundingGPT
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
AppFlowy-IO/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
vikhyat/moondream
tiny vision language model
LLaVA-VL/LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
fbcotter/pytorch_wavelets
PyTorch implementation of the 2D Discrete Wavelet Transform (DWT), the Dual-Tree Complex Wavelet Transform (DTCWT), and a DTCWT-based ScatterNet
alexandrosstergiou/adaPool
[T-IP 2023] Code for exponential adaptive pooling (adaPool) in PyTorch
dwromero/ckconv
Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/2102.02611
JiuTian-VL/JiuTian-LION
[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
BAAI-DCAI/Bunny
A family of lightweight multimodal models.