strawsyz

strawsyz's Stars

SoccerNet/sn-caption
Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.
Language:Python262
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2k129
rakutentech/Document-understanding
Language:Python61
openai/transformer-debugger
Language:Python4k231
strawsyz/KnowledgeSelection
Applying LLM on visual domain efficiently. Based on VisualGLM
Language:Python1
THUDM/GLM
GLM (General Language Model)
Language:Python3.2k321
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python7.6k444
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Language:Python13.3k1.5k
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Language:Python40.4k5.2k
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Language:Python9.4k687
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.4k4.5k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.7k944
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.7k243
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.6k3.4k
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Language:Python52k6.8k
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Language:Python2.8k353
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Language:Python2.9k269
fjchange/awesome-video-anomaly-detection
Papers for Video Anomaly Detection, released codes collection, Performance Comparision.
581101
keirp/automatic_prompt_engineer
Language:Python1.1k144
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python38.6k4.3k
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python12.6k2.9k
yufree/sciguide
现代科研指北
Language:R19938
lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
Language:Python850106
GoogleContainerTools/distroless
🥑 Language focused docker images, minus the operating system.
Language:Starlark18.7k1.1k
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
17.7k2.6k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.5k2.5k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook24.7k3.2k
open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Language:Python4.2k1.2k
gallantlab/pycortex
Pycortex is a python-based toolkit for surface visualization of fMRI data
Language:JavaScript581137
jinwchoi/awesome-action-recognition
A curated list of action recognition and related area resources
3.8k724