strawsyz's Stars
SoccerNet/sn-caption
Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
rakutentech/Document-understanding
openai/transformer-debugger
strawsyz/KnowledgeSelection
Applying LLM on visual domain efficiently. Based on VisualGLM
THUDM/GLM
GLM (General Language Model)
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
fjchange/awesome-video-anomaly-detection
Papers for Video Anomaly Detection, released codes collection, Performance Comparision.
keirp/automatic_prompt_engineer
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
yufree/sciguide
现代科研指北
lucidrains/RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
GoogleContainerTools/distroless
🥑 Language focused docker images, minus the operating system.
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
gallantlab/pycortex
Pycortex is a python-based toolkit for surface visualization of fMRI data
jinwchoi/awesome-action-recognition
A curated list of action recognition and related area resources