Pinned Repositories
Awesome-Pedestrian
Pedestrian Datasets, Papers, Resources
Chess
QT Chinese Chess
PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
PaddleDetection_YOLOv5
🚀🚀🚀 YOLOv5 of PaddleDetection, Paddle implementation of YOLOv5
segmentation-paper-list
Collection of online resources about segmentation.
UniDiffuser_Paddle
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
PaddleYOLO
🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, YOLOv7u, YOLOv6Lite, RTMDet and so on. 🚀🚀🚀
nemonameless's Repositories
nemonameless/PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
nemonameless/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
nemonameless/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
nemonameless/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
nemonameless/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
nemonameless/ColossalAI
Making large AI models cheaper, faster and more accessible
nemonameless/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
nemonameless/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
nemonameless/Emu
Emu: An Open Multimodal Generalist
nemonameless/EVA
EVA Series: Visual Representation Fantasies from BAAI
nemonameless/FastSAM
Fast Segment Anything
nemonameless/generative-models
Generative Models by Stability AI
nemonameless/InternVL
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B
nemonameless/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
nemonameless/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)
nemonameless/llama-recipes
Examples and recipes for Llama 2 model
nemonameless/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
nemonameless/lynx-llm
paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
nemonameless/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
nemonameless/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
nemonameless/PaddleNLP
👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AIGC system etc.
nemonameless/PASSL
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法
nemonameless/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
nemonameless/RepViT
RepViT: Revisiting Mobile CNN From ViT Perspective
nemonameless/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
nemonameless/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
nemonameless/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
nemonameless/VisIT-Bench
nemonameless/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
nemonameless/VMamba
VMamba: Visual State Space Models