Pinned Repositories
Awesome-Pedestrian
Pedestrian Datasets, Papers, Resources
Chess
QT Chinese Chess
PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
PaddleDetection_YOLOv5
🚀🚀🚀 YOLOv5 of PaddleDetection, Paddle implementation of YOLOv5
segmentation-paper-list
Collection of online resources about segmentation.
UniDiffuser_Paddle
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
PaddleYOLO
🚀🚀🚀 PaddlePaddle implementation of the YOLO series: PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLOX, YOLOv5u, YOLOv7u, YOLOv6Lite, RTMDet and so on. 🚀🚀🚀
nemonameless's Repositories
nemonameless/PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
nemonameless/PaddleYOLO
🚀🚀🚀 YOLO series implementation based on PaddleDetection: PPYOLOE, YOLOX, YOLOv5, YOLOv6, YOLOv7 and so on. 🚀🚀🚀
nemonameless/Lumina-T2X
Lumina-T2X is a model for Text to Any Modality Generation
nemonameless/Aria
Codebase for Aria - an Open Multimodal Native MoE
nemonameless/benchmark
nemonameless/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
nemonameless/Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
nemonameless/DiM-DiffusionMamba
The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
nemonameless/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
nemonameless/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
nemonameless/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)
nemonameless/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
nemonameless/MiniGemini
Official implementation for Mini-Gemini
nemonameless/Mira
nemonameless/Monkey
[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
nemonameless/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
nemonameless/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
nemonameless/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
nemonameless/Paddle
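PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the PaddlePaddle (飞桨) core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)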
nemonameless/PaddleClas
A treasure chest for visual recognition powered by PaddlePaddle
nemonameless/PaddleMIX
nemonameless/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
nemonameless/streaming
A Data Streaming Library for Efficient Neural Network Training
nemonameless/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
nemonameless/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
nemonameless/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
nemonameless/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
nemonameless/weights_st2dy
nemonameless/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)
nemonameless/zigma
The official implementation of "ZigMa: A DiT-Style Mamba-based Diffusion Model"