nemonameless

BaiduBeijing

Pinned Repositories

Awesome-Pedestrian
Pedestrian Datasets, Papers, Resources
19 3 01
Chess
QT Chinese Chess
Language:C++6 5 00
PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
Language:Python3 1 03
PaddleDetection_YOLOv5
🚀🚀🚀 YOLOv5 of PaddleDetection, Paddle implementation of YOLOv5
Language:Python14 2 25
segmentation-paper-list
Collection of online resources about segmentation.
8 3 00
UniDiffuser_Paddle
Language:Python2 2 00
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python13k 199 5.5k2.9k
PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language:Python456 22 162169
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.3k 105 3.7k3k
PaddleYOLO
🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLOX, YOLOv5u, YOLOv7u, YOLOv6Lite, RTMDet and so on. 🚀🚀🚀
Language:Python574 15 152137

nemonameless's Repositories

nemonameless/PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
Language:Python3 1 03
nemonameless/PaddleYOLO
🚀🚀🚀 YOLOSeries of PaddleDetection implementation, PPYOLOE, YOLOX, YOLOv5, YOLOv6, YOLOv7 and so on. 🚀🚀🚀
Language:Python2 1 00
nemonameless/Lumina-T2X
Lumina-T2X is a model for Text to Any Modality Generation
Language:Python1 0 0
nemonameless/Aria
Codebase for Aria - an Open Multimodal Native MoE
Language:Jupyter Notebook0 0
nemonameless/benchmark
Language:Python1 0
nemonameless/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python0 0
nemonameless/Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
Language:Python0 0
nemonameless/DiM-DiffusionMamba
The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Language:Python0 0
nemonameless/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python0 0
nemonameless/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Language:Python0 0
nemonameless/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)
Language:Python
nemonameless/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python0 0
nemonameless/MiniGemini
Official implementation for Mini-Gemini
Language:Python0 0
nemonameless/Mira
Language:Python0 0
nemonameless/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python0 0
nemonameless/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
nemonameless/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
Language:Python0 0
nemonameless/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Language:Python0 0
nemonameless/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
Language:C++1 0
nemonameless/PaddleClas
A treasure chest for visual recognition powered by PaddlePaddle
Language:Python1 0
nemonameless/PaddleMIX
Language:Python0 0
nemonameless/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python
nemonameless/streaming
A Data Streaming Library for Efficient Neural Network Training
Language:Python0 0
nemonameless/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language:Python0 0
nemonameless/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Language:Python
nemonameless/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language:Python0 0
nemonameless/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
nemonameless/weights_st2dy
Language:Python1 0
nemonameless/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)
Language:Python0 0
nemonameless/zigma
The official implementation of "ZigMa: A DiT-Style Mamba-based Diffusion Model
Language:Python0 0