nemonameless

BaiduBeijing

Pinned Repositories

Awesome-Pedestrian
Pedestrian Datasets, Papers, Resources
19 3 01
Chess
QT Chinese Chess
Language:C++6 5 00
PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
Language:Python3 0 03
PaddleDetection_YOLOv5
🚀🚀🚀 YOLOv5 of PaddleDetection, Paddle implementation of YOLOv5
Language:Python14 2 25
segmentation-paper-list
Collection of online resources about segmentation.
8 3 00
UniDiffuser_Paddle
Language:Python2 2 00
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Language:Python13.2k 199 5.5k2.9k
PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language:Python567 24 172189
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.4k 103 3.7k3k
PaddleYOLO
🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, YOLOv7u, YOLOv6Lite, RTMDet and so on. 🚀🚀🚀
Language:Python587 15 153141

nemonameless's Repositories

nemonameless/PaddleDetection
Object detection and instance segmentation toolkit based on PaddlePaddle.
Language:Python3 0 03
nemonameless/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0
nemonameless/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Language:Python0 0
nemonameless/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook0 0
nemonameless/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Language:Jupyter Notebook0 0
nemonameless/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python0 0
nemonameless/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python1 0
nemonameless/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Python1 0
nemonameless/Emu
Emu: An Open Multimodal Generalist
Language:Python0 0
nemonameless/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python0 0
nemonameless/FastSAM
Fast Segment Anything
Language:Python2
nemonameless/generative-models
Generative Models by Stability AI
Language:Python0 0
nemonameless/InternVL
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B
Language:Jupyter Notebook0 0
nemonameless/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python0 0
nemonameless/LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)
Language:Python0 0
nemonameless/llama-recipes
Examples and recipes for Llama 2 model
Language:Python0 0
nemonameless/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Language:Python0 0
nemonameless/lynx-llm
paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
Language:Python0 0
nemonameless/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
Language:Python0 0
nemonameless/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Language:Python0 0
nemonameless/PaddleNLP
👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AIGC system etc.
Language:Python1 0
nemonameless/PASSL
PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision Transformer，DEiT，Swin Transformer，CvT，T2T-ViT，MLP-Mixer，XCiT，ConvNeXt，PVTv2 等基础视觉算法
Language:Python1 0
nemonameless/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python0 0
nemonameless/RepViT
RepViT: Revisiting Mobile CNN From ViT Perspective
Language:Python0 0
nemonameless/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Python0 0
nemonameless/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
nemonameless/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Language:Jupyter Notebook1 0
nemonameless/VisIT-Bench
Language:Python0 0
nemonameless/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python0 0
nemonameless/VMamba
VMamba: Visual State Space Models
Language:Python0 0