Pinned Repositories
AdaTAD
The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Aicity2023-Track3
Aicity2023-Track3
AIcity2024-track3
libtorch_RefineDet_2020
libtorch_RefineDet
llama
Inference code for LLaMA models
Pytorch-Retinaface-Mask-Detection
Pytorch版本的Retainface, 用于人脸和口罩检测
Social-Distancing-using-YOLOv5
tensorflow_models_nets
tensorflow GoogleNet inception V1 V2 V3 V4
TensorRT-YOLOv4
tensorrt5, yolov4, yolov3,yolov3-tniy,yolov3-tniy-prn
tensorrtx
Implementation of popular deep learning networks with TensorRT network definition APIs
wolfworld6's Repositories
wolfworld6/AIcity2024-track3
wolfworld6/AdaTAD
The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
wolfworld6/llama
Inference code for LLaMA models
wolfworld6/AL-Ref-SAM2
AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
wolfworld6/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
wolfworld6/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大语言模型 (Chinese LLaMA-2 & Alpaca-2 LLMs)
wolfworld6/DetGPT
wolfworld6/dot
wolfworld6/ego4d_asl
code for Ego4D Workshop@CVPR 2023 - 1st in MQ & 2nd in NLQ challenge
wolfworld6/InternVideo
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
wolfworld6/keras-llm-robot
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
wolfworld6/LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
wolfworld6/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
wolfworld6/MIC
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
wolfworld6/MovieChat
🔥 chat with over 10K frames of video!
wolfworld6/Multi-LLM-Agent
wolfworld6/NExT-Chat
The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".
wolfworld6/Note-YOLO
Here are notes on YOLO and other related topics of object detection and instance segmentation.
wolfworld6/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
wolfworld6/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
wolfworld6/RFAConv
RAFConv: Innovating Spatital Attention and Standard Convolutional Operation
wolfworld6/Skywork
wolfworld6/snag_release
Official Implementation of SnAG (CVPR 2024)
wolfworld6/SoM
Set-of-Mark Prompting for LMMs
wolfworld6/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
wolfworld6/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
wolfworld6/VisRAG
Parsing-free RAG supported by VLMs
wolfworld6/Vista
A Generalizable World Model for Autonomous Driving
wolfworld6/VQLoC
Open-set visual object query search & localization in long-form videos
wolfworld6/yolo_research
based on yolo-high-level project (detect\pose\classify\segment\):include yolov5\yolov7\yolov8\ core ,improvement research ,SwintransformV2 and Attention Series. training skills, business customization, engineering deployment C