Pinned Repositories
2nd-place-solution-in-Scene-Understanding-for-Autonomous-Drone-Delivery
2nd place solution in Scene Understanding for Autonomous Drone Delivery
ADer
ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.
approachingalmost
Approaching (Almost) Any Machine Learning Problem
AutoKG
Code and dataset for the paper "LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities".
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
awesome-graph-self-supervised-learning
Awesome Graph Self-Supervised Learning
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
QAnything
Question and Answer based on Anything.
quduoduo.github.io
wazuh
Wazuh - The Open Source Security Platform. Unified XDR and SIEM protection for endpoints and cloud workloads.
quduoduo's Repositories
quduoduo/QAnything
Question and Answer based on Anything.
quduoduo/ADer
ADer is an open source visual anomaly detection toolbox based on PyTorch, which supports multiple popular AD datasets and approaches.
quduoduo/quduoduo.github.io
quduoduo/Chat-UniVi
[CVPR 2024🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
quduoduo/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
quduoduo/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
quduoduo/face_recognition
The world's simplest facial recognition api for Python and the command line
quduoduo/GPT4V-Image-Captioner
quduoduo/HuggingFists
A low-code data flow tool that allows for convenient use of LLM and HuggingFace models, with some features considered as a low-code version of Langchain.
quduoduo/IG-VLM
quduoduo/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型
quduoduo/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
quduoduo/MiniCPM-V
MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities
quduoduo/MiniGPT4Qwen
Personal Project: MPP-Qwen14B(Multimodal Pipeline Parallel-Qwen14B). Don't let the poverty limit your imagination! Train your own 14B LLaVA-like MLLM on RTX3090/4090 24GB.
quduoduo/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
quduoduo/MM-TSFlib
quduoduo/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
quduoduo/prismatic-vlms
*****A flexible and efficient codebase for training visually-conditioned language models (VLMs)
quduoduo/RWKV-Infer
A large-scale RWKV v6 inference wrapper using the Cuda backend. Easy to deploy on docker. Supports multi-batch generation and dynamic State switching. Let's spread RWKV, which combines RNN technology with impressively low inference costs!
quduoduo/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
quduoduo/ST-EVCDP
A real-world dataset for EV-related research, e.g., spatiotemporal prediction and urban energy management.
quduoduo/STGormer
Here is the repository containing our code implementation of Spatio-Temporal Graph Transformer (STGormer).
quduoduo/Time-Series-Library
A Library for Advanced Deep Time Series Models.
quduoduo/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
quduoduo/Valley
The official repository of "Video assistant towards large language model makes everything easy"
quduoduo/VideoRecap
quduoduo/vision_transformer
quduoduo/wiseflow
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.
quduoduo/xTP-LLM
quduoduo/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks