Pinned Repositories
awesome-cbir-papers
📝Awesome and classical image retrieval papers
contentvec
speech self-supervised representations
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
linkerd2
Ultralight, security-first service mesh for Kubernetes. Main repo for Linkerd 2.x.
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
PyTorch-BigGraph
Generate embeddings from large-scale graph-structured data.
QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
so-vits-svc
SoftVC VITS Singing Voice Conversion
voice-changer
Realtime Voice Changer
232136813's Repositories
232136813/awesome-cbir-papers
📝Awesome and classical image retrieval papers
232136813/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
232136813/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
232136813/PyTorch-BigGraph
Generate embeddings from large-scale graph-structured data.
232136813/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
232136813/awesome-vector-search
Collections of vector search related libraries, service and research papers
232136813/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
232136813/camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society
232136813/cu
package cu provides an idiomatic interface to the CUDA Driver API.
232136813/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
232136813/elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
232136813/embedding_model_test
Tests of Chinese vector (embedding) quality using open-source embedding models
232136813/faiss
A library for efficient similarity search and clustering of dense vectors.
232136813/gans-awesome-applications
Curated list of awesome GAN applications and demo
232136813/genworlds
The pod that creates, ensembles, and deploys agents on demand.
232136813/GPTeam
GPTeam: An open-source multi-agent simulation
232136813/guidance
A guidance language for controlling large language models.
232136813/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
232136813/langchaingo
LangChain for Go
232136813/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
232136813/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
232136813/llama
Inference code for LLaMA models
232136813/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
232136813/mindocr
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
232136813/ollama
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
232136813/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
232136813/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
232136813/TextRecognitionDataGenerator
A synthetic data generator for text recognition
232136813/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
232136813/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs