gaoj0017's Stars
milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
ggerganov/llama.cpp
LLM inference in C/C++
cwida/ALP
ALP: Adaptive Lossless Floating-Point Compression
gaoj0017/RaBitQ
[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search
vietnh1009/ASCII-generator
ASCII generator (image to text, image to image, video to video)
tensorchord/VectorChord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
VectorDB-NTU/Extended-RaBitQ
tensorchord/pgvecto.rs
Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.
kemingy/rabitq
rabitq rust implementation
CaucherWang/Steiner-hardness
A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of your ANN indexes.
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
dangkhoasdc/awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
xuyuzhuang11/OneBit
The homepage of OneBit model quantization framework.
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
YuriMiller/CompleteSuicideManual-Zh_CN
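Simplified Chinese version of "The Complete Manual of Suicide". I did part of the translation, proofreading, and annotation work myself; before long I will be saying goodbye to this world, and this book is my last gift.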
xk-huang/segment-caption-anything
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.
intel/ScalableVectorSearch
goodroot/hugo-classic
A simple and text-centric theme for Hugo.io
pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
qwopqwop200/GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
cmuparlay/ParlayANN
A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such algorithms.
Sam1224/SCCAN
Self-Calibrated Cross Attention Network for Few-Shot Segmentation (ICCV'23)
tpoisonooo/llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
timqian/chinese-independent-blogs
A list of independent Chinese-language blogs
CSStipendRankings/CSStipendRankings
https://csstipendrankings.org
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
binary-husky/gpt_academic
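Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.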