gaoj0017

Nanyang Technological UniversitySingapore

gaoj0017's Stars

milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Language:Go31.7k3k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70.3k10.1k
cwida/ALP
ALP: Adaptive Lossless Floating-Point Compression
Language:C++727
gaoj0017/RaBitQ
[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search
Language:C++639
vietnh1009/ASCII-generator
ASCII generator (image to text, image to image, video to video)
Language:Python7.6k572
tensorchord/VectorChord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Language:Rust39517
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Language:Python48434
VectorDB-NTU/Extended-RaBitQ
Language:C++163
tensorchord/pgvecto.rs
Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.
Language:Rust1.9k73
kemingy/rabitq
rabitq rust implementation
Language:Rust9
CaucherWang/Steiner-hardness
A new query hardness measure for graph-based ANN indexes. Build unbiased workloads with this hardness to see the actual performance of your ANN indexes.
Language:C++125
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Language:Cuda67358
dangkhoasdc/awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
23611
xuyuzhuang11/OneBit
The homepage of OneBit model quantization framework.
Language:Python1624
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
Language:C1.2k67
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python33.2k5.1k
YuriMiller/CompleteSuicideManual-Zh_CN
简体中文版本的《完全自杀手册》，自己完成了一部分翻译和校对、注释工作，不久之后就要与这个世界告别，这一本书算是最后的礼物。
807120
xk-huang/segment-caption-anything
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gradio demo that show how to use the model.
Language:Python2098
intel/ScalableVectorSearch
Language:C++13220
goodroot/hugo-classic
A simple and text-centric theme for Hugo.io
Language:CSS14970
pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++1.2k521
qwopqwop200/GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
Language:Python3k460
cmuparlay/ParlayANN
A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such algorithms.
Language:C++12629
Sam1224/SCCAN
Self-Calibrated Cross Attention Network for Few-Shot Segmentation (ICCV'23)
Language:Python374
tpoisonooo/llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
Language:Python35631
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
Language:Python1.6k120
timqian/chinese-independent-blogs
中文独立博客列表
Language:Python21k2.5k
CSStipendRankings/CSStipendRankings
https://csstipendrankings.org
Language:HTML20154
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Language:Python21.1k3.7k
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python66.8k8.2k

gaoj0017

gaoj0017's Stars

milvus-io/milvus

ggerganov/llama.cpp

cwida/ALP

gaoj0017/RaBitQ

vietnh1009/ASCII-generator

tensorchord/VectorChord

microsoft/BitBLAS

VectorDB-NTU/Extended-RaBitQ

tensorchord/pgvecto.rs

kemingy/rabitq

CaucherWang/Steiner-hardness

tspeterkim/flash-attention-minimal

dangkhoasdc/awesome-vector-database

xuyuzhuang11/OneBit

ashvardanian/SimSIMD

vllm-project/vllm

YuriMiller/CompleteSuicideManual-Zh_CN

xk-huang/segment-caption-anything

intel/ScalableVectorSearch

goodroot/hugo-classic

pytorch/FBGEMM

qwopqwop200/GPTQ-for-LLaMa

cmuparlay/ParlayANN

Sam1224/SCCAN

tpoisonooo/llama.onnx

LuChengTHU/dpm-solver

timqian/chinese-independent-blogs

CSStipendRankings/CSStipendRankings

openai/chatgpt-retrieval-plugin

binary-husky/gpt_academic