Pinned Repositories
AutoAWQ-llava-fix
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
ctc_decoder
A ctc decoder for both online and offline asr model
fast-autocomplete
Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough
go-ethereum
Official Go implementation of the Ethereum protocol
llama.cpp
Port of Facebook's LLaMA model in C/C++
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
pt-search-deploy
QOL
sentence-transformers
Sentence Embeddings with BERT & XLNet
symengine
SymEngine is a fast symbolic manipulation library, written in C++
shifan3's Repositories
shifan3 doesn’t have any repository yet.