opengear-project/GEAR
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Python · MIT
Issues
- 0
No such file or directory: 'ldconfig'
#21 opened by yaldashbz - 2
Clarification on Code Structure and Usage
#20 opened by DKmiyan - 1
Integration with FlashAttention
#19 opened by ThisisBillhe - 0
- 4
Question about storage
#7 opened by mlxht990720 - 3
Question about LowRank
#11 opened by shhn1 - 1
Questions about zero-shot
#14 opened by YcChou - 1
- 2
How to reproduce GEAR on Mistral models
#12 opened by CUHKSZzxy - 3
Questions about the code structure
#10 opened by CUHKSZzxy - 1
- 2
How to eval GEAR with lm-eval framework?
#6 opened by ThisisBillhe - 2
Question about the shell commands details on how to reproduce the main results of COT and zeroshot performance
#9 opened by zoominguniverse - 2
Can't reproduce the benchmarks
#8 opened by cyLi-Tiger - 2
questions about GenerationTest folder
#4 opened by hzfengfengxia - 3
- 2
questions about rapids folder
#3 opened by hzfengfengxia