turingmotors/swan
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
C++, Apache-2.0
Issues
- How to support the int8 quantized mode
  #3 opened by torukskywalker
- options for prompt
  #2 opened by torukskywalker