NVIDIA/trt-llm-as-openai-windows
This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows instead of cloud.
PythonNOASSERTION
Stargazers
- 90barricade93Netherlands
- amirabbbbas
- balazs129
- briancaffeyixlayer
- bryetz
- bullbearbull
- CrisescodeFinv
- cytsai1008@HCHS-CSDC
- DevLuukLuzern, Switzerland
- dpresbit
- DylanSiegelGermany
- GitHub30Osaka, Japan
- huyangqiu
- itzanetatos@core-innovation
- ivlcicDropchop d.o.o.
- JMLogs
- josecohenca
- kedarpotdar-nvNVIDIA
- lapres
- maslychmISUE Lab, UCF
- mpvasilisGreece
- MuhtashamTU Munich
- OriginalSimonDeutschland, Hagen
- pabloiMontevideo, Uruguay
- pavankumargoliHyderabad,India
- PedroBrantesRio de Janeiro
- s7d1Toronto
- sharp-pixelAWS
- Slangnesatlanta
- srinivaspavan9University of Florida
- suresiva
- tkersey@thisisartium
- ucalyptusFrontenac County
- YHmaitiUnited States of America
- zhsf
- zilingzhangBoulder, CO