LLM simple serving (tensor model parallel, pubsub, grpc)
Primary LanguagePythonMIT LicenseMIT
No one’s watching this repository yet.