Pinned Repositories
distributed-inference-llm
Serve Llama 2 (7B/13B/70B) Large Language Models efficiently at scale by leveraging heterogeneous Dell™ PowerEdge™ Rack servers in a distributed manner.
generative-ai
This repository contains example notebooks and other helpful resources for generative AI workloads running on Dell infrastructure both on prem or in the public cloud. Resources in this repository are for demo purposes only.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Scalers AI's Repositories
Github-Scalers-AI/distributed-inference-llm
Serve Llama 2 (7B/13B/70B) Large Language Models efficiently at scale by leveraging heterogeneous Dell™ PowerEdge™ Rack servers in a distributed manner.
Github-Scalers-AI/generative-ai
This repository contains example notebooks and other helpful resources for generative AI workloads running on Dell infrastructure both on prem or in the public cloud. Resources in this repository are for demo purposes only.
Github-Scalers-AI/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs