Scalers AI

Pinned Repositories

distributed-inference-llm
Serve Llama 2 (7B/13B/70B) Large Language Models efficiently at scale by leveraging heterogeneous Dell™ PowerEdge™ Rack servers in a distributed manner.
Language:Python71
generative-ai
This repository contains example notebooks and other helpful resources for generative AI workloads running on Dell infrastructure both on prem or in the public cloud. Resources in this repository are for demo purposes only.
Language:Jupyter Notebook10
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C00

Github-Scalers-AI/distributed-inference-llm
Serve Llama 2 (7B/13B/70B) Large Language Models efficiently at scale by leveraging heterogeneous Dell™ PowerEdge™ Rack servers in a distributed manner.
Language:Python71
Github-Scalers-AI/generative-ai
This repository contains example notebooks and other helpful resources for generative AI workloads running on Dell infrastructure both on prem or in the public cloud. Resources in this repository are for demo purposes only.
1
Github-Scalers-AI/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C