Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
aws-alb-route-directive-adapter-for-istio
aws-virtual-gpu-device-plugin
AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads
autoscaler
Autoscaling components for Kubernetes
bytedance-oss-template
Bytedance OSS Project Template
kubernetes
Production-Grade Container Scheduling and Management
MobileSecurity
Capstone Security Project (concentrate on PrivacyDetection, AntiTheft and AntiVirus)
serverless-research
Serverless Paper Reading and Discussion
kubeflow
Machine Learning Toolkit for Kubernetes
Jeffwan's Repositories
Jeffwan/HexGen
Serving LLMs on heterogeneous decentralized clusters.
Jeffwan/lorax
Serve 100s of Fine-Tuned LLMs in Production for the Cost of 1
Jeffwan/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Jeffwan/kubernetes
Production-Grade Container Scheduling and Management
Jeffwan/ai-comic-factory
Generate comic panels using a LLM + SDXL. Powered by Hugging Face 🤗
Jeffwan/ai-llm-research
Collect LLM Papers
Jeffwan/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Jeffwan/awesome-ai-playground
Jeffwan/babyagi
Jeffwan/containerd
An open and reliable container runtime
Jeffwan/DAIL-SQL
A efficient and effective few-shot NL2SQL method on GPT-4.
Jeffwan/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Jeffwan/enhancements
Enhancements tracking repo for Kubernetes
Jeffwan/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Jeffwan/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
Jeffwan/go-runc
runc bindings for Go
Jeffwan/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
Jeffwan/kind-with-gpus-examples
Jeffwan/kuberay
Contributed modules to ray
Jeffwan/langchain
⚡ Building applications with LLMs through composability ⚡
Jeffwan/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
Jeffwan/LLMFlow
Easy, Fast, Secure and Cost-Efficient LLM Pipelines to generate GhatGPT-like private domain models and knowledgeable agents for your organization.
Jeffwan/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Jeffwan/MOSS
MOSS is an artificial intelligence character in the movie "The Wandering Earth." s, enabling it to autonomously analyze, infer, and make decisions.
Jeffwan/OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
Jeffwan/runc
CLI tool for spawning and running containers according to the OCI specification
Jeffwan/spegel
Stateless cluster local OCI registry mirror.
Jeffwan/tiktok-opensdk-ios
The TikTok OpenSDK features Login Kit and Share Kit which allow your users to log in using their TikTok account and share content from your app to TikTok.
Jeffwan/veTurboIO
A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.
Jeffwan/website
Kubernetes website and documentation repo: