Jeffwan

Software Engineer @ Bytedance

BytedanceSeattle, WA

Pinned Repositories

spark
Apache Spark - A unified analytics engine for large-scale data processing
Language:Scala40k 2k 028.3k
aws-alb-route-directive-adapter-for-istio
Language:Go3 48 23
aws-virtual-gpu-device-plugin
AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads
Language:Jupyter Notebook203 53 2031
autoscaler
Autoscaling components for Kubernetes
Language:Go1 3 00
bytedance-oss-template
Bytedance OSS Project Template
Language:Shell2 3 21
kubernetes
Production-Grade Container Scheduling and Management
Language:Go0 3 00
MobileSecurity
Capstone Security Project (concentrate on PrivacyDetection, AntiTheft and AntiVirus)
Language:Java3 5 43
serverless-research
Serverless Paper Reading and Discussion
36 3 01
kubeflow
Machine Learning Toolkit for Kubernetes
Language:TypeScript14.4k 362 3.8k2.4k

Jeffwan's Repositories

Jeffwan/HexGen
Serving LLMs on heterogeneous decentralized clusters.
Language:Python1 1 0
Jeffwan/lorax
Serve 100s of Fine-Tuned LLMs in Production for the Cost of 1
Language:Python1 1 0
Jeffwan/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python1 1 02
Jeffwan/kubernetes
Production-Grade Container Scheduling and Management
Language:Go0 3 00
Jeffwan/ai-comic-factory
Generate comic panels using a LLM + SDXL. Powered by Hugging Face 🤗
Language:TypeScript1 0
Jeffwan/ai-llm-research
Collect LLM Papers
2 0
Jeffwan/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook1 0
Jeffwan/awesome-ai-playground
Language:Makefile2 0
Jeffwan/babyagi
Language:Python1 0
Jeffwan/containerd
An open and reliable container runtime
Language:Go1 0
Jeffwan/DAIL-SQL
A efficient and effective few-shot NL2SQL method on GPT-4.
Language:Python1 0
Jeffwan/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Language:Python1 0
Jeffwan/enhancements
Enhancements tracking repo for Kubernetes
Language:Go1 0
Jeffwan/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Language:Python1 0
Jeffwan/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
Language:Python1 0
Jeffwan/go-runc
runc bindings for Go
Language:Go1 0
Jeffwan/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
Language:Python1 0
Jeffwan/kind-with-gpus-examples
Language:Go0 0
Jeffwan/kuberay
Contributed modules to ray
Language:Go2 0
Jeffwan/langchain
⚡ Building applications with LLMs through composability ⚡
Language:Python1 0
Jeffwan/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
Language:Python0 0
Jeffwan/LLMFlow
Easy, Fast, Secure and Cost-Efficient LLM Pipelines to generate GhatGPT-like private domain models and knowledgeable agents for your organization.
1 0
Jeffwan/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Language:Python1 0
Jeffwan/MOSS
MOSS is an artificial intelligence character in the movie "The Wandering Earth." s, enabling it to autonomously analyze, infer, and make decisions.
2 0
Jeffwan/OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
Language:Python1 0
Jeffwan/runc
CLI tool for spawning and running containers according to the OCI specification
Language:Go1 0
Jeffwan/spegel
Stateless cluster local OCI registry mirror.
Language:Go1 0
Jeffwan/tiktok-opensdk-ios
The TikTok OpenSDK features Login Kit and Share Kit which allow your users to log in using their TikTok account and share content from your app to TikTok.
Language:Swift1 0
Jeffwan/veTurboIO
A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.
Language:Python0 0
Jeffwan/website
Kubernetes website and documentation repo:
Language:HTML1 01