zTaoplus

Hangzhou

Pinned Repositories

argoexec
argoexec:
Language:Dockerfile0 1 00
candle
Minimalist ML framework for Rust
Language:Rust00
ci-pipeline
ci-pipeline
Language:Python00
codebox-api
CodeBox is the simplest cloud infrastructure for your LLM Apps and Services.
Language:Python00
codeinterpreter-api
Open source implementation of the ChatGPT Code Interpreter 👾
Language:Python0 0 00
container-images
Common container images
Language:Dockerfile0 0 00
langfuse
🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Language:TypeScript10
QiZhenMedicalExpert
Language:Python1 0 00
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python2 0 00
zTaoplus
I know you know what I mean..
2 2 00

zTaoplus's Repositories

zTaoplus/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python2 0 00
zTaoplus/zTaoplus
I know you know what I mean..
2 2 00
zTaoplus/langfuse
🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Language:TypeScript10
zTaoplus/candle
Minimalist ML framework for Rust
Language:Rust00
zTaoplus/ci-pipeline
ci-pipeline
Language:Python00
zTaoplus/codebox-api
CodeBox is the simplest cloud infrastructure for your LLM Apps and Services.
Language:Python00
zTaoplus/codeinterpreter-api
Open source implementation of the ChatGPT Code Interpreter 👾
Language:Python0 0 00
zTaoplus/container-images
Common container images
Language:Dockerfile0 0 00
zTaoplus/enterprise_gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Language:Python0 0 00
zTaoplus/guidance
A guidance language for controlling large language models.
Language:Jupyter Notebook0 0 00
zTaoplus/ESFT
Expert Specialized Fine-Tuning
zTaoplus/fastmoe
A fast MoE impl for PyTorch
zTaoplus/image-mirror
mirror unreachable images
Language:Dockerfile0 0
zTaoplus/inference-framework-benchmark
Benchmark for various inference frameworks
Language:Jupyter Notebook
zTaoplus/jupyter-images
Kubeflow Jupyter images
Language:Dockerfile0 0
zTaoplus/langchain
🦜🔗 Build context-aware reasoning applications
zTaoplus/litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
Language:Python0 0
zTaoplus/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python0 0
zTaoplus/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Dockerfile1 0
zTaoplus/Megatron-LM
Ongoing research training transformer models at scale
zTaoplus/mindsdb
MindsDB connects AI models to real time data
Language:Python0 0
zTaoplus/mirrored-image
Language:Python1 0
zTaoplus/Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Language:Python0 1
zTaoplus/pybox
Language:Python
zTaoplus/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Language:Jupyter Notebook0 0
zTaoplus/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Language:Python0 0
zTaoplus/tablegpt-agent
A pre-built agent for TableGPT2.
Language:Python
zTaoplus/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++0 0
zTaoplus/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language:Python0 0
zTaoplus/text-generation-inference
Large Language Model Text Generation Inference
Language:Python