Pinned Repositories
djl
An Engine-Agnostic Deep Learning Framework in Java
model_analyzer
Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of Triton Inference Server models.
langflow
⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.
LLMLingua
Speeds up LLM inference and improves the model's perception of key information by compressing the prompt and KV cache, achieving up to 20x compression with minimal performance loss.
OpenDevin
🐚 OpenDevin: Code Less, Make More
captum
Model interpretability and understanding for PyTorch
llama-hub
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
josephykwang's Repositories
josephykwang/model_analyzer
Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of Triton Inference Server models.