Pinned Repositories
llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
llms
Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnxruntime-genai
Generative AI extensions for onnxruntime
trt-llm-as-openai-windows
This reference can be used with any existing OpenAI-integrated app to run TRT-LLM inference locally on a GeForce GPU on Windows instead of in the cloud.
ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
anujj's Repositories
anujj/llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
anujj/llms
anujj/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
anujj/onnxruntime-genai
Generative AI extensions for onnxruntime
anujj/trt-llm-as-openai-windows
This reference can be used with any existing OpenAI-integrated app to run TRT-LLM inference locally on a GeForce GPU on Windows instead of in the cloud.