jasonngap1

Singapore

Pinned Repositories

WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Language:Python3.6k 42 246491
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python4.2k 49 103484
anlp-oct24
This repository contains the code for the ANLP workshop held by Red Dragon AI on Oct 2024.
Language:Jupyter Notebook00
doc-qa-example
Language:Jupyter Notebook1 1 00
llm-agent-example
Language:Jupyter Notebook1 1 00
langgraph
Build resilient language agents as graphs.
Language:Python20.8k 122 1k3.7k
LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Language:Python5.6k 38 150329
TensorRT-LLM
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
Language:C++12.1k 120 3.1k1.9k
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language:Python10k 146 4.1k1.7k
tensorrtllm_backend
The Triton TensorRT-LLM Backend
905 20 561133

jasonngap1/doc-qa-example
Language:Jupyter Notebook1 1 00
jasonngap1/llm-agent-example
Language:Jupyter Notebook1 1 00
jasonngap1/anlp-oct24
This repository contains the code for the ANLP workshop held by Red Dragon AI on Oct 2024.
Language:Jupyter Notebook00