Pinned Repositories
lm-evaluation-harness
A framework for few-shot evaluation of language models.
llama.cpp
LLM inference in C/C++
np_app_GenKBQA_inference
Generative knowledge-base Question and Answering (involves retrieval and generator (LLM))
np_app_hallucination_robust_rag_llm
Rtrieval-augmented generation with large language model robust to hallucination
openai-cookbook
Examples and guides for using the OpenAI API
retrieved_collection_compression_densephrase
Compress retrieved documents collection with densephrase model
torchtune
A Native-PyTorch Library for LLM Fine-tuning
PyCodeGPT
A pre-trained GPT model for Python code completion and generation
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
lifelongeeek's Repositories
lifelongeeek/np_app_GenKBQA_inference
Generative knowledge-base Question and Answering (involves retrieval and generator (LLM))
lifelongeeek/np_app_hallucination_robust_rag_llm
Rtrieval-augmented generation with large language model robust to hallucination
lifelongeeek/openai-cookbook
Examples and guides for using the OpenAI API
lifelongeeek/retrieved_collection_compression_densephrase
Compress retrieved documents collection with densephrase model
lifelongeeek/torchtune
A Native-PyTorch Library for LLM Fine-tuning