Pinned Repositories
mergekit
Tools for merging pretrained large language models.
character-tokenizer
A character tokenizer for Hugging Face Transformers
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
misdelivery's Repositories
misdelivery/character-tokenizer
A character tokenizer for Hugging Face Transformers
misdelivery/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
misdelivery/AI-Scientist
Expand the AI Scientist's capabilities to independently design and execute experiments from your own idea.
misdelivery/tanuki-8x8b-replicate
misdelivery/test
misdelivery/yukkuri_voice_api