Pinned Repositories
rag_requirements
TensorRT-LLM-jais
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
vllm-jais
A high-throughput and memory-efficient inference and serving engine for LLMs
JaisChat4Pets
Fully-featured, beautiful web interface for vLLM - built with NextJS.
MLOpsengineerAssignment
Assignment for Sr. MLOps Engineer candidates to assess skills in Kubernetes, Docker, CI/CD pipelines, and cloud services.
news_crawlers
nlp_assignment_1
nlp_assignment_2
nlp_assignment_3
SrMLOpsAssignment
Assignment for Sr. MLOps Engineer candidates to assess skills in Kubernetes, Docker, CI/CD pipelines, and cloud services.
grandiose-pizza's Repositories
grandiose-pizza/news_crawlers
grandiose-pizza/JaisChat4Pets
Fully-featured, beautiful web interface for vLLM - built with NextJS.
grandiose-pizza/SrMLOpsAssignment
Assignment for Sr. MLOps Engineer candidates to assess skills in Kubernetes, Docker, CI/CD pipelines, and cloud services.
grandiose-pizza/MLOpsengineerAssignment
Assignment for Sr. MLOps Engineer candidates to assess skills in Kubernetes, Docker, CI/CD pipelines, and cloud services.
grandiose-pizza/rag_requirements
grandiose-pizza/TensorRT-LLM-jais
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
grandiose-pizza/vllm-jais
A high-throughput and memory-efficient inference and serving engine for LLMs
grandiose-pizza/nlp_assignment_3
grandiose-pizza/nlp_assignment_2
grandiose-pizza/nlp_assignment_1