glzbcrt
I am a solution architect at @microsoft helping partners develop intelligent and resilient solutions.
georgeluiz.com · Joinville - SC - Brazil
glzbcrt's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
golang-standards/project-layout
Standard Go Project Layout
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
locustio/locust
Write scalable load tests in plain Python 🚗💨
encode/httpx
A next generation HTTP client for Python. 🦋
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
GoogleCloudPlatform/generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
tpn/pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
docker/docker-py
A Python library for the Docker Engine API
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to run inference with any open-source language, speech recognition, or multimodal model, whether in the cloud, on-premises, or on your laptop.
meta-llama/llama-models
Utilities intended for use with Llama models.
ray-project/llm-numbers
Numbers every LLM developer should know
microsoft/ProcMon-for-Linux
A Linux version of the Procmon Sysinternals tool
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
chiphuyen/aie-book
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
IST-DASLab/marlin
FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups at batch sizes up to 16-32 tokens.
fabioz/PyDev.Debugger
Sources for the debugger used in PyDev, PyCharm and VSCode Python
microsoft/SemanticKernelCookBook
A Semantic Kernel book for beginners
Zefan-Cai/Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
blobfile/blobfile
Read Google Cloud Storage, Azure Blobs, and local paths with the same interface
microsoft/devex-unlocked
Content used in DevEx Unlocked in the LATAM region
oaviles/hello_appconfiguration
Reference implementation of Azure App Configuration: build and deploy a web app with feature-flag support on Azure Kubernetes Service, following DevSecOps practices