sagorbrur's Stars
sazzadcsedu/Bangla-Vulgar-Lexicon
A list of Bengali vulgar words
qcri/LLMeBench
Benchmarking Large Language Models
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
TeamTopologies/Team-API-template
A template for defining a Team API - as explained in the Team Topologies book
microsoft/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
tesseract-ocr/langdata
Source training data for Tesseract for lots of languages
explodinggradients/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
mistralai/mistral-inference
Official inference library for Mistral models
webrecorder/warcio
Streaming WARC/ARC library for fast web archive IO
tiangolo/uvicorn-gunicorn-fastapi-docker
Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python with performance auto-tuning.
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT, GPT
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
saiful9379/Bangla_TTS
Bangla TTS inference pipeline using VITS TTS
google/maxtext
A simple, performant and scalable JAX LLM!
huggingface/trl
Train transformer language models with reinforcement learning.
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
arc53/DocsGPT
GPT-powered chat for documentation, chat with your documents
patrickvonplaten/notebooks
Some notebooks for NLP
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mallorbc/Finetune_LLMs
Repo for fine-tuning Causal LLMs
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
meta-llama/codellama
Inference code for CodeLlama models
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
BengaliAI/BADLAD
BaDLAD: Bengali Document Layout Analysis Dataset