sagorbrur's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs
AgentOps-AI/agentops
Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks, including CrewAI, LangChain, and AutoGen.
mistralai/mistral-finetune
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
lmmlzn/Awesome-LLMs-Datasets
A summary of existing representative LLM text datasets.
BatsResearch/bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
alopatenko/LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use cases, promote the adoption of best practices in LLM assessment, and critically assess the effectiveness of these evaluation methods.
VishnuPJ/MalayaLLM
A 7B Llama-2 Indic model for the Malayalam language, continually pretrained and fine-tuned with LoRA.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
meta-llama/llama3
The official Meta Llama 3 GitHub site
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
anthropics/anthropic-tokenizer-typescript
Open-Speech-EkStep/indic-punct
AI4Bharat/IndicLLMSuite
A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
VirusProton/Awesome_Bangla_Datasets
Awesome Bangla Datasets
hjian42/CommunityLM
[COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
microsoft/promptbench
A unified evaluation framework for large language models
stanford-oval/WikiChat
WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.
Zhen-Tan-dmml/LLM4Annotation
LargeWorldModel/LWM
lhao499/ringattention
Transformers with Arbitrarily Large Context
sazzadcsedu/Bangla-Vulgar-Lexicon
A list of Bengali vulgar words
qcri/LLMeBench
Benchmarking Large Language Models