peiji1981's Stars
xai-org/grok-1
Grok open release
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
rspeer/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
aixcoder-plugin/aiXcoder-7B
official repository of aiXcoder-7B Code Large Language Model
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
nexB/scancode-toolkit
:mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ... to discover and inventory open source and third-party packages used in your code. Sponsored by NLnet project https://nlnet.nl/project/vulnerabilitydatabase, the Google Summer of Code, Azure credits, nexB and others generous sponsors!
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
OpenGenerativeAI/llm-colosseum
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
huggingface/text-clustering
Easily embed, cluster and semantically label text datasets
huggingface/cosmopedia
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
microsoft/PyCodeGPT
A pre-trained GPT model for Python code completion and generation
shuyanzhou/docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
mediawiki-client-tools/mediawiki-dump-generator
Python 3 tools for downloading and preserving wikis
rjzhb/slimpajama-cpp
C++ version implementation of slimpajama