ahxgw's Stars
ggerganov/llama.cpp
LLM inference in C/C++
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
karpathy/llm.c
LLM training in simple, raw C/CUDA
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
hua1995116/awesome-ai-painting
AI绘画资料合集(包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等) Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
LLM-Red-Team/kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling researches for code and related datasets.
hua1995116/indiehackers-steps
《独立开发者的艺术》打造最全的独立开发者指南,一人公司。
ibm-granite/granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
xhluca/bm25s
Fast lexical search library implementing BM25 in Python using Numpy and Scipy
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
QwenLM/CodeQwen1.5
CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
microsoft/MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
Strivin0311/long-llms-learning
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
TXH-mercury/VAST
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
cohere-ai/magikarp
facebookresearch/ssl-data-curation
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
ShayanTalaei/CHESS
Contextual Harnessing for Efficient SQL Synthesis
naver/bergen
Benchmarking library for RAG
AlibabaResearch/HLATR
Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking
rayliuca/T-Ragx
Enhancing Translation with RAG-Powered Large Language Models
OhadRubin/EPR
KaiLv69/UDR
ACL'23: Unified Demonstration Retriever for In-Context Learning