yangqqq-yq

yangqqq-yq's Stars

feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Language:Python34924
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Language:Python1.3k116
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Language:Python49427
microsoft/LongRoPE
LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.
Language:Python10010
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
18.7k1.5k
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python4k428
THUDM/LongBench
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Language:Python65653
owenliang/qwen-vllm
通义千问VLLM推理部署DEMO
Language:Python43665
AmruthPillai/Reactive-Resume
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
Language:TypeScript23.5k2.5k
qiuyinco/qiuyin.co
蚯蚓机场最新网址，蚯蚓加速器最新网址，v2ray机场，免费节点每日更新
101
ZhuiyiTechnology/roformer
Rotary Transformer
Language:Python81150
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python4.1k366
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python135k26.9k
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
Language:Jupyter Notebook2.1k596
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.9k4.5k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python33.8k4.2k
Hello-SimpleAI/chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Language:Python1.3k120
beyondguo/LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Language:HTML96799
zsy-code/LLM-study
Language:Python2
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language:Python5.3k361
hushuguo/awesome-time-series-papers
This repository offers a collection of recent time series research papers, including forecasting, anomaly detection and so on , with links to code and resources.
152
mli/paper-reading
深度学习经典、新论文逐段精读
27k2.4k
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Language:Python13.7k1.8k
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
Language:Python7.1k725
chatchat-space/Langchain-Chatchat
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Language:TypeScript31.9k5.6k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python18.8k1.8k
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.6k149
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10k1.3k
chroma-core/chroma
the AI-native open-source embedding database
Language:Rust15.3k1.3k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python29.7k4.5k