currylym

关注自然语言处理、时间序列预测和多模态模型。从事过CTR预估、搜索和知识图谱项目。对推荐系统，信息检索、LLM应用感兴趣。

BUAA北京

currylym's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python37.9k 220 5.7k4.7k
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language:Python37.2k 422 2.2k5.4k
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Language:Python26.2k 315 2663.3k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21.1k 158 1.6k2.3k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python15.3k 112 1.1k1.2k
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook15.3k 112 4211.4k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python15k 123 1.2k1.4k
Mikoto10032/DeepLearning
深度学习入门教程, 优秀文章, Deep Learning Tutorial
Language:Jupyter Notebook14.9k 305 143.6k
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Language:Jupyter Notebook14.6k 264 1052k
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python8.2k 51 1.1k599
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
Language:TypeScript7.9k 55 671k
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7.3k 78 224464
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
7.1k 138 14421
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.7k 66 84375
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python6.2k 75 5461.1k
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python6k 54 281536
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
Language:Python3.8k 55 141285
ztxz16/fastllm
纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行
Language:C++3.4k 44 368348
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文，开源数据&模型，AIGC应用
2.8k 63 2283
Paitesanshi/LLM-Agent-Survey
2.7k 74 21154
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language:Python2.7k 13 173278
Eladlev/AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
Language:Python2.3k 12 44202
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Language:Python2.1k 28 166210
Link-AGI/AutoAgents
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
Language:Python1.3k 24 27154
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数，训练数据，评估数据，评估方法。
Language:Python1.2k 24 63110
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
Language:Python683 5 9386
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
628 12 455
OpenGVLab/LAMM
[NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents
Language:Python306 9 4417
OrionStarAI/OrionStar-Yi-34B-Chat
OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型，由猎户星空基于Yi-34B开源模型、使用15W+高质量语料微调而成。
Language:Python258 5 528
lyogavin/Anima
Moved to here: https://github.com/lyogavin/airllm
9 2 03

currylym

currylym's Stars

hiyouga/LLaMA-Factory

microsoft/autogen

OpenBMB/ChatDev

haotian-liu/LLaVA

QwenLM/Qwen

KindXiaoming/pykan

Dao-AILab/flash-attention

Mikoto10032/DeepLearning

AI4Finance-Foundation/FinGPT

FlagOpen/FlagEmbedding

leptonai/search_with_lepton

OpenBMB/MiniCPM

WooooDyy/LLM-Agent-Paper-List

mit-han-lab/streaming-llm

microsoft/DeepSpeedExamples

yangjianxin1/Firefly

microsoft/LMOps

ztxz16/fastllm

DSXiangLi/DecryptPrompt

Paitesanshi/LLM-Agent-Survey

dvlab-research/LongLoRA

Eladlev/AutoPrompt

intel/intel-extension-for-transformers

Link-AGI/AutoAgents

SkyworkAI/Skywork

IEIT-Yuan/Yuan-2.0

lafmdp/Awesome-Papers-Autonomous-Agent

OpenGVLab/LAMM

OrionStarAI/OrionStar-Yi-34B-Chat

lyogavin/Anima