LinxinS97's Stars
ZitongYang/Synthetic_Continued_Pretraining
Code implementation of synthetic continued pretraining
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
THUDM/CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
THUDM/NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)
autogen-ai/autogen
A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
tmgthb/Autonomous-Agents
Autonomous Agents (LLMs) research papers. Updated Daily.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
pzzhang/VinVL
project page for VinVL
microsoft/scene_graph_benchmark
image scene graph generation benchmark
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
LZY-the-boys/Twin-Merging
[NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
teacherpeterpan/Logic-LLM
The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning"
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
OrangeInSouth/DeePEn
A method of ensemble learning for heterogeneous large language models.
alexa/alexa-apis-for-python
The Alexa APIs for Python consist of Python classes that represent the request and response JSON of Alexa services. These models act as a core dependency for the Alexa Skills Kit Python SDK (https://github.com/alexa/alexa-skills-kit-sdk-for-python).
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
LLaVA-VL/LLaVA-NeXT
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
JieyuZ2/TaskMeAnything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
xianshang33/llm-paper-daily
Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please leave a 🌟
InfiAgent/InfiAgent
meta-llama/llama3
The official Meta Llama 3 GitHub site
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.