AndyShaw01's Stars
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
karpathy/llm.c
LLM training in simple, raw C/CUDA
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
guanzhi/GmSSL
支持国密SM2/SM3/SM4/SM9/SSL的密码工具箱
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
microsoft/torchscale
Foundation Architecture for (M)LLMs
Clouditera/SecGPT
SecGPT网络安全大模型
facebookresearch/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
ymcui/Chinese-Mixtral
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
WangRongsheng/Aurora
The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
liu00222/Open-Prompt-Injection
This repository provides implementation to formalize and benchmark Prompt Injection attacks and defenses
PKU-YuanGroup/ProLLaMA
A Protein Large Language Model for Multi-Task Protein Language Processing
jmhIcoding/traffic_classification_utils
网络流量分类对比方法汇总
zjunlp/KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
sleeepeer/PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
RazorMegrez/VulHawk
This is the official repository for VulHawk.
SheltonLiu-N/Universal-Prompt-Injection
The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".
AndyShaw01/CrossModal-HomeWork
Record homework for UCAS's cross-modal course
qingqingkk/Clip_crossmodal_retrieval
CLIP Crossmodal retrieval with moscoco and flickr for zero-shot and fine-tune