ruiqianheartseed's Stars
YancyKahn/CoA
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
Tencent/Tencent-Hunyuan-Large
DUT-lujunyu/ToxiCN_MM
The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).
DUT-lujunyu/ToxiCN
The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark" (ACL2023).
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
THUzhangga/NMSL
Abstraction your words——never mind the scandal and liber
CLUEbenchmark/SuperCLUE-Safety
SC-Safety: 中文大模型多轮对抗安全基准
whitzard-ai/jade-db
"他山之石、可以攻玉":复旦白泽智能发布面向国内开源和国外商用大模型的Demo数据集JADE-DB
sylinrl/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
GuyTevet/motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
cby-pku/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
google-research/t5x
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
google-research/bert
TensorFlow code and pre-trained models for BERT
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
lucidrains/x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
Cheneng/DPCNN
Deep Pyramid Convolutional Neural Networks for Text Categorization in PyTorch
run-llama/rags
Build ChatGPT over your data, all with natural language
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
NVIDIA/ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
alisen39/TrWebOCR
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
DayBreak-u/chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
myhub/tr
Free Offline OCR 离线的中文文本检测+识别SDK
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.