YOYO-git-78's Stars
liu673/Awesome-LLM4Security
This project aims to consolidate and share high-quality resources and tools across the cybersecurity domain.
yangjianxin1/Firefly
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Clouditera/SecGPT
SecGPT网络安全大模型
secureIT-project/CVEfixes
CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
SonarSource/sonarqube
Continuous Inspection
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
ycdxsb/PocOrExp_in_Github
Automatically Collect POC or EXP from GitHub by CVE ID. If you are unable to find the POC/EXP on GitHub, you can also check here: https://pocorexps.nsa.im/
Eshe0922/ReposVul
[ICSE 2024 Industry Challenge Track] Official implementation of "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".
DerrickXuNu/dvrl_pytorch
Data Valuation using Reinforcement Learning Pytorch Version
ntgiang71096/vfdetector
ICL-ml4csec/VulBERTa
Simplified Source Code Pre-Training for Vulnerability Detection
AppFlowy-IO/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
jingxuyy/PDATC-NCPMKL
PDATC-NCPMKL: Predicting drug’s Anatomical Therapeutic Chemical (ATC) codes based on network consistency projection and multiple kernel learning In this study, a new recommendation system (named PDATC-NCPMKL), which incorporated network consistency projection and multi-kernel learning, was designed to identify drug-ATC code associations.
shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
wellinxu/nlp_store
NLP杂货铺,python实现各种算法/工具,辅助算法理解/应用
microsoft/AI-System
System for AI Education Resource.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
MorvanZhou/NLP-Tutorials
Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
reorx/awesome-chatgpt-api
Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota.
yihong0618/xiaogpt
Play ChatGPT and other LLM with Xiaomi AI Speaker
ZiyaoGeng/RecLearn
Recommender Learning with Tensorflow2.x
whoiskatrin/sql-translator
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
CompVis/stable-diffusion
A latent text-to-image diffusion model
chenfei-wu/TaskMatrix
shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。