Pinned Repositories
MNBVC
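MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even "Martian script" (火星文) internet slang. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, lines of dialogue, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.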
faceRecognition
Face recognition using OpenCV and a CNN
guozhiyao.github.io
introduction-to-github
love.github.io
sam
SAM: Sharpness-Aware Minimization (PyTorch)
WanJuan1.0
WanJuan 1.0 multimodal corpus
MOSS
An open-source tool-augmented conversational language model from Fudan University
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
guozhiyao's Repositories
guozhiyao/faceRecognition
Face recognition using OpenCV and a CNN
guozhiyao/guozhiyao.github.io
guozhiyao/introduction-to-github
guozhiyao/love.github.io
guozhiyao/sam
SAM: Sharpness-Aware Minimization (PyTorch)