Pinned Repositories
100-Days-Of-ML-Code
100-Days-Of-ML-Code中文版
2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement
2019CCF-BDCI大赛 最佳创新探索奖获得者 基于OCR身份证要素提取赛题冠军 天晨破晓团队 赛题源码
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
AL-cblue
这是参加阿里天池比赛的baseline
awesome-knowledge-graph
整理知识图谱相关学习资料
benchmarking-gnns
Repository for benchmarking graph neural networks
handwritten-text-recognition-for-apache-mxnet
This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.
TableCell
在TableBank的基础上,进一步标注到单元格精度,利用目标检测/分割实现单元格定位。
Jadentan's Repositories
Jadentan/Anima
第一个开源的基于QLoRA的33B中文大语言模型First QLoRA based open source 33B Chinese LLM
Jadentan/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Jadentan/BioGPT
Jadentan/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Jadentan/ChatGLM-6B
ChatGLM-6B:开源双语对话语言模型
Jadentan/Chatglm_lora_multi-gpu
chatglm多gpu用deepspeed和
Jadentan/ChatGPT-Hub
Jadentan/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Jadentan/ChineseNLPCorpus
中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
Jadentan/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model
Jadentan/CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Jadentan/ColossalAI
Making big AI models cheaper, easier, and scalable
Jadentan/CPM-Bee
百亿参数的中英文双语基座大模型
Jadentan/FLAN
Jadentan/Linly
Chinese-LLaMA基础模型;ChatFlow中文对话模型;NLP预训练/指令微调数据集
Jadentan/llama-int8
Quantized inference code for LLaMA models
Jadentan/llama.cpp
Port of Facebook's LLaMA model in C/C++
Jadentan/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Jadentan/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Jadentan/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Jadentan/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Jadentan/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Jadentan/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Jadentan/shu
中文书籍收录整理, Collection of Chinese Books
Jadentan/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Jadentan/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Jadentan/TigerBot
TigerBot: A multi-language multi-task LLM
Jadentan/trl
Train transformer language models with reinforcement learning.
Jadentan/WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
Jadentan/Yuan-1.0
Yuan 1.0 Large pretrained LM