thesby

I am a graduate in University of Electronic Science and Technology of China. My research field is computer vision and deep learning.

thesby's Stars

KillianLucas/open-interpreter
A natural language interface for computers
Language:Python42.2k 317 7473.7k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python39.6k 431 9k7.4k
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Language:C16.5k 187 2131.9k
Sanster/lama-cleaner
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Language:Python15.1k 120 3361.6k
WongKinYiu/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Language:Jupyter Notebook12.9k 111 1.8k4.1k
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language:Python10.5k 135 1k1.9k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.4k 140 3131k
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
9.2k 285 441.5k
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
Language:HTML7.7k 107 438739
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Language:Jupyter Notebook7.4k 80 242799
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python7.1k 111 146408
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7k 75 382566
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Language:Python6.1k 68 1.6k417
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca
Language:C4.1k 58 244427
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Language:Python4k 40 345413
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
Language:Python2.8k 32 355437
cuifengcn/TAICHI-flet
基于flet的一款windows桌面应用，实现了浏览图片、音乐、小说、漫画、各种资源的功能。
Language:Python2.7k 21 62333
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！
Language:Jupyter Notebook2.5k 37 97235
deepdoctection/deepdoctection
A Repo For Document AI
Language:Python2.3k 16 169116
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
Language:Python2.2k 17 67277
deepcam-cn/yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931) ECCV Workshops 2022)
Language:Python2k 31 249483
Sanster/text_renderer
Generate text images for training deep learning ocr model
Language:Python1.3k 43 104381
michael-wzhu/Chinese-LlaMA2
Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版！（完全开源可商用）
Language:Python750 17 1151
HIT-SCIR/huozi
活字通用大模型
Language:Python304 13 1218
WenmuZhou/TableGeneration
通过浏览器渲染生成表格图像
Language:Python169 5 1234
zcswdt/Color_OCR_image_generator
Language:C++144 5 1039
cvlab-stonybrook/PaperEdge
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
Language:Python112 11 1722
Kamino666/watermark-tracer
一个基于可视水印检测识别的数字媒体溯源应用系统，是我的大作业项目，包含这个系统以及一个开源的大规模常见水印图像数据集（Large-scale Common Watermark Dataset, LCWD）。输入一个带有可视水印的图片或视频，系统会检测定位到水印所在的区域，然后将其提取出来，然后借助百度AI开放平台的OCR和logo识别以及Bing搜索引擎，溯源到这个图片或视频的源头。
Language:Python71 4 614
FutureRising007/Table_Structure_Recognition
Table Structure Recognition
33 2 02
c-chaitanya/language-identification
Code for Detecting language from text in python using fasttext
Language:Python123

thesby

thesby's Stars

KillianLucas/open-interpreter

PaddlePaddle/PaddleOCR

karpathy/llama2.c

Sanster/lama-cleaner

WongKinYiu/yolov7

serengil/deepface

facebookresearch/seamless_communication

brightmart/nlp_chinese_corpus

LianjiaTech/BELLE

advimman/lama

jzhang38/TinyLlama

ymcui/Chinese-LLaMA-Alpaca-2

skypilot-org/skypilot

Facico/Chinese-Vicuna

THUDM/VisualGLM-6B

shibing624/MedicalGPT

cuifengcn/TAICHI-flet

PhoebusSi/Alpaca-CoT

deepdoctection/deepdoctection

DLLXW/baby-llama2-chinese

deepcam-cn/yolov5-face

Sanster/text_renderer

michael-wzhu/Chinese-LlaMA2

HIT-SCIR/huozi

WenmuZhou/TableGeneration

zcswdt/Color_OCR_image_generator

cvlab-stonybrook/PaperEdge

Kamino666/watermark-tracer

FutureRising007/Table_Structure_Recognition

c-chaitanya/language-identification