beargolden

My research interests are document analysis and recognition, scene text detection and recognition, computer vision and pattern recognition.

Hubei University of TechnologyWuhan 430068, P. R. China

beargolden's Stars

3b1b/manim
Animation engine for explanatory math videos
Language:Python62.9k5.8k
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
Language:Python21.4k1.6k
convdepth/ConvDepth
ConvDepth: Self-Supervised Monocular Depth Estimation for Autonomous Driving
Language:Python1
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python3.4k281
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python6.8k523
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language:Python1.2k83
zcablii/LSKNet
(ICCV 2023) Large Selective Kernel Network for Remote Sensing Object Detection
Language:Python44036
assafelovic/gpt-researcher
LLM based autonomous agent that does online comprehensive research on any given topic
Language:Python14.3k1.9k
zbezj/HEU_KMS_Activator
28.7k3k
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
Language:Python1.9k243
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python31.8k3.9k
zhaominyiz/STIRER
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
Language:Python122
MagicMirrorOrg/MagicMirror
MagicMirror² is an open source modular smart mirror platform. With a growing list of installable modules, the MagicMirror² allows you to convert your hallway or bathroom mirror into your personal assistant.
Language:JavaScript19.7k4.2k
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript75.4k58.9k
NITR098/Awesome-U-Net
Official repo for Medical Image Segmentation Review: The Success of U-Net
Language:Jupyter Notebook26237
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.2k657
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python5.4k549
wzpan/wukong-robot
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。
Language:Python6.3k1.3k
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.7k3.4k
xszyou/fay-ue5
可对接fay数字人的ue5工程
44191
NEU-Gou/awesome-reid-dataset
Collection of public available person re-identification datasets
896155
RSL-NEU/person-reid-benchmark
A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets
Language:HTML19575
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python64.5k8k
GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language:Python15.2k2.3k
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Language:Python40.5k5.2k
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
9k1.8k
szad670401/HyperLPR
基于深度学习高性能中文车牌识别 High Performance Chinese License Plate Recognition Framework.
Language:C++5.7k2k
YimianDai/open-atac
code and trained models for "Attention as Activation"
Language:Python194
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language:Python11.3k1.9k
Fangyh09/pytorch-receptive-field
Compute CNN receptive field size in pytorch in one line
Language:Python34857

beargolden

beargolden's Stars

3b1b/manim

ManimCommunity/manim

convdepth/ConvDepth

DepthAnything/Depth-Anything-V2

LiheYoung/Depth-Anything

0nutation/SpeechGPT

zcablii/LSKNet

assafelovic/gpt-researcher

zbezj/HEU_KMS_Activator

stanford-crfm/helm

hiyouga/LLaMA-Factory

zhaominyiz/STIRER

MagicMirrorOrg/MagicMirror

ChatGPTNextWeb/ChatGPT-Next-Web

NITR098/Awesome-U-Net

modelscope/FunASR

rany2/edge-tts

wzpan/wukong-robot

XingangPan/DragGAN

xszyou/fay-ue5

NEU-Gou/awesome-reid-dataset

RSL-NEU/person-reid-benchmark

binary-husky/gpt_academic

GaiZhenbiao/ChuanhuChatGPT

THUDM/ChatGLM-6B

xszyou/Fay

szad670401/HyperLPR

YimianDai/open-atac

xmu-xiaoma666/External-Attention-pytorch

Fangyh09/pytorch-receptive-field