rkshuai

Pinned Repositories

.tmux
Oh My Tmux! My pretty + versatile tmux configuration that just works (imho the best tmux configuration)
0 2 00
2019-CCF-BDCI-OCR-MCZJ-fake_data_generator
2019CCF-BDCI大赛 OCR赛题第一名天晨破晓团队仿真数据生成方案源码
Language:Python0 1 00
activityrecognition
Information about activity recognition
Language:MATLAB0 2 00
AlphaTree-graphic-deep-neural-network
将深度神经网络中的一些模型进行统一的图示，便于大家对模型的理解
1 3 02
chinese-ocr
运用tensorflow实现自然场景文字检测,keras/pytorch实现crnn+ctc实现不定长中文OCR识别
Language:Python4 3 00
chromium-for-android-56-debug-video
Language:Java4 3 01
chromium_org
android5.0的chromium源码
Language:C++4 2 13
pose-residual-network
Code for 'MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network' paper
1 2 00
st-gcn
Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch
Language:Python1 2 00
TIES_DataGeneration
Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)
Language:Python1 1 00

rkshuai's Repositories

rkshuai/chromium_org
android5.0的chromium源码
Language:C++4 2 13
rkshuai/TIES_DataGeneration
Dataset Generation Code for: S.R. Qasim, H. Mahmood, and F. Shafait, Rethinking Table Parsing using Graph Neural Networks (2019)
Language:Python1 1 00
rkshuai/Awesome-LLMs-on-device
Awesome LLMs on Device: A Comprehensive Survey
rkshuai/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0
rkshuai/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
0 0
rkshuai/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.
Language:Python0 0
rkshuai/BlueLM
Language:Python0 0
rkshuai/CapsFusion
CapsFusion: Rethinking Image-Text Data at Scale
0 0
rkshuai/chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & more LLMs
Language:C++0 0
rkshuai/Dewarping-Document-Image-By-Displacement-Flow-Estimation
Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network
Language:Python1 0
rkshuai/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Language:Python0 0
rkshuai/Document-Dewarping-with-Control-Points
Language:Python0 0
rkshuai/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Language:Python0 0
rkshuai/llama.cpp
Port of Facebook's LLaMA model in C/C++
Language:C0 0
rkshuai/minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
Language:C++0 0
rkshuai/MMBench
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
0 0
rkshuai/movenet
Un-official implementation of MoveNet from Google
Language:Python1 0
rkshuai/prompt-to-prompt
Language:Jupyter Notebook0 0
rkshuai/seq2seq-ocr-analysis
end2end layout analysis based seq2seq
Language:Python1 0
rkshuai/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook0 0
rkshuai/stable-diffusion-webui
Stable Diffusion web UI
Language:Python0 0
rkshuai/stablediffusion-infinity
Outpainting with Stable Diffusion on an infinite canvas
Language:Python0 0
rkshuai/TaiSu
TaiSu（太素）--a large-scale Chinese multimodal dataset（亿级大规模中文视觉语言预训练数据集）
Language:Python0 0
rkshuai/Text2Poster-ICASSP-22
Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"
Language:Python0 0
rkshuai/torch-fidelity
High-fidelity performance metrics for generative models in PyTorch
Language:Python0 0
rkshuai/VisCPM
Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
Language:Python0 0
rkshuai/visual-chatgpt
VisualChatGPT
Language:Python0 0
rkshuai/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Language:Python
rkshuai/waveCorrection
OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正
Language:Python1 0
rkshuai/yapf
A formatter for Python files
Language:Python1 0