leminhyen2's Stars
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
ultralytics/ultralytics
Ultralytics YOLO11 🚀
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
vadimdemedes/ink
🌈 React for interactive command-line apps
danielgatis/rembg
Rembg is a tool to remove images background
PaddlePaddle/PaddleSeg
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
rhasspy/piper
A fast, local neural text to speech system
berstend/puppeteer-extra
💯 Teach puppeteer new tricks through plugins.
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
CjangCjengh/MoeGoe
Executable file for VITS inference
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
WXinlong/SOLO
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.
ZrrSkywalker/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
justfoolingaround/animdl
A highly efficient, fast, powerful and light-weight anime downloader and streamer for your favorite anime.
FORTH-ModelBasedTracker/MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance
killkimno/MORT
MORT 번역기 프로젝트 - Real-time game translator with OCR
LiyuanLucasLiu/Transformer-Clinic
Understanding the Difficulty of Training Transformers
CarlosNZ/json-edit-react
React component for editing/viewing JSON/object data
rampaa/JL
JL is a program for looking up Japanese words and expressions.
lattas/AvatarMe
Public repository for the CVPR 2020 paper AvatarMe and the TPAMI 2021 AvatarMe++
MendelXu/zsseg.baseline
Open-vocabulary Semantic Segmentation
nathanielfernandes/imagetext-py
A blazing fast text drawing library
microsoft/admin-torch
Understanding the Difficulty of Training Transformers
VoxelCubes/DeepQt
DeepL API front-end using Qt
microsoft/deepnmt
ogkalu2/Human-parity-on-machine-translations
Bilingual (or Multilingual) Large Language models and In-context Learning- The key to human parity on machine translations
danni-cool/node-gtag4
A lightweight, browser api free, Node.js version of the Google Analytics SDK, the data reporting is accepted by Google Analytics 4.