Pinned Repositories
albert-chinese-ner
Chinese NER using the pretrained language model ALBERT
amazon-sagemaker-visual-search
This repository accompanies a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon Elasticsearch Service.
deepctr_sagemaker
Train and deploy DeepCTR on Amazon SageMaker, using BYOC and BYOS
gluonts_sagemaker
Train and deploy GluonTS on Amazon SageMaker, using BYOC and BYOS
jittor_sagemaker
RobustPalmprintROI
A robust algorithm for palmprint ROI extraction in complex environments
table_structure_recognition
Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, achieving results comparable to (or even better than) Table Transformer (TATR) with smaller models.
THACIL
Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through Prediction
wav2lip_288x288
yolov5_sagemaker
whn09's Repositories
whn09/table_structure_recognition
Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, achieving results comparable to (or even better than) Table Transformer (TATR) with smaller models.
whn09/wav2lip_288x288
whn09/Baichuan2_sagemaker
whn09/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
whn09/yolov8_sagemaker
whn09/ai-models
whn09/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
whn09/darknet
Convolutional Neural Networks
whn09/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
whn09/facefusion
Next generation face swapper and enhancer
whn09/graphcast
whn09/host-yolov8-on-sagemaker-endpoint
whn09/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
whn09/iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting".
whn09/langchain
⚡ Building applications with LLMs through composability ⚡
whn09/Llama2-Chinese
Llama Chinese community: the best Chinese Llama large language model, fully open source and available for commercial use
whn09/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
whn09/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
whn09/OpenCastKit
The open-source solutions of FourCastNet and GraphCast
whn09/pangu-pytorch
Weather forecasting at 1-, 3-, 6-, and 24-hour horizons
whn09/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
whn09/SentenceTransformers_sagemaker
whn09/table-transformer
Model training and evaluation code for our dataset PubTables-1M, developed to support the task of table extraction from unstructured documents.
whn09/torchrec
PyTorch domain library for recommendation systems
whn09/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
whn09/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
whn09/Wav2Lip-HD
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
whn09/whisper.cpp
Port of OpenAI's Whisper model in C/C++
whn09/YogaPoseEstimation
Using Pose Estimation to Judge Yoga Form
whn09/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection