Auxotaku
I am a junior student at the School of Software, Northwestern Polytechnical University, currently exploring the field of multimodal video generation.
Northwestern Polytechnical UniversityXi‘an
Auxotaku's Stars
hkproj/pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
NVlabs/stylegan
StyleGAN - Official TensorFlow Implementation
datawhalechina/hugging-multi-agent
A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基于MetaGPT的多智能体入门与开发教程
Auxotaku/Windows-Pin
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
MAWHA/maa-whmx
基于 MaaFramework 与 Qt6 的物华弥新一键长草小助手 | 通用 MAA PC 端极速预备中!
facebookresearch/sapiens
High-resolution models for human tasks.
FudanNLP/nlp-beginner
NLP上手教程
vinthony/project-page-template
🧸 YAAPPT: Yet Another Academic Project Page Template.
nerfies/nerfies.github.io
eliahuhorwitz/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
Ryuk-me/Torrent-Api-py
An Unofficial API for 1337x, Piratebay, Nyaasi, Torlock, Torrent Galaxy, Zooqle, Kickass, Bitsearch, MagnetDL,Libgen, YTS, Limetorrent, TorrentFunk, Glodls, TorrentProject and YourBittorrent
worldveil/dejavu
Audio fingerprinting and recognition in Python
microsoft/human-pose-estimation.pytorch
The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
open-mmlab/mmhuman3d
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
OpenGVLab/Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
dulucas/siMLPe
A Simple Baseline for Human Motion Prediction
Fang-Haoshu/Halpe-FullBody
Halpe: full body human pose estimation and human-object interaction detection dataset
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
glide-the/InterpretationoDreams
使用langchain实现 故事情景生成,情感情景引导,剧情总结,性格分析
javabloger/yuqing
思通舆情 是一款开源免费的舆情系统,支持本地化部署。支持对海量的舆情数据进行多维交叉分析和深度挖掘,为用户户提供全面的舆情数据,专业的舆情分析。
Jesse-He/isearch4
舆情分析系统展示
zhe-si/MIDL_compiler
本项目为学校编译原理课程的实验,通过c++编写,实现了对MIDL语言的词法分析和语法分析功能。同时,支持出错恢复机制并维护了较为完整的出错信息。
ExponentialML/AnimateDiff-MotionDirector
MotionDirector Training For AnimateDiff. Train a MotionLoRA and run it on any compatible AnimateDiff UI.
Kosinkadink/ComfyUI-AnimateDiff-Evolved
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators