FrankZxShen

What you love is your life.

Southwest Jiaotong UniversitySaturn

Pinned Repositories

Amadeus
アマデウスver 1.0.4
Language:Python20
ATLA-Demo
Source code for "Adversarial Training for Layout-Aware Text-VQA".
Language:Python1 1 00
ATS
[ICME 2024] The code for Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
3 1 00
Co-Nav-Exp
多智能体目标导航实验
Language:Python1 1 00
EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. Optimize the residual module
Language:Python1 0 00
FrankZxShen-Visual-Audio-Semantic-Navigation-TEST
Language:Python10
GameAudioCrawler
A script used to climb the wiki audios for some common games.
Language:Jupyter Notebook1 1 00
habitat-installation
安装habitat的简易流程，以v0.2.2为例
1 1 00
MCoCoNav
[AAAI 2025] Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration
Language:Python9 1 00
visual-chatgpt-zh-vits
visual-chatgpt支持中文的windows版本，融合vits推断模块
Language:Python4 1 00

FrankZxShen's Repositories

FrankZxShen/MCoCoNav
[AAAI 2025] Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration
Language:Python9 1 00
FrankZxShen/visual-chatgpt-zh-vits
visual-chatgpt支持中文的windows版本，融合vits推断模块
Language:Python4 1 00
FrankZxShen/ATS
[ICME 2024] The code for Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
3 1 00
FrankZxShen/Amadeus
アマデウスver 1.0.4
Language:Python20
FrankZxShen/ATLA-Demo
Source code for "Adversarial Training for Layout-Aware Text-VQA".
Language:Python1 1 00
FrankZxShen/Co-Nav-Exp
多智能体目标导航实验
Language:Python1 1 00
FrankZxShen/EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. Optimize the residual module
Language:Python1 0 00
FrankZxShen/FrankZxShen-Visual-Audio-Semantic-Navigation-TEST
Language:Python10
FrankZxShen/GameAudioCrawler
A script used to climb the wiki audios for some common games.
Language:Jupyter Notebook1 1 00
FrankZxShen/habitat-installation
安装habitat的简易流程，以v0.2.2为例
1 1 00
FrankZxShen/latr
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answering (STVQA)
Language:Python1 0 00
FrankZxShen/MNlexNet
This is the PyTorch version repository for MNIST dataset identification.
Language:Python1 1 00
FrankZxShen/so-vits-svc-audio2audio
Replace the song vocals to get the target audio.
Language:Python1 1 00
FrankZxShen/Attention-Efficientzero-Alpaca-Lora-Webui
The Webui based on Alpaca-Lora+ChatGLM aims to visualize Atari game results of Efficientzero.
Language:Python0 1 00
FrankZxShen/ChatGLM-webui
A WebUI for ChatGLM-6B
Language:Python
FrankZxShen/CogVLM2-API4
用于softmax分类的CogVLM API
Language:Python1 0
FrankZxShen/depth_yolo
combination of darknet_ros and iai_kinect2
FrankZxShen/echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
Language:TypeScript0 0
FrankZxShen/EdgeDiffusionNav-DEMO
ICCV25-1代码备份
Language:Python
FrankZxShen/efficient-vits-finetuning
Finetuning VITS Efficiently (Lora)
FrankZxShen/FrankZxShen
1 0
FrankZxShen/FrankZxshen.github.io
blog，随便创的
Language:Stylus1 0
FrankZxShen/Grasscutter
A server software reimplementation for a certain anime game.
Language:Java0 0
FrankZxShen/LATLA
LLM portion of ATLA. Used to bring llama2 external knowledge into Text-VQA.
Language:Python1 0
FrankZxShen/Machine-Learning-Assignments
This project is only for SWJTU's students providing their assignments.
0 0
FrankZxShen/sklearn
from mofan python
Language:Python
FrankZxShen/TAP
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)：Add prompt for LLM.
FrankZxShen/visual-chatgpt-zh
visual-chatgpt支持中文版本
Language:Python0 0
FrankZxShen/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
FrankZxShen/vits-fast-fineturing-infer
For vits fine-tuning inference.
Language:Python