IoflyTang's Stars
gmberton/deep-visual-geo-localization-benchmark
Official code for CVPR 2022 (Oral) paper "Deep Visual Geo-localization Benchmark"
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Lu-Feng/SelaVPR
Official repository for the ICLR 2024 paper "Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition".
henry-tujia/UCAS-srun-login-script
采用python编写的国科大(雁栖湖)深澜校园网登录脚本,以实现命令行登录或者断线重连等,仅提供登录功能
yejy53/EP-BEV
About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“. (ECCV 2024)
dfyz/osm-renderer
OpenStreetMap raster tile renderer written in Rust
jgornet/predictive-coding-recovers-maps
tordanik/OSM2World
converter that creates three-dimensional models of the world from OpenStreetMap data
ZhouMengjie/Image-Map-Embeddings
hmf21/UAVLocalization
Implementation of visual based UAV geo-localization using satellite imagery
trailbehind/DeepOSM
Train a deep learning net with OpenStreetMap features and satellite imagery.
ZhouMengjie/you-are-here
You Are Here: Geolocation by Embedding Maps and Images (ECCV2020)
wxywb/history_rag
lipku/LiveTalking
Real time interactive streaming digital human
Ikaros-521/AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
tudelft-iv/CCVPE
Convolutional Cross-View Pose Estimation
HauLiang/PhD-Application-Template
LaTex Template for my PhD Application, including CV and RP...
2noise/ChatTTS
A generative speech model for daily dialogue.
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
openxrlab/xrlocalization
OpenXRLab Visual Localization Toolbox and Server
openxrlab/xrdslam
Platform for Deep Learning based SLAM
chuchen2017/TrajGDM
Simulating human mobility with a trajectory generation framework based on diffusion model
wnlen/clash-for-linux
clash-for-linux
PhantomGrapes/MGeo
MGeo: Multi-Modal Geographic Language Model Pre-Training
PaddlePaddle/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
facebookresearch/OrienterNet
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
google-research/snap
SNAP: Self-supervised Neural Maps for Visual Positioning and Semantic Understanding (NeurIPS 2023)
SmartFlowAI/Llama3-Tutorial
Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)