IoflyTang

IoflyTang's Stars

gmberton/deep-visual-geo-localization-benchmark
Official code for CVPR 2022 (Oral) paper "Deep Visual Geo-localization Benchmark"
Language:Python18427
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook9k792
Lu-Feng/SelaVPR
Official repository for the ICLR 2024 paper "Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition".
Language:Python18110
henry-tujia/UCAS-srun-login-script
采用python编写的国科大（雁栖湖）深澜校园网登录脚本，以实现命令行登录或者断线重连等，仅提供登录功能
Language:Python72
yejy53/EP-BEV
About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“. (ECCV 2024)
Language:Python332
dfyz/osm-renderer
OpenStreetMap raster tile renderer written in Rust
Language:Rust13413
jgornet/predictive-coding-recovers-maps
673
tordanik/OSM2World
converter that creates three-dimensional models of the world from OpenStreetMap data
Language:Java562124
ZhouMengjie/Image-Map-Embeddings
2
hmf21/UAVLocalization
Implementation of visual based UAV geo-localization using satellite imagery
Language:Python345
trailbehind/DeepOSM
Train a deep learning net with OpenStreetMap features and satellite imagery.
Language:Python1.3k181
ZhouMengjie/you-are-here
You Are Here: Geolocation by Embedding Maps and Images (ECCV2020)
Language:MATLAB212
wxywb/history_rag
Language:Python852113
lipku/LiveTalking
Real time interactive streaming digital human
Language:Python3.7k520
Ikaros-521/AI-Vtuber
AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声；指令协同SD画图。
Language:Python2.9k450
tudelft-iv/CCVPE
Convolutional Cross-View Pose Estimation
Language:Python294
HauLiang/PhD-Application-Template
LaTex Template for my PhD Application, including CV and RP...
Language:TeX534
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.6k3.4k
jianchang512/ChatTTS-ui
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
Language:Python6.1k701
openxrlab/xrlocalization
OpenXRLab Visual Localization Toolbox and Server
Language:Python20024
openxrlab/xrdslam
Platform for Deep Learning based SLAM
Language:Python1159
chuchen2017/TrajGDM
Simulating human mobility with a trajectory generation framework based on diffusion model
Language:Python112
wnlen/clash-for-linux
clash-for-linux
Language:Shell1.2k450
PhantomGrapes/MGeo
MGeo: Multi-Modal Geographic Language Model Pre-Training
Language:Python6517
PaddlePaddle/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
Language:Python6.3k1.3k
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language:Python3.9k304
mbzuai-oryx/GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
Language:Python41831
facebookresearch/OrienterNet
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
Language:Python44844
google-research/snap
SNAP: Self-supervised Neural Maps for Visual Positioning and Semantic Understanding (NeurIPS 2023)
Language:Python17615
SmartFlowAI/Llama3-Tutorial
Llama3-Tutorial（XTuner、LMDeploy、OpenCompass）
Language:Python48850