🌐 GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model

Release

  • Data

    • For Stage 1 (Reasoning Tuning Phase), we have released the SFT data on Hugging Face.
    • For Stage 2 (Location Tuning Phase), we cannot distribute the data directly because the Google Street View images are copyrighted. You can retrieve the relevant images yourself through the official Google Street View API.
  • Code

    • Coming Soon
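
Because the Stage 2 images have to be fetched through the official Google Street View API, the following is a minimal sketch of how one such request could be built with the Street View Static API. It is not the authors' collection script: the helper names `build_street_view_url` and `download_street_view`, the example coordinates, and the default `heading`, `pitch`, `fov`, and `size` values are assumptions chosen for illustration, and `YOUR_API_KEY` is a placeholder for your own key.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Public endpoint of the Google Street View Static API.
STREET_VIEW_ENDPOINT = "https://maps.googleapis.com/maps/api/streetview"


def build_street_view_url(lat, lng, api_key, heading=0, pitch=0, fov=90, size="640x640"):
    """Build a Street View Static API request URL (hypothetical helper).

    The default view parameters are illustrative assumptions, not the
    settings used to build the GeoReasoner Stage 2 data.
    """
    params = {
        "size": size,                 # image size in pixels, WIDTHxHEIGHT
        "location": f"{lat},{lng}",   # latitude,longitude of the panorama
        "heading": heading,           # compass direction of the camera, in degrees
        "pitch": pitch,               # up/down angle of the camera, in degrees
        "fov": fov,                   # horizontal field of view, in degrees
        "key": api_key,               # your own API key
    }
    return f"{STREET_VIEW_ENDPOINT}?{urlencode(params)}"


def download_street_view(lat, lng, api_key, out_path, **view_kwargs):
    """Fetch one image and write it to out_path (needs network access and a valid key)."""
    url = build_street_view_url(lat, lng, api_key, **view_kwargs)
    with urlopen(url) as response, open(out_path, "wb") as image_file:
        image_file.write(response.read())
    return out_path


if __name__ == "__main__":
    # Example coordinates are arbitrary; only the URL is printed here.
    print(build_street_view_url(48.8584, 2.2945, "YOUR_API_KEY", heading=90))
```

Calling `download_street_view` for several `heading` values at the same location is one way to cover multiple viewing directions. Requests are billed and governed by Google's terms of service, so check those before collecting images in bulk.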

Usage and License Notices

This project uses datasets and checkpoints that are subject to their respective original licenses. The data collected from GeoGuessr and Tuxun may not be used for commercial purposes.

Description

Acknowledgments

Citation

@inproceedings{li2024georeasoner,
  title={GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model},
  author={Li, Ling and Ye, Yu and Jiang, Bingchuan and Zeng, Wei},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2024}
}