/YoloGemma

Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.

Primary LanguagePythonMIT LicenseMIT

Watchers