florence-2

There are 23 repositories under florence-2 topic.

roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
Language:Python1.4k 20 18102
jhc13/taggui
Tag manager and captioner for image datasets
Language:Python761 14 19936
autodistill/autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
Language:Python96 5 813
Ravi-Teja-konda/Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
Language:Python92 6 29
autodistill/autodistill-florence-2
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
Language:Python59 4 47
retkowsky/florence-2
Florence-2
Language:Jupyter Notebook43 3 411
Damarcreative/rem-wm
Rem-WM, a powerful watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.
Language:Python37 1 25
D-Ogi/WatermarkRemover-AI
AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.
Language:Python27 4 56
fireicewolf/wd-llm-caption-cli
A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.
Language:Python25
ANYANTUDRE/Florence-2-Vision-Language-Model
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
Language:Jupyter Notebook13 1 12
jacobmarks/fiftyone_florence2_plugin
Run SOTA Vision-Language Model Florence-2 on your data!
Language:Python9 2 0
sayedmohamedscu/Vision-language-models-VLM
vision language models finetuning notebooks & use cases (paligemma - florence .....)
Language:Jupyter Notebook7 1 02
Mithunprb/text2segment_video
Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.
Language:Python6 4 01
Ambruk-chan/DiscordBot
The Ultimate Local LLM Discord Bot!!!
Language:Python4 2 02
regiellis/ecko-cli
ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX
Language:Python3 2 00
Gabriellgpc/computer-vision-dataset-maker
The Power of Florence-2 with OpenVINO & FiftyOne: Real-World Applications in Image Analysis
Language:Python2 2 00
Kazuhito00/Florence-2-Colaboratory-Sample
Microsoft の軽量VLMのFlorence-2のColaboratory上でのサンプル
Language:Jupyter Notebook2 1 0
sitamgithub-MSIT/TextSnap
TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.
Language:Python2 2 0
Zuellni/Qt-Caption
Image captioning GUI using Florence-2.
Language:Python1 2 00
phamkinhquoc2002/florence2-football-analysis
Language:Python0 2 00
Zuellni/Image-Tools
Various image processing scripts.
Language:Python0 2 00
Abdeen-A-AI/Image-Feature-Extraction-Using-GenAI
This project implements an advanced generative AI pipeline for extracting and rating features from images. It combines the power of Florence-2, a state-of-the-art vision-language model, with a fine-tuned version of Mistral-v3, a cutting-edge large language model.
Language:Jupyter Notebook1 0
antonio-f/Florence-2-test
Florence-2 quick test
Language:Jupyter Notebook1 0

florence-2

roboflow/maestro

jhc13/taggui

autodistill/autodistill-grounded-sam-2

Ravi-Teja-konda/Surveillance_Video_Summarizer

autodistill/autodistill-florence-2

retkowsky/florence-2

Damarcreative/rem-wm

D-Ogi/WatermarkRemover-AI

fireicewolf/wd-llm-caption-cli

ANYANTUDRE/Florence-2-Vision-Language-Model

jacobmarks/fiftyone_florence2_plugin

sayedmohamedscu/Vision-language-models-VLM

Mithunprb/text2segment_video

Ambruk-chan/DiscordBot

regiellis/ecko-cli

Gabriellgpc/computer-vision-dataset-maker

Kazuhito00/Florence-2-Colaboratory-Sample

sitamgithub-MSIT/TextSnap

Zuellni/Qt-Caption

phamkinhquoc2002/florence2-football-analysis

Zuellni/Image-Tools

Abdeen-A-AI/Image-Feature-Extraction-Using-GenAI

antonio-f/Florence-2-test