Pinned Repositories
AS-One
Easy & Modular Computer Vision Detectors, Trackers & SAM - Run YOLOv9,v8,v7,v6,v5,R,X in under 10 lines of code.
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
AcTiV-annotation-tool
Semi-automatic annotation of arabic text in news videos
AcTiV-Evaluation-Tool
EvalOCR
A python wrapper for the Evaluation Metrics of the Hidden Markov Model Toolkit (HTK): ```HResults$```, used for OCR purposes.
Textar
Arabic text recognition in mutlimedia documents ____ part of IntelligencIA Project
object-navigation
Implicit Obstacle Map-driven indoor navigation
recognize-anything
Open-source and strong foundation image recognition models.
HawkEye
Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos
AnomalyRuler
Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"
ooza's Repositories
ooza/AcTiV-annotation-tool
Semi-automatic annotation of arabic text in news videos
ooza/AcTiV-Evaluation-Tool
ooza/EvalOCR
A python wrapper for the Evaluation Metrics of the Hidden Markov Model Toolkit (HTK): ```HResults$```, used for OCR purposes.
ooza/Textar
Arabic text recognition in mutlimedia documents ____ part of IntelligencIA Project