omniparser

There are 11 repositories under omniparser topic.

  • omniparse

    adithya-s-k/omniparse

    Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

    Language:Python6.7k4290530
  • yuruotong1/autoMate

    Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves

    Language:Python3.7k40109468
  • OpenAdapt

    OpenAdaptAI/OpenAdapt

    Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

    Language:Python1.4k15508203
  • OpenAdaptAI/OmniMCP

    OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.

    Language:Python6111410
  • presidio-oss/factif-ai

    AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.

    Language:TypeScript491325
  • FareedKhan-dev/ai-desktop

    AI agent that controls a computer

    Language:Python45205
  • GML-FMGroup/cappuccino

    Cappuccino is an GUI Agent based on desktop screen. It is a Manus-like AI Agent that can be deployed locally.

    Language:Python28306
  • aryasaatvik/omniparser-sagemaker

    🤖 deploy OmniParser v2 model on Amazon SageMaker with async inference endpoint

    Language:Python10202
  • xiaoyao9184/docker-omni

    Docker implementation of the OmniParser screen parsing tool

  • mednax-it/Omniparser_Schemas

    Placeholder for Omniparser Schemas used by universal-etl-parser

  • OpenAdaptAI/OpenAdapter

    Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models