data-annotation

There are 104 repositories under data-annotation topic.

  • diffgram

    diffgram/diffgram

    The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

    Language:Python1.9k30831119
  • taivop/awesome-data-annotation

    A list of tools for annotating data, managing annotations, etc.

  • explosion/prodigy-recipes

    🍳 Recipes for the Prodigy, our fully scriptable annotation tool

    Language:Jupyter Notebook480260115
  • yihong1120/Construction-Hazard-Detection

    Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety cone coordinates to create monitored zones. Post-processing algorithms improve detection accuracy.

    Language:Python22716425
  • explosion/jupyterlab-prodigy

    🧬 A JupyterLab extension for annotating data with Prodigy

    Language:TypeScript18818022
  • avenix/WDK

    The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

    Language:MATLAB1457214
  • thepanacealab/SMMT

    Social Media Mining Toolkit (SMMT) main repository

    Language:Python13312737
  • classifai

    CertifaiAI/classifai

    :fire: One of the most comprehensive open-source data annotation platform.

    Language:Java119514525
  • slrbl/human-in-the-loop-machine-learning-tool-tornado

    Tornado is an open source Human-in-the-loop machine learning tool. It helps you label your dataset on the fly while training your model through a simple web user interface. It supports all data types: structured, text and image.

    Language:Ruby6321011
  • BatsResearch/alfred

    A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

    Language:Python50336
  • pixano/pixano

    Data-centric AI building blocks for computer vision applications

    Language:Python4431298
  • PersianDataAnnotations

    amastaneh/PersianDataAnnotations

    PersianDataAnnotations is ASP.NET Core MVC & ASP.NET MVC Custom Localization DataAnnotations (Localized MVC Errors) for Persian(Farsi) language - فارسی سازی خطاهای اعتبارسنجی توکار ام.وی.سی. و کور.ام.وی.سی. برای نمایش اعتبار سنجی سمت کلاینت

    Language:C#436812
  • tamsviz

    TAMS-Group/tamsviz

    Visualization and Annotation Tool for ROS

    Language:C++431405
  • explosion/vscode-prodigy

    🧬 A VS Code extension for annotating data with Prodigy

    Language:TypeScript30412
  • saran9991/llm-data-annotation

    Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance

    Language:Python30004
  • focus_annotator

    13hannes11/focus_annotator

    This is a tool to annotate the focus plane of z-stacked images.

    Language:Rust273121
  • joactr/AnnoTheia

    AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.

    Language:Python26110
  • ufal/factgenie

    Lightweight self-hosted span annotation tool

    Language:Python233852
  • minnesotanlp/infoVerse

    Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"

    Language:Python16101
  • jangedoo/jupino

    Annotate data using Jupyter notebooks

    Language:Python13210
  • astutic/brat-standoff-to-json

    Converts brat standoff format to JSONL format

    Language:Go12212
  • monatis/asr-annotation-bot

    Simple Telegram bot to annotate and varify automatic speech recognition datasets

    Language:Python12401
  • ziliHarvey/smart-annotation-pointrcnn

    A PointRCNN version of SAnE, which is a web-based semi-automatic annotation tool for point cloud data.

    Language:Python12476
  • diffgram/awesome-training-data

    Curated list of Awesome Training Data! (Data Labeling, Annotation, Discovery, Workflow etc)

  • rbsathish/Data-annotation

    Convert your annotated data from one format to another format

    Language:Python9203
  • AI4Bharat/Anudesh-Frontend

    Language:JavaScript7604
  • dkalpakchi/Textinator

    An internationalized highly customizable annotation and evaluation tool for Natural Language Processing (NLP) tasks

    Language:Python72181
  • fastent/fastent

    custom models for named-entity recognition

    Language:Python7392
  • donlapark/XLabel

    XLabel: An Explainable Data Labeling Assistant

    Language:Jupyter Notebook5102
  • irgroup/labelstudio-to-fonduer

    This small module connects Label Studio with Fonduer by creating a fonduer labeling function for gold labels from a label studio export. Documentation: https://irgroup.github.io/labelstudio-to-fonduer/

    Language:Python5210
  • NLPForUA/UA-LLM

    The entry point for adapting, training, evaluating, and leveraging various Large Language Models (LLMs) for a wide range of Ukrainian NLP tasks.

    Language:Python510
  • FilamentAI/qa-annotation

    The Streamlit tool for the Filament Synthetic QA Pairs project, used to annotate generated data.

    Language:Python4400
  • liamtoran/flippers

    Flippers is a weak supervision library for creating high quality labels using your domain kownledge and weak supervision sources.

    Language:Python4421
  • riaj0224/ObjectRecognition_Nuts_v_Screws

    This repository presents a project focused on image recognition of nuts and screws using object detection techniques. The objective is to develop a model capable of accurately detecting and classifying nuts and screws in images, enabling automation and quality control in industrial settings.

    Language:Jupyter Notebook4200
  • thu-west/AnnotationTool

    An Annotation Tool Designed for Health Unstructured Data (标注工具)

    Language:Java4404