data-labeling
There are 93 repositories under data-labeling topic.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
doccano/doccano
Open source annotation tool for machine learning practitioners.
HumanSignal/awesome-data-labeling
A curated list of awesome data labeling tools
code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
alteryx/compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
shoumikchow/bbox-visualizer
Make drawing and labeling bounding boxes easy as cake
Slava/label-tool
Web application for image labeling and segmentation
phurwicz/hover
:speedboat: Label data at scale. Fun and precision included.
dataqa/nlp-labelling
Labelling platform for text using weak supervision.
samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
Toloka/toloka-kit
Toloka-Kit is a Python library for working with Toloka API.
doccano/awesome-annotation-tools
A curated list of awesome data annotation tools
expectedparrot/edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
HumanSignal/label-studio-transformers
Label data using HuggingFace's transformers and automatically get a prediction service
infinitylogesh/mutate
A library to synthesize text datasets using Large Language Models (LLM)
gereleth/jupyter-bbox-widget
A Jupyter widget for annotating images with bounding boxes
HuangCongQing/awesome-data-labeling-tools
图像images/点云point clouds标注工具汇总
villagecomputing/superpipe
Superpipe - optimized LLM pipelines for structured data
doccano/doccano-client
A simple client for doccano API.
honghanhh/coursera-practical-data-science-specialization
Solutions on Practical Data Science Specialization on Coursera (offered by deeplearning.ai)
CyberAgent/fast-annotation-tool
FAST is an annotation tool that focuses on mobile devices. https://aclanthology.org/2021.emnlp-demo.41/
amusi/awesome-data-label-tools
开源的标注工具大全(含2D图像/视频/3D点云等)
Zhenye-Na/crnn-pytorch
✍️ Convolutional Recurrent Neural Network in Pytorch | Text Recognition
datagym-ai/datagym-core
Open source annotation and labeling tool for image and video assets
astutic/Acharya
A Data Centric NER annotation tool for your Named Entity Recognition projects
doccano/auto-labeling-pipeline
doccano auto labeling pipeline helps doccano to annotate a document automatically.
megagonlabs/ruler
Data Programming by Demonstration (DPBD) for Document Classification
megagonlabs/tagruler
Data programming by demonstration for information extraction and span annotation
microsoft/OneLabeler
A system for building labeling tools
benbo/interactive-weak-supervision
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
explosion/vscode-prodigy
🧬 A VS Code extension for annotating data with Prodigy
cleanlab/cleanlab-studio
Client interface for all things Cleanlab Studio
HumanSignal/pyheartex
Heartex Python SDK - Connect your own models to Heartex Data Labeling
rafaelsandroni/gpt3-data-labeling
Data labeling using few shot learning GPT-3.
segments-ai/segments-ai
Segments.ai Python SDK