burtenshaw

Building Argilla @ 🤗 Hugging Face

Argilla, Brussels

Pinned Repositories

argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Language: Python | 3.8k 30 2.1k 360
argilla-python
The Argilla API Python SDK
Language: Python | 8 3 801
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language: Python | 1.4k 13 413110
distilabel-workbench
A working repository for experimental pipelines in distilabel
Language: Jupyter Notebook | 6 4 01
advent
Language: Go | 0 1 00
agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
Language: TypeScript | 0 0 00
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
Language: Python | 0 0 00
AutoOPRO
Language: Shell | 0 0 00
BLEHeartRateLogger
Bluetooth Low-Energy Heart Rate Monitor data logging in Python
Language: Python | 0 2 00
dtaantwerp.github.io
Language: Jupyter Notebook | 13 10 548

burtenshaw's Repositories

burtenshaw/advent
Language: Go | 0 1 00
burtenshaw/agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
Language: TypeScript | 0 0 00
burtenshaw/argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
Language: Python | 0 0 00
burtenshaw/AutoOPRO
Language: Shell | 0 0 00
burtenshaw/bs4_scraping
A quick introductory class on scraping with Beautiful Soup and wrangling scraped tables with pandas.
Language: Jupyter Notebook | 0 2 00
burtenshaw/burtenshaw.github.io
Personal website for research, design, code, and life.
Language: HTML | 0 2 00
burtenshaw/ccnlg
CCNLG Proceedings
Language: TeX | 0 3 30
burtenshaw/CCNLG_2019
Convert data from EasyChair for use with aclpub
Language: TeX | 0 2 00
burtenshaw/movie-chatter
Language: Python | 0 8 70
burtenshaw/See-Whence
Sequence classification base code, used for PhD thesis and SemEval 2020 sarcasm detection.
Language: Jupyter Notebook | 0 2 00
burtenshaw/soft_conv
Parsing and wrangling package for WhatsApp and Facebook conversations. Interprets multiple formats and incorporates annotator validation.
Language: Python | 0 2 80
burtenshaw/spanwijdte
Binary and multilabel toxic span detection in Dutch.
Language: Python | 0 2 00
burtenshaw/wingspan
Toxic span detection system submitted to SemEval-2021 Task 5.
Language: Python | 0 2 01
burtenshaw/burtenshaw
1 0
burtenshaw/data-is-better-together
Let's build better datasets, together!
Language: Jupyter Notebook | 0 0
burtenshaw/data-viber
Data viber is your chill repo for data collection and vibe checks.
Language: Python
burtenshaw/distilabel
⚗️ AI Feedback framework for scalable LLM alignment
Language: Python | 0 0
burtenshaw/distilabel_trigger
Language: Python
burtenshaw/distilabel_triggers
burtenshaw/doccano
Open source text annotation tool for machine learning practitioners.
Language: Python | 1 0
burtenshaw/dta
Language: Jupyter Notebook | 2 0
burtenshaw/instructdantic
Language: Jupyter Notebook | 1 0
burtenshaw/kidscrawler
A child-safe web crawler
Language: Jupyter Notebook | 3 0
burtenshaw/llm-autoeval
Automatically evaluate your LLMs in Google Colab
Language: Python | 0 0
burtenshaw/orpo
Official repository for ORPO
Language: Python | 0 0
burtenshaw/productizing_os_llms
Language: Python | 1 0
burtenshaw/RelGraph
Data Science project on topic and relationship analysis, using the Harry Potter series as a case study.
Language: Python | 3 3
burtenshaw/share-lm
ShareLM is a Chrome extension that lets you share your open-source conversations
Language: JavaScript | 0 0
burtenshaw/trl
Train transformer language models with reinforcement learning.
Language: Python | 0 0
burtenshaw/weasimov-documentation
4 02