Pinned Repositories
argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
argilla-python
The Argilla API python SDK
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
distilabel-workbench
A working repository for experimental pipelines in distilabel
advent
agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
AutoOPRO
BLEHeartRateLogger
Bluetooth Low-Energy Heart Rate Monitor data logging in Python
dtaantwerp.github.io
burtenshaw's Repositories
burtenshaw/advent
burtenshaw/agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
burtenshaw/argilla
✨ Open-source tool for data-centric NLP. Argilla helps domain experts and data teams to build better NLP datasets in less time.
burtenshaw/AutoOPRO
burtenshaw/bs4_scraping
A quick introductory class to scraping with beautiful soup and wrangling scraped tables with pandas.
burtenshaw/burtenshaw.github.io
Personal for website research, design, code, and life.
burtenshaw/ccnlg
CCNLG Proceedings
burtenshaw/CCNLG_2019
Convert data from EasyChair for use with aclpub
burtenshaw/movie-chatter
burtenshaw/See-Whence
Sequence classification base code, used for PhD thesis and SemEval 2020 sarcasm detection.
burtenshaw/soft_conv
Parsing and wrangling package for Whatsapp and Facebook conversations. Interprets multiple formats and incorporates annotator validation.
burtenshaw/spanwijdte
Binary and Multilabel toxic span detection in Dutch.
burtenshaw/wingspan
Toxic span detection system submitted to SemEval Task 5 2021.
burtenshaw/burtenshaw
burtenshaw/data-is-better-together
Let's build better datasets, together!
burtenshaw/data-viber
Data viber is your chill repo for data collection and vibe checks.
burtenshaw/distilabel
⚗️ AI Feedback framework for scalable LLM alignment
burtenshaw/distilabel_trigger
burtenshaw/distilabel_triggers
burtenshaw/doccano
Open source text annotation tool for machine learning practitioner.
burtenshaw/dta
burtenshaw/instructdantic
burtenshaw/kidscrawler
A child safe web crawler
burtenshaw/llm-autoeval
Automatically evaluate your LLMs in Google Colab
burtenshaw/orpo
Official repository for ORPO
burtenshaw/productizing_os_llms
burtenshaw/RelGraph
Data Science project on topic and relationship analysis, using the Harry Potter series as a case study.
burtenshaw/share-lm
ShareLM is a Chrome extension that lets you share your open-source conversations
burtenshaw/trl
Train transformer language models with reinforcement learning.
burtenshaw/weasimov-documentation