Pinned Repositories
adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
argilla
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
argilla-server
A Python-native FastAPI server for the Argilla backend.
awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
biome-text
Custom Natural Language Processing with big and small models 🌲🌱
distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
get_started_with_deep_learning_for_text_with_allennlp
Getting started with AllenNLP and PyTorch by training a tweet classifier
notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
spacy-wordnet
spacy-wordnet creates annotations that make it easy to use WordNet and WordNet Domains through the NLTK WordNet interface
Argilla's Repositories
argilla-io/argilla
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
argilla-io/distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
argilla-io/notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
argilla-io/distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
argilla-io/argilla-llama-index
A public repo that contains integrations for Argilla and LlamaIndex.
argilla-io/argilla-server
A Python-native FastAPI server for the Argilla backend.
argilla-io/argilla-python
The Argilla API Python SDK
argilla-io/argilla-haystack
A public repo that contains integrations for Argilla and Haystack.
argilla-io/argilla-hf-dataset-sync
argilla-io/distilabel-workbench
A working repository for experimental pipelines in distilabel
argilla-io/dataset-cron-refresh
argilla-io/ray-clay
Ray Clay is a tool to train and deploy models from Argilla using the Ray framework.
argilla-io/chat-ui
Open source codebase powering the HuggingChat app
argilla-io/haystack
🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more.
argilla-io/roadmap
Argilla Public Roadmap
argilla-io/.github
✨ Argilla: the open-source feedback platform for LLMs
argilla-io/argilla-docker-deploy
argilla-io/argilla-workshop
A repo with everything someone might need to give a nice workshop on NLP with Argilla.
argilla-io/awesome-argilla-datasets
The Argilla team periodically creates datasets and loves to share the process and data with the world.
argilla-io/cookbook
argilla-io/data-is-better-together
Let's build better datasets, together!
argilla-io/dataset_examples
A public repo for holding dataset examples.
argilla-io/dill
serialize all of Python
argilla-io/distilabel-helm-instruct-adaptable-evaluation-criteria
A repo that implements Stanford CRFM's HELM Instruct with adaptable evaluation criteria
argilla-io/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
argilla-io/genai-stack
LangChain + Docker + Neo4j + Ollama + Argilla
argilla-io/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts the weaknesses through ranking and integrates the strengths through generative fusion to enhance the capability of LLMs.
argilla-io/orpo
Official repository for ORPO
argilla-io/prompt-collective-dashboard
A Gradio app to monitor a collective effort by the open-source AI community to understand and collect high-quality, diverse prompts.
argilla-io/trl
Train transformer language models with reinforcement learning.