Pinned Repositories
club
Official repository of the Catalan Language Understanding Benchmark (CLUB) to evaluate NLP models.
datapipe
An audio ETL pipeline for generating datasets from youtube sources
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
lm-catalan
Official source for Catalan Language Models and resources made within Aina project.
minibot
Minibot is a demonstration of how conversational experiences can be created in Catalan using open-source language technologies.
oTranscribe-plus
A free & open tool for transcribing audio interviews with offline ASR support
oTranscribe-plus-desktop
A free & open desktop tool for transcribing audio interviews with offline ASR support
Plume
Code for the paper "Investigating the translation capabilities of Large Language Models trained on parallel data only"
spacy
Pre-production releases for Spacy in Catalan
tts-api
RESTful API for synthesizing speech in catalan
projecte-aina's Repositories
projecte-aina/oTranscribe-plus
A free & open tool for transcribing audio interviews with offline ASR support
projecte-aina/lm-catalan
Official source for Catalan Language Models and resources made within Aina project.
projecte-aina/spacy
Pre-production releases for Spacy in Catalan
projecte-aina/tts-api
RESTful API for synthesizing speech in catalan
projecte-aina/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
projecte-aina/datapipe
An audio ETL pipeline for generating datasets from youtube sources
projecte-aina/club
Official repository of the Catalan Language Understanding Benchmark (CLUB) to evaluate NLP models.
projecte-aina/oTranscribe-plus-desktop
A free & open desktop tool for transcribing audio interviews with offline ASR support
projecte-aina/Plume
Code for the paper "Investigating the translation capabilities of Large Language Models trained on parallel data only"
projecte-aina/minibot
Minibot is a demonstration of how conversational experiences can be created in Catalan using open-source language technologies.
projecte-aina/demo-mt-aina
projecte-aina/tei2txt
Files to process pdf into txt, using grobid
projecte-aina/catalan-language-understanding-benchmark
CLUB Site
projecte-aina/dynamic-stance-analysis
projecte-aina/festcat-process
scripts to process festcat dataset
projecte-aina/flor_language_adaptation
projecte-aina/pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
projecte-aina/amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
projecte-aina/catalan_common_voice_filter
projecte-aina/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
projecte-aina/common-voice-monitor
projecte-aina/docg-pipeline
projecte-aina/eadop-rag
projecte-aina/mt-api
API for serving machine translation models.
projecte-aina/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
projecte-aina/rag_notebook
Rag with Flor6.3b
projecte-aina/Recorderjs
A plugin for recording/exporting the output of Web Audio API nodes
projecte-aina/sparknlp_ca
Repositori de recursos sparknlp pel català
projecte-aina/text2lang
Language detection api based on ivanlau/language-detection-fine-tuned-on-xlm-roberta-base