finiteautomata

🤖 Machine Learning Engineer at @Accenture 🎓 Professor at Universidad de San Andrés My interests: NLP, LLMs 🦜 and Hate Speech

Argentina

finiteautomata's Stars

apankrat/nullboard
Nullboard is a minimalist kanban board, focused on compactness and readability.
Language:HTML3.8k243
tobymao/sqlglot
Python SQL Parser and Transpiler
Language:Python6.9k738
opencv/opencv-python
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Language:Python4.6k860
RManLuo/Awesome-LLM-KG
Awesome papers about unifying LLMs and KGs
2.1k160
ftvalentini/BiasPMI
On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach (Valentini et al., ACL 2023)
Language:Jupyter Notebook3
qurator-spk/eynollah
Document Layout Analysis
Language:Python35828
microsoft/wslg
Enabling the Windows Subsystem for Linux to include support for Wayland and X server related scenarios
Language:C++10.3k310
natdebandi/hate_speech_ar
Language:Jupyter Notebook1
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Language:Python7.3k895
intro-stat-learning/ISLP_labs
Up-to-date version of labs for ISLP
Language:Jupyter Notebook797459
freddyaboulton/gradio-pdf
Source code of the gradio_pdf custom component.
Language:JavaScript227
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Language:Python18.8k1.1k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.4k1.3k
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2.3k189
allenai/gooaq
Question-answers, collected from Google
Language:Python12412
asg017/sqlite-vss
A SQLite extension for efficient vector search, based on Faiss!
Language:C++1.8k65
BlueLiteBlocker/BlueLiteBlocker
A Chrome & Firefox extension for filtering out tweets from Twitter Blue users based on if they follow you and their follower count.
Language:JavaScript14512
state-spaces/mamba
Mamba SSM architecture
Language:Python13.6k1.2k
somosnlp/corpus-es
Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lenguas ✨ English-speaking contributors welcome!
Language:Python194
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Language:Python1.7k385
luferrer/ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
Language:Jupyter Notebook679
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
Language:Python546101
dorianbrown/rank_bm25
A Collection of BM25 Algorithms in Python
Language:Python1.1k89
joacosaralegui/deteccion-automatica-de-frases-chequeables
Language:Python1
chequeado/chequeabot
This repository contains all the tools we are working with related to Chequeabot's ecosystem.
Language:Python144
vladkens/twscrape
2024! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.
Language:Python1.2k139
kts/gzip-knn
Reimplentation of paper using gzip + knn for text classification
Language:Python183
erikernst4/entrainment-metrics
Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features.
Language:Python102
caserec/Datasets-for-Recommender-Systems
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Language:Jupyter Notebook999168
schmich/marinara
Pomodoro® time management assistant for Chrome
Language:JavaScript2.4k311

finiteautomata

finiteautomata's Stars

apankrat/nullboard

tobymao/sqlglot

opencv/opencv-python

RManLuo/Awesome-LLM-KG

ftvalentini/BiasPMI

qurator-spk/eynollah

microsoft/wslg

natdebandi/hate_speech_ar

stanfordnlp/stanza

intro-stat-learning/ISLP_labs

freddyaboulton/gradio-pdf

VikParuchuri/marker

huggingface/trl

eric-mitchell/direct-preference-optimization

allenai/gooaq

asg017/sqlite-vss

BlueLiteBlocker/BlueLiteBlocker

state-spaces/mamba

somosnlp/corpus-es

castorini/pyserini

luferrer/ConfidenceIntervals

texttron/tevatron

dorianbrown/rank_bm25

joacosaralegui/deteccion-automatica-de-frases-chequeables

chequeado/chequeabot

vladkens/twscrape

kts/gzip-knn

erikernst4/entrainment-metrics

caserec/Datasets-for-Recommender-Systems

schmich/marinara