Pinned Repositories
article-extraction-dataset
Article title, authors, date and body extraction dataset.
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
FakeNewsDataset
a consolidated and cleaned up fake news dataset classified in the following categories: reliable, unreliable, political, bias, fake, conspiracy, rumor clickbait, junk science, satire, hate
newspaper4k
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
ninox-api
An API wrapper for the ninox db api (https://docs.ninox.com/en/api/public-cloud-apis)
RO-Diacritics
Python package for Romanian diacritics restoration
romanian-nlp-datasets
A list of Romanian NLP Datasets
news-ro-offense
a novel Romanian language dataset for offensive message detection with manually annotated comment from a local Romanian news website (stiri de cluj) into five classes
ro-offense
RO-Offense: A Novel Romanian Dataset for Offensive Language in Online Comments
ro-offense-sequences
AndyTheFactory's Repositories
AndyTheFactory/newspaper4k
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
AndyTheFactory/romanian-nlp-datasets
A list of Romanian NLP Datasets
AndyTheFactory/RO-Diacritics
Python package for Romanian diacritics restoration
AndyTheFactory/article-extraction-dataset
Article title, authors, date and body extraction dataset.
AndyTheFactory/FakeNewsDataset
a consolidated and cleaned up fake news dataset classified in the following categories: reliable, unreliable, political, bias, fake, conspiracy, rumor clickbait, junk science, satire, hate
AndyTheFactory/ninox-api
An API wrapper for the ninox db api (https://docs.ninox.com/en/api/public-cloud-apis)
AndyTheFactory/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AndyTheFactory/AAIT
AndyTheFactory/andythefactory.github.io
AndyTheFactory/anserini
A Lucene toolkit for replicable information retrieval research
AndyTheFactory/bert
TensorFlow code and pre-trained models for BERT
AndyTheFactory/Colab_Auto_Reconnect
A Chrome extension and Firefox AddOn that automatically reconnects your Colab's ongoing session without a manual click. It also provides a timer for getting notified after finishing a task.
AndyTheFactory/DAI
AndyTheFactory/evalita2023
AndyTheFactory/gzip_ranged_simple_httpserver
SimpleHTTPServer with support for Range requests and GZip Compressing
AndyTheFactory/hate-speech-ro
Dataset creation for hate speech detection in Romanian
AndyTheFactory/keras-xlnet
Implementation of XLNet that can load pretrained checkpoints
AndyTheFactory/MAS
AndyTheFactory/newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
AndyTheFactory/NN
AndyTheFactory/py-homebox
Python Wrapper for Homebox API
AndyTheFactory/requests
A simple, yet elegant, HTTP library.
AndyTheFactory/ro-paraphrase-bible
Romanian paraphrase corpus based on different translations/versions of the bible
AndyTheFactory/Romanian-Transformers
This repo is the home of Romanian Transformers.
AndyTheFactory/semeval20_task11
Repository for SemEval 2020 Task 11 related topics
AndyTheFactory/SOA
AndyTheFactory/SSL
AndyTheFactory/thephpfactory.com
Repository for thephpfactory.com website
AndyTheFactory/wikiextractor
A tool for extracting plain text from Wikipedia dumps
AndyTheFactory/YT-Blocklist
Blocks Youtube Domains via Pi-Hole