Pinned Repositories
redpajama-v2-filter-2023
Backups of RedPajama v2 filter scripts
multilingual-PII-tool
Multilingual extension of BigScience data tool "PII-manager" (https://github.com/bigscience-workshop/data_tooling/tree/master/pii-manager/src/pii_manager)
SACX-backup
Backup for SACX keyword extraction pipeline
umap-embeddings
Dump for umap visualisations of documents embeddings
class-explainer
register-qa
Analysis-of-biosignals
Course work for Analysis and Acquisition of Biosignals, UTU fall 2021.
Applications-of-Data-Analysis-2021
Repository for UTU course Applications of Data Analysis 2021. Consist of Jupyter notebooks.
Biosignal-Analysis-Project
UTU Biosignal Analysis spring 2022, EEG multiclass classification
europa-PII-scripts
Scripts used to redact PII on LUMI
mmanteli's Repositories
mmanteli/register-and-genre
Studying the overlap between register and genre
mmanteli/umap-embeddings
Dump for umap visualisations of documents embeddings
mmanteli/mahti-tokenisation
Dump for tokenisation scripts on Mahti
mmanteli/redpajama-v2-filter-2023
Backups of RedPajama v2 filter scripts
mmanteli/PII-initial-tests
Initial testing of different PII methods before deciding on BigScience tool
mmanteli/SACX-backup
Backup for SACX keyword extraction pipeline
mmanteli/europa-PII-scripts
Scripts used to redact PII on LUMI
mmanteli/multilingual-PII-tool
Multilingual extension of BigScience data tool "PII-manager" (https://github.com/bigscience-workshop/data_tooling/tree/master/pii-manager/src/pii_manager)
mmanteli/gradu
Gradun kuvat helposti muokattavina
mmanteli/Biosignal-Analysis-Project
UTU Biosignal Analysis spring 2022, EEG multiclass classification
mmanteli/Analysis-of-biosignals
Course work for Analysis and Acquisition of Biosignals, UTU fall 2021.
mmanteli/multilabel_explainability
Code for training a multilabel register model and explaining its predictions
mmanteli/Applications-of-Data-Analysis-2021
Repository for UTU course Applications of Data Analysis 2021. Consist of Jupyter notebooks.