Pinned Repositories
codiesp-evaluation-script
Evaluation library for CodiEsp Task
corpus-cleaner-acl
distemist_evaluation_library
embeddings_v1.0
iaa-computation
Compute Inter Annotator Agreement from Brat files
PharmaCoNER-Tagger
PharmaCoNER Tagger is a Neural Named Entity Recognition program targeting domain adaptation, particularly in the case of Spanish medical texts. It is based on NeuroNER.
spanish-person-names-generator
Generator of Spanish names based on the lists of INE
TEMUNormalizer
Baseline term normalizer to find Snomed and CIE-10 codes in a list of terms
TemuSTS
Program for analyzing similar sentences in two corpora.
web-scrapping
Repository that contains all web scraping scripts written by the text mining group
Text Mining Unit at BSC's Repositories
TeMU-BSC/codiesp-evaluation-script
Evaluation library for CodiEsp Task
TeMU-BSC/PharmaCoNER-Tagger
PharmaCoNER Tagger is a Neural Named Entity Recognition program targeting domain adaptation, particularly in the case of Spanish medical texts. It is based on NeuroNER.
TeMU-BSC/TEMUNormalizer
Baseline term normalizer to find Snomed and CIE-10 codes in a list of terms
TeMU-BSC/corpus-cleaner-acl
TeMU-BSC/catalan_CC0_sentences
Collected CC0 sentences written in Catalan
TeMU-BSC/demos
Web demos for some text mining projects
TeMU-BSC/distemist_evaluation_library
TeMU-BSC/embeddings_v1.0
TeMU-BSC/iberifier
TeMU-BSC/temu-webpage
Landing page of the Text Mining Unit at Barcelona Supercomputing Center.
TeMU-BSC/Biomedical_NER_models
TeMU-BSC/BioTextMiner
BioTextMiner is a web application developed by NLP4BIA that provides a user-friendly interface for managing biomedical corpora. With BioTextMiner, researchers can manage and manipulate large-scale biomedical text data by organizing and curating it in a centralized database.
TeMU-BSC/cantemist-evaluation-library
Compute evaluation metrics for Cantemist submissions
TeMU-BSC/clinical-nested-ner
TeMU-BSC/detect-annotations
Detect missed annotations in BRAT files based on previous annotations (from other files).
TeMU-BSC/indexer
DeCS Indexer frontend and backend for MESINESP task.
TeMU-BSC/language-model-prepro
Preprocessing scripts for language models
TeMU-BSC/medprocner_evaluation_library
Evaluation library for the MedProcNER/ProcTEMIST shared task (https://temu.bsc.es/medprocner/)
TeMU-BSC/seq-to-seq-catalan
Sequence-to-sequence language resources for Catalan, covering two tasks: summarization and machine translation.
TeMU-BSC/socialdisner_evaluation_script
TeMU-BSC/ictusnet-webapp
Web tool for the ICTUSnet task.
TeMU-BSC/meddoplace_scoring_script
Scoring script of the MEDDOPLACE Shared Task
TeMU-BSC/biowikipedia-dbpedia
TeMU-BSC/decs-abstracts
Web visualizer to select the best companies to annotate DeCS codes for the MESINESP project.
TeMU-BSC/Embeddings
[PlanTL/medicine/word embeddings] Word embeddings generated from Spanish corpora.
TeMU-BSC/gpt3-queries
GPT-3 multilingual evaluation
TeMU-BSC/presto-nlp
Presto NLP project
TeMU-BSC/prodigy-documentation
TeMU-BSC/streamlit
Streamlit — The fastest way to build data apps in Python
TeMU-BSC/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer-based networks.