davidsbatista
Machine Learning | NLP Engineer | Software Developer
@veeva-link-data-processing Berlin, Germany
Pinned Repositories
Annotated-Semantic-Relationships-Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Aspect-Based-Sentiment-Analysis
Aspect-Based Sentiment Analysis Experiments
BREDS
"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in EMNLP'15 - Python implementation
ConvNets-for-Sentence-Classification
"Convolutional Neural Networks for Sentence Classification" (Kim 2014) - https://www.aclweb.org/anthology/D14-1181
machine-learning-notebooks
Assorted exercises and proof-of-concepts to understand and study machine learning and statistical learning theory
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
NER-Evaluation
An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity
SLANG-Sequence-LAbeliNG
Sequence LAbeliNG with Neural Networks: "Neural Architectures for Named Entity Recognition" (Lample et al., 2016) and "End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF" (Ma, 2016)
Snowball
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
text-classification
An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines
davidsbatista's Repositories
davidsbatista/Annotated-Semantic-Relationships-Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
davidsbatista/NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
davidsbatista/NER-Evaluation
An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity
davidsbatista/Snowball
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
davidsbatista/BREDS
"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in EMNLP'15 - Python implementation
davidsbatista/Aspect-Based-Sentiment-Analysis
Aspect-Based Sentiment Analysis Experiments
davidsbatista/text-classification
An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines
davidsbatista/ConvNets-for-Sentence-Classification
"Convolutional Neural Networks for Sentence Classification" (Kim 2014) - https://www.aclweb.org/anthology/D14-1181
davidsbatista/machine-learning-notebooks
Assorted exercises and proof-of-concepts to understand and study machine learning and statistical learning theory
davidsbatista/REACTION-resources
Resources developed by and for the project REACTION (Retrieval, Extraction and Aggregation Computing Technology for Integrating and Organizing News) an initiative for developing a computational journalism platform (mostly) for Portuguese.
davidsbatista/StanfordNER-experiments
davidsbatista/SLANG-Sequence-LAbeliNG
Sequence LAbeliNG with Neural Networks: "Neural Architectures for Named Entity Recognition" (Lample et al., 2016) and "End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF" (Ma, 2016)
davidsbatista/Toponym-Disambiguation-Using-Ontology-Based-Semantic-Similarity
Toponym Disambiguation using Ontology-based Semantic Similarity.
davidsbatista/Temporal-Information-Datasets
davidsbatista/GermEval-2019-Task_1
GermEval 2019 Task 1 - Shared Task on Hierarchical Classification of Blurbs
davidsbatista/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
davidsbatista/davidsbatista.net
my personal homepage and blog
davidsbatista/GermEval-2017-Aspect-Based-Sentiment-Analysis
davidsbatista/nostalgia
Old projects of mine, done during high-school or university and found in old hard-drives
davidsbatista/politiquices
Explore relações de apoio e oposição, entre personalidades políticas, expressas em títulos de notícias preservadas no arquivo.pt
davidsbatista/setfit
Efficient few-shot learning with Sentence Transformers
davidsbatista/Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
davidsbatista/chilosopher.com
webpage for my musical experiments
davidsbatista/davidsbatista.github.io
davidsbatista/dotfiles
My dot files for several things
davidsbatista/jena-docker
Docker image for Apache Jena riot
davidsbatista/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
davidsbatista/newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
davidsbatista/pt-elections-2024
davidsbatista/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python