Pinned Repositories
define-semantic-annotation
Define is a semantic annotation software aimed at enhancing and constraining hand similarity annotation tasks.
describe_corpus
This is a dataset where each file is associated to a term. Each file in turn contains definitions for the associated term. All text snippets are embedded into doc2vec vector representations.
dummy_fraud_detection
Fraud detection in credit card payments and auto insurance claims using PySpark
expconditions
Learning Machine trained for extraction of experimental conditions from scientific literature in the biomedical area
lamda-signal-clustering
Software system implementation for signal acquisition and pattern clustering via LAMDA method (Learning Algorithm for Multivariate Data Analysis, byJoseph Aguilar-Martín, CNRS). It is written in C/C++ and it requires licenced National Instruments software called LabWindows/CVI and a USB Digital I/O Device. This software was tested for the last time over windows 7 Professional OS. See associated MSc thesis for details.
lxmls-toolkit
Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School
multinomial-bayes-document-classifier
This is a matlab library which is implemented a multinomial Bayes classifier for text document classification. Send me a mail for using doubts. Any way, each function gives you a little help.
nlp-pipeline
Script series for NLP: PMI, TF-IDF and Neural cooccurrence vectorization, vector (TF/IDF & PMI) data base distributed querying and population with Hadoop. Deep learning and kernel learning in sklearn.
open-ncd-kbc
This repo contains software and results derived from the PRODEP project entitled "Reinforcement learning in the automatic acquisition of knowledge in noncommunicable diseases"
sentence_embedding
A sentence embedding method based on weighted series
iarroyof's Repositories
iarroyof/dummy_fraud_detection
Fraud detection in credit card payments and auto insurance claims using PySpark
iarroyof/sentence_embedding
A sentence embedding method based on weighted series
iarroyof/nlp-pipeline
Script series for NLP: PMI, TF-IDF and Neural cooccurrence vectorization, vector (TF/IDF & PMI) data base distributed querying and population with Hadoop. Deep learning and kernel learning in sklearn.
iarroyof/describe_corpus
This is a dataset where each file is associated to a term. Each file in turn contains definitions for the associated term. All text snippets are embedded into doc2vec vector representations.
iarroyof/expconditions
Learning Machine trained for extraction of experimental conditions from scientific literature in the biomedical area
iarroyof/open-ncd-kbc
This repo contains software and results derived from the PRODEP project entitled "Reinforcement learning in the automatic acquisition of knowledge in noncommunicable diseases"
iarroyof/seismic_embeddings
This project aims to represent seismic data samples in an embedding space to observe similarities among embeddings. Data samples were provided by the Mexican National Seismic service (Servicio Sismológico Nacional) including intensity measurements from 1900 to 2018.
iarroyof/summ_features
iarroyof/2nd_half_wiki_generator
This is a document to document prediction model. Given the fisrt half of a Wikipedia article, predict first the probable topics of his 2nd half and then, try to generate such 2nd half article.
iarroyof/address_duplix
Address Duplication problem with supervised learning
iarroyof/aggression_identification
iarroyof/contexto_nlp
Automated Q&A for assessing lexicon acquisition
iarroyof/csk4open-agro-reasoning
Common Sense Knowledge for Open Vocabulary Reasoning in Agroecology
iarroyof/csr2open-kbc
iarroyof/cultural_nlp
Natual Language Applications to Cultural Heritage
iarroyof/discrimative_attributes
Implementation of the unsupervised model for semantic discriminative attributes using neural word embeddings. Participant system SemEval 2018 -- Task 10: Capturing Discriminative Attributes
iarroyof/elastic_pytorch_loader
Python class to load a page of es_page_size from ElasticSearch. This page is consumed in batches of batch_size documents by a pytorch data loader. A new page is loaded before the last batch is consumed by the torch model in training time.
iarroyof/herrl
Hierarchical Entropy Relational Reinforcement Learning
iarroyof/historical_sources
iarroyof/iarroyof.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
iarroyof/iword_embeddings
iarroyof/KonwChainApi
Una API para extracción automática de conocimiento biomédico basada en Inteligencia Artificial
iarroyof/lagartija
A number of tests applying NLP and Machine Learning techniques to chromosome characterization are performed in order to observe genetic factors determining sex
iarroyof/mlprl_orderbook
Baseline estimator for profit maximization in orderbook RL environments
iarroyof/ov-llm-reasoning
Open Vocabulary LLM Reasoning
iarroyof/pytorch
Some exersices in Pytorch
iarroyof/rl4kbc-csr
RL4KBC&CSR is a self-attention based Neural Language Model trained with different Knowledge Bases. The main application of RL4KBC&CSR is focused on supporting biomedical research related to the study of NonCommunicable Diseases. The goal of trained NLM is reconstruct/generate missing parts of semantic structures.
iarroyof/semanticrl
Semantic Reinforcement Learning. This preprint provides first insights: Arroyo-Fernández, I., Carrasco-Ruíz, M., & Arias-Aguilar, J. A. (2019). On the Possibility of Rewarding Structure Learning Agents: Mutual Information on Linguistic Random Sets. arXiv preprint arXiv:1910.04023.
iarroyof/topic_prediction
Latest version of topic predictor using multiple SVMs as a generative model
iarroyof/ukp_app
Ubiquitous Knowledge Processing application materials