Pinned Repositories
user-controllable-peer-recommendations
User-Controllable Peer Recommendations in Online Health Communities
kaggle
A repository to publish and, more importantly, share solution approaches to Kaggle challenges
language-identification
This repository presents an approach to predict the language in which a document is written. In particular, the proposed approach transforms a text into character n-gram features and uses them to support the predictive power of a machine-learned classifier. Experimental results show that it is capable of identifying 14 languages with high accuracy and that its performance is better than that of some of the most popular language identification libraries in the Python ecosystem.
normalesp
An open-source spell checker for texts written in Spanish, with a focus on tweets.
sentiment-analysis-and-voting-inference-datasets
Datasets for paper: J. A. Cerón-Guzmán and E. León-Guzmán (2016), A Sentiment Analysis System of Spanish Tweets and Its Application in Colombia 2014 Presidential Election. SocialCom 2016. DOI: 10.1109/BDCloud-SocialCom-SustainCom.2016.47
spammer-detection-dataset
Dataset for paper: J. A. Cerón-Guzmán and E. León (2015), Detecting Social Spammers in Colombia 2014 Presidential Election. MICAI 2015. DOI: 10.1007/978-3-319-27101-9_9
TASS-2017
An open-source, top-ranked sentiment analysis system of Spanish tweets.
jacerong's Repositories
jacerong/spammer-detection-dataset
Dataset for paper: J. A. Cerón-Guzmán and E. León (2015), Detecting Social Spammers in Colombia 2014 Presidential Election. MICAI 2015. DOI: 10.1007/978-3-319-27101-9_9
jacerong/TASS-2017
An open-source, top-ranked sentiment analysis system of Spanish tweets.
jacerong/language-identification
This repository presents an approach to predict the language in which a document is written. In particular, the proposed approach transforms a text into character n-gram features and uses them to support the predictive power of a machine-learned classifier. Experimental results show that it is capable of identifying 14 languages with high accuracy and that its performance is better than that of some of the most popular language identification libraries in the Python ecosystem.
jacerong/sentiment-analysis-and-voting-inference-datasets
Datasets for paper: J. A. Cerón-Guzmán and E. León-Guzmán (2016), A Sentiment Analysis System of Spanish Tweets and Its Application in Colombia 2014 Presidential Election. SocialCom 2016. DOI: 10.1109/BDCloud-SocialCom-SustainCom.2016.47
jacerong/normalesp
An open-source spell checker for texts written in Spanish, with a focus on tweets.
jacerong/kaggle
A repository to publish and, more importantly, share solution approaches to Kaggle challenges