marinamashina

Madrid, Spain

Pinned Repositories

analysis-of-dblp-authors-data-mongodb-and-neo4j
We analysed 2 GB of author data from the DBLP repository using MongoDB and Neo4j. Dataset link: https://dblp.uni-trier.de/xml/
Language:Python1 2 00
feedback-app
React feedback app from React course
Language:JavaScript0 1 00
gender-detection-elastic-search-kibana-genderize
Gender study of DBLP authors using Elasticsearch, Kibana, and the genderize.io API for Python. Dataset link: https://dblp.uni-trier.de/xml/
Language:Python0 2 00
github-finder-app
Find GitHub users and display their info
Language:JavaScript0 1 00
gravitas-agent
First agent to test with simple tasks
Language:Python0 0 00
house-marketplace
House marketplace built with React and Firebase
Language:JavaScript00
IAB-brain-drain-tableau
Getting Started: The dataset is from the German IAB (Institute for Employment Research). It covers 20 OECD destination countries and 195 countries of origin. The data is disaggregated by gender. The time period covered is 1980 to 2010 in 5-year intervals.
Language:Python2 3 02
madrid-metro-data-retrieval-scrappy
We retrieved data from the Madrid Metro website and used it to complete an existing CSV data file, using Scrapy and Python.
Language:Python0 2 00
natural-language-processing-nltk
Natural language processing of HTML news files using the NLTK library for Python. The base project for the NLP was provided by Alberto Fernandez Isabel and consists of news clustering with tf and tf-idf algorithms and the use of the ARI metric.
Language:Python0 2 00
occupancy-detection-spark-streaming
Data production and calculation using Spark Streaming and Kafka. Data is from the Occupancy Detection dataset in the UCI repository: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+#
Language:Jupyter Notebook0 2 00

marinamashina's Repositories

marinamashina/IAB-brain-drain-tableau
Getting Started: The dataset is from the German IAB (Institute for Employment Research). It covers 20 OECD destination countries and 195 countries of origin. The data is disaggregated by gender. The time period covered is 1980 to 2010 in 5-year intervals.
Language:Python2 3 02
marinamashina/analysis-of-dblp-authors-data-mongodb-and-neo4j
We analysed 2 GB of author data from the DBLP repository using MongoDB and Neo4j. Dataset link: https://dblp.uni-trier.de/xml/
Language:Python1 2 00
marinamashina/feedback-app
React feedback app from React course
Language:JavaScript0 1 00
marinamashina/gender-detection-elastic-search-kibana-genderize
Gender study of DBLP authors using Elasticsearch, Kibana, and the genderize.io API for Python. Dataset link: https://dblp.uni-trier.de/xml/
Language:Python0 2 00
marinamashina/github-finder-app
Find GitHub users and display their info
Language:JavaScript0 1 00
marinamashina/gravitas-agent
First agent to test with simple tasks
Language:Python0 0 00
marinamashina/house-marketplace
House marketplace built with React and Firebase
Language:JavaScript00
marinamashina/madrid-metro-data-retrieval-scrappy
We retrieved data from the Madrid Metro website and used it to complete an existing CSV data file, using Scrapy and Python.
Language:Python0 2 00
marinamashina/natural-language-processing-nltk
Natural language processing of HTML news files using the NLTK library for Python. The base project for the NLP was provided by Alberto Fernandez Isabel and consists of news clustering with tf and tf-idf algorithms and the use of the ARI metric.
Language:Python0 2 00
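As an illustration of the clustering evaluation mentioned for natural-language-processing-nltk, the sketch below computes an agreement score between two clusterings. It assumes that "ARI" refers to the adjusted Rand index; the function name `adjusted_rand_index` and the sample labels are hypothetical and are not taken from the repository.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two flat clusterings of the same items.

    Hypothetical helper for illustration; not code from the repository.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("both labelings must cover the same items")

    total_pairs = comb(len(labels_true), 2)
    if total_pairs == 0:
        # Fewer than two items: nothing to disagree about.
        return 1.0

    # Contingency counts: items sharing each (true, predicted) label pair.
    pair_counts = Counter(zip(labels_true, labels_pred))
    true_counts = Counter(labels_true)
    pred_counts = Counter(labels_pred)

    sum_pairs = sum(comb(c, 2) for c in pair_counts.values())
    sum_true = sum(comb(c, 2) for c in true_counts.values())
    sum_pred = sum(comb(c, 2) for c in pred_counts.values())

    expected = sum_true * sum_pred / total_pairs
    maximum = (sum_true + sum_pred) / 2
    if maximum == expected:
        # Degenerate case, e.g. one cluster on both sides: treat as agreement.
        return 1.0
    return (sum_pairs - expected) / (maximum - expected)


# Made-up labels: the same grouping under different label names.
score = adjusted_rand_index([0, 0, 1, 1], ["b", "b", "a", "a"])
```

The score depends only on which items are grouped together, not on the label names, so a clustering that matches the reference grouping scores 1.0 even when its cluster ids differ.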
marinamashina/occupancy-detection-spark-streaming
Data production and calculation using Spark Streaming and Kafka. Data is from the Occupancy Detection dataset in the UCI repository: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+#
Language:Jupyter Notebook0 2 00
marinamashina/pathway-forte
A Python package for benchmarking pathway databases with functional enrichment and classification methods
Language:Python0 1 00
marinamashina/reshaping-data-into-long
Takes an Excel file object with multiple tabs in wide format and the index of the tab to be parsed and reshaped. Returns a data frame of the specified tab reshaped to long format, suitable for data processing, for example visualization with Tableau.
Language:Python1 0
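The wide-to-long reshaping described for reshaping-data-into-long could look roughly like the sketch below. It is not the repository's code: it assumes the workbook tabs have already been read into plain lists of dictionaries rather than a data frame, and the names `wide_to_long`, `reshape_tab`, `id_columns`, and the sample data are all hypothetical.

```python
def wide_to_long(rows, id_columns, var_name="variable", value_name="value"):
    """Turn wide rows (one column per measurement) into long rows.

    Hypothetical helper: each input row is a dict; every column that is not
    an identifier column becomes its own output row.
    """
    long_rows = []
    for row in rows:
        ids = {col: row[col] for col in id_columns}
        for col, val in row.items():
            if col in id_columns:
                continue
            long_rows.append({**ids, var_name: col, value_name: val})
    return long_rows


def reshape_tab(tabs, tab_index, id_columns):
    """Select one tab by position and reshape it to long format."""
    return wide_to_long(tabs[tab_index], id_columns)


# Made-up workbook with two tabs, each a list of wide rows.
tabs = [
    [{"country": "ES", "1980": 10, "1985": 12}],
    [{"country": "FR", "1980": 7, "1985": 9}],
]
long_rows = reshape_tab(tabs, tab_index=1, id_columns=["country"])
```

Each wide row yields one long row per measurement column, which is the shape that tools such as Tableau typically expect for filtering and plotting.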
marinamashina/telco-customer-churn
Our solution to the telco customer churn binary classification problem, using the Kaggle dataset: https://www.kaggle.com/blastchar/telco-customer-churn
Language:HTML2 0
marinamashina/test
Language:HTML2 0
marinamashina/tripadvisor-restaurants-study-spark
Study of Tripadvisor restaurant data using Spark DataFrame operations and SQL queries. Dataset link: https://www.kaggle.com/damienbeneschi/krakow-ta-restaurans-data-raw
Language:Jupyter Notebook2 0
