nykolai-d
Mathematician with a Master's Degree in Linguistics. I use Python (Pandas, Numpy, TensorFlow, NLTK) and R. Deeply interested in LLM.
Bogota, Colombia
Pinned Repositories
algebraic-topology
We cover some of the the most important concepts of algebraic topology in order to explore how deep neural networks operate in their hidden layers and why they are so amazingly efficient
classification_1
This code trains a neural network model to classify images of clothing using tf.keras
fake_news_detection
This project trains a Long Short Term Memory (LSTM) network to detect and classify fake news.
FFT
The Fast Fourier Transform With One Application In Audio Correction
manual-pruebas-de-hipotesis
Manual de pruebas de hipótesis en R
most_common_words_in_news
We create a function that takes the data from the csv file and prints the five classifications with a list of the x most repeated words for each classification.
random-forests-breast-cancer-prediction
We use the Breast Cancer Wisconsin Diagnostic Data Set to train and test the model that classifies whether a tumor with certain characteristics is a malignant or a benign tumor.
schrodinger
The Schrodinger equation and other cool physics concepts explained.
time-series-decomposition
Time series decomposition using pandas-datareader and dtw
web-scraping-legal-entities-usa
Web scraping of wikipedia page regarding the legal entity types per state, territory or commonwealth in the US. We use Beautiful Soup. Text is then preprocessed using NLTK, and most common words are shown in a word cloud.
nykolai-d's Repositories
nykolai-d/fake_news_detection
This project trains a Long Short Term Memory (LSTM) network to detect and classify fake news.
nykolai-d/algebraic-topology
We cover some of the the most important concepts of algebraic topology in order to explore how deep neural networks operate in their hidden layers and why they are so amazingly efficient
nykolai-d/FFT
The Fast Fourier Transform With One Application In Audio Correction
nykolai-d/manual-pruebas-de-hipotesis
Manual de pruebas de hipótesis en R
nykolai-d/schrodinger
The Schrodinger equation and other cool physics concepts explained.
nykolai-d/classification_1
This code trains a neural network model to classify images of clothing using tf.keras
nykolai-d/most_common_words_in_news
We create a function that takes the data from the csv file and prints the five classifications with a list of the x most repeated words for each classification.
nykolai-d/random-forests-breast-cancer-prediction
We use the Breast Cancer Wisconsin Diagnostic Data Set to train and test the model that classifies whether a tumor with certain characteristics is a malignant or a benign tumor.
nykolai-d/time-series-decomposition
Time series decomposition using pandas-datareader and dtw
nykolai-d/web-scraping-legal-entities-usa
Web scraping of wikipedia page regarding the legal entity types per state, territory or commonwealth in the US. We use Beautiful Soup. Text is then preprocessed using NLTK, and most common words are shown in a word cloud.
nykolai-d/cleaning_data_quiz_1
Quiz 1 - Getting and Cleaning Data | Coursera
nykolai-d/colombianos_exterior
Análisis de base de datos sobre los colombianos registrados en el exterior
nykolai-d/concentrese
El popular juego de cartas "Concéntrese" (Memory en inglés) en Java.
nykolai-d/data-analysis
A non-comprehensive data analysis of a data set as part of a hiring process. Developed entirely in Python, using pandas, seaborn and matplotlib as the main tools.
nykolai-d/eda_with_python
Exploratory data analysis of a Kaggle dataset using Pandas, Numpy and Seaborn
nykolai-d/elteoremaderolle
nykolai-d/k-nearest-neighbor_1
The algorithm k-nearest-neighbor with an anonymous data set, in Python.
nykolai-d/linearregression1
The relationship between the schooling level and life expectancy around the world: an exploratory model using linear regression in Python
nykolai-d/LSTM_Brown_LOB
This project trains a Long Short Term Memory (LSTM) network to detect and classify a text written in English according to a particular variant: whether it is British or American.
nykolai-d/nlp_tag_prediction
Tag prediction on Stack Overflow using TensorFlow Keras and Text Vectorization
nykolai-d/nykolai-d
Config files for my GitHub profile.
nykolai-d/poblacion_2032
Proyección de crecimiento poblacional en Colombia con base en cifras del DANE. Usamos interpolación cúbica y un ajuste de curva polinomial de grado 5.
nykolai-d/processing-csv-file
We create a function that cleans, processes and transforms the data of a csv file into a friendlier dataframe, and writes it into a new csv file.
nykolai-d/ProgrammingAssignment2
Nicolas Diaz's Repository for Programming Assignment 2 for R Programming on Coursera: Lexical Scoping
nykolai-d/pruebas_saber11_2020
Análisis estadístico Pruebas Saber 11 Calendario A (Colombia)
nykolai-d/separation_of_variables
Using the method of separation of variables, we will solve a problem describing the vertical movement of a string.
nykolai-d/SQL_basics
Basic SQL syntax
nykolai-d/teilur_wordcount
This notebook identifies the most common words in five large datasets covering the following themes: data engineering, data analytics, data science, software engineering and business analytics, as well as the most common words for the five joined datasets as a whole.
nykolai-d/transfer-learning-nlp-tf-hub
Using pre-trained models from TensorFlow Hub with tf.keras for text classification
nykolai-d/writing-a-csv-file-into-smaller-csv-files
In this program, we take the file 'sales.csv', which covers data of a two and a half year span (01-2003 to 05-2005), preprocess it and divide it into weeks. We then write one csv file per week having as title the first date of the corresponding week.