Pinned Repositories
Ableist-Detector
Big-data-project-student-dropout-rates
Predictive Models and Analysis of Student Enrollment vs. Dropout and the Ethics Entailed
custom-pipeline
Building custom data pipeline from user input
dbt-ga4
dbt Package for modeling raw data exported by Google Analytics 4. BigQuery support, only.
flight-delay-prediction
Predicting flight delays using machine learning models
Hate-Speech-Detection
Dynamics of hate speech analysis in tweets using text mining techniques
ML-for-drug-discovery-dopamine-receptors
Machine learning for drug discover
Pyspark-Big-Data-Analytics
IST 718 Big Data Analytics
sql-murder-mystery
SQL murder mystery - for practise
twitter-sentiment-bilstm
Sentiment analysis on twitter using BiLSTM
gughapriyaaelango's Repositories
gughapriyaaelango/Big-data-project-student-dropout-rates
Predictive Models and Analysis of Student Enrollment vs. Dropout and the Ethics Entailed
gughapriyaaelango/custom-pipeline
Building custom data pipeline from user input
gughapriyaaelango/dbt-ga4
dbt Package for modeling raw data exported by Google Analytics 4. BigQuery support, only.
gughapriyaaelango/flight-delay-prediction
Predicting flight delays using machine learning models
gughapriyaaelango/Hate-Speech-Detection
Dynamics of hate speech analysis in tweets using text mining techniques
gughapriyaaelango/ML-for-drug-discovery-dopamine-receptors
Machine learning for drug discover
gughapriyaaelango/Pyspark-Big-Data-Analytics
IST 718 Big Data Analytics
gughapriyaaelango/sql-murder-mystery
SQL murder mystery - for practise
gughapriyaaelango/twitter-sentiment-bilstm
Sentiment analysis on twitter using BiLSTM
gughapriyaaelango/Ableist-Detector
gughapriyaaelango/Applied-Machine-Learning
gughapriyaaelango/BERT-ChatBot-v1
Chatbot made using distilBERT-base-uncased model - Review of code for IST 664 NLP
gughapriyaaelango/C4Lab
gughapriyaaelango/classroom-engagement-gauging-with-vision-transformers
Measuring engagement of students in classroom using Vision Transformer models
gughapriyaaelango/dbt-bigquery-tcp-project
TCP project - using dbt and bigquery
gughapriyaaelango/Decision-Tree-in-Python-for-Continuous-Attributes
This code constructs a Decision Tree for a dataset with continuous Attributes. Each training instance has 16 numeric attributes (features) and a classification label, all separated by commas. In deciding which attribute to test at any point, the information gain metric is used. The node test threshold for each potential attribute is set using this same metric i.e. at each point, all the values that exist for a particular attribute in the remaining instances are ordered, and threshold values that are (half way) between successive attribute values are used to find the Information Gain. The threshold value that gives the highest information gain is used. The same attribute can be tested again later in the tree (with a different threshold).
gughapriyaaelango/DeepLearning
gughapriyaaelango/etl_goodreads
ETL using Goodreads API and scheduling using Airflow
gughapriyaaelango/git_practice
A practice repo for forking and submitting pull requests
gughapriyaaelango/Health-Insurance-Prediction
Intro to Data Science course - group project for health insurance preediction
gughapriyaaelango/Introduction-to-Data-Science-in-R
IST 687 Introduction to Data Science in R
gughapriyaaelango/Layout-practise-1
Layout practise 1
gughapriyaaelango/linear-regression-gradient-descent
Linear regression using Stochastic Gradient Descent
gughapriyaaelango/mice-protein-expression
Mice protein expression
gughapriyaaelango/Natural-Language-Processing
IST 664 Natural Language Processing - Masters in Applied Data Science
gughapriyaaelango/novartis-datathon
Drug lifecycle prediction for Novartis cluster_ids
gughapriyaaelango/Real-valued-specificity-prediction
gughapriyaaelango/SQL-practise
SQL practise from many websites
gughapriyaaelango/stackexchange-rest-api
Using stackexchange REST API
gughapriyaaelango/Text-Mining-Labs
IST 736 - Text Mining