Pinned Repositories
Advanced-Data-Wrangling
This assignment involves the preprocessing of two main datasets prior to being merged. The first data set is imported. It has an unused variable removed and another variable renamed. The data set is then parsed for missing values. The identified missing values are replaced or removed using a variety of techniques including mean imputation, ratio replacement, removal, logical assumption replacement and constant value substitution. The second main data set is a binding of two smaller data sets. Both smaller data sets are imported from a large excel document, using specialised import specifications. The data sets are then subsetted to produce the respective desired tables. The subsetted data sets are then cleaned by the removal of blank columns. Once clean the data sets are bound by row. This main dataset then has a variable name changed. Both main data sets have their variable data types scanned and corrected. The two main data sets are then merged to form a grand final data set. The final data set has it's data types double-checked, leading to the factorising and labelling of a variable.
ALTA_Dummy
ANZ-Virtual-Internship
ANZ Data Science Program
Aus_Suicide_Dashboard
Shiny Dashboard produced in R relating to Australian Suicide Statistics.
Data-Wrangling-Project
A project showcasing data wrangling skills
Deconstruct-Reconstruct-Web-Report
https://rpubs.com/Od-Lanir/MATH2270
Detecting-Fake-News-with-Python
This advanced python project of detecting fake news deals with fake and real news. Using sklearn, we build a TfidfVectorizer on our dataset. Then, we initialize a PassiveAggressive Classifier and fit the model. In the end, the accuracy score and the confusion matrix tell us how well our model fares.
ML-Model-Predicting-Ship-Crew-Size
Machine Learning Model for Predicting a Ship's Crew Size using multiple predictors
Statistical-Analysis-of-Crime-Data
Chi-square Goodness of Fit Test performed on the proposed hypothesis.
Twitter-Mining-and-Analysis
Gathering of Twitter API data on various sources, clean and mine the data through the programming tool R, and create visualisations dictating the main aspects that Twitter accounts in order to enhance and improve McPherson College's Twitter Account/Presence
RinaldoG's Repositories
RinaldoG/Aus_Suicide_Dashboard
Shiny Dashboard produced in R relating to Australian Suicide Statistics.
RinaldoG/ML-Model-Predicting-Ship-Crew-Size
Machine Learning Model for Predicting a Ship's Crew Size using multiple predictors
RinaldoG/Advanced-Data-Wrangling
This assignment involves the preprocessing of two main datasets prior to being merged. The first data set is imported. It has an unused variable removed and another variable renamed. The data set is then parsed for missing values. The identified missing values are replaced or removed using a variety of techniques including mean imputation, ratio replacement, removal, logical assumption replacement and constant value substitution. The second main data set is a binding of two smaller data sets. Both smaller data sets are imported from a large excel document, using specialised import specifications. The data sets are then subsetted to produce the respective desired tables. The subsetted data sets are then cleaned by the removal of blank columns. Once clean the data sets are bound by row. This main dataset then has a variable name changed. Both main data sets have their variable data types scanned and corrected. The two main data sets are then merged to form a grand final data set. The final data set has it's data types double-checked, leading to the factorising and labelling of a variable.
RinaldoG/Detecting-Fake-News-with-Python
This advanced python project of detecting fake news deals with fake and real news. Using sklearn, we build a TfidfVectorizer on our dataset. Then, we initialize a PassiveAggressive Classifier and fit the model. In the end, the accuracy score and the confusion matrix tell us how well our model fares.
RinaldoG/Statistical-Analysis-of-Crime-Data
Chi-square Goodness of Fit Test performed on the proposed hypothesis.
RinaldoG/Twitter-Mining-and-Analysis
Gathering of Twitter API data on various sources, clean and mine the data through the programming tool R, and create visualisations dictating the main aspects that Twitter accounts in order to enhance and improve McPherson College's Twitter Account/Presence
RinaldoG/ALTA_Dummy
RinaldoG/ANZ-Virtual-Internship
ANZ Data Science Program
RinaldoG/Data-Wrangling-Project
A project showcasing data wrangling skills
RinaldoG/Deconstruct-Reconstruct-Web-Report
https://rpubs.com/Od-Lanir/MATH2270
RinaldoG/Generate_MCQ_BERT_Wordnet_Conceptnet
Generate Multiple choice Questions from any content or news article using BERT Extractive Summarization, Wordnet and Conceptnet
RinaldoG/logit_predict
Authorship Attribution based on logit scores
RinaldoG/Modeling-Body-Measurements
Determine if a dataset of body measurements fits a normal distribution.
RinaldoG/NSW-Gov-Virtual-Intership
Data Analysis Case Study
RinaldoG/RinaldoG
Config files for my GitHub profile.
RinaldoG/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
RinaldoG/Unrelated_Partitions