chayansraj
D: Discover, A: Analyze, T: Test, A: Act. Passionate about Data Engineering. I learn every day!
Ericsson
Sweden
Pinned Repositories
Advanced-Regression-Techniques-for-Ames-housing-data-prediction
Prediction of Ames house prices using advanced regression techniques and ML algorithms.
Data-Pipeline-with-dbt-using-Airflow-on-GCP
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data into BigQuery. The tools used in this project include Astro, dbt, GCP, Airflow, and Metabase.
End-to-end-machine-learning
The idea of this project is to apply statistical methods learned in university lectures to find patterns in the data and to use machine learning to solve a supervised classification problem.
GPT-and-LangChain-for-Data-analysis
LangChain is a framework for developing applications powered by language models. It connects a language model to sources of context (prompt instructions, content to ground its response in, etc.) and relies on the language model to reason (about how to answer based on the provided context, what actions to take, etc.).
LINHAC-2022-Data-Science-Student-Competition
Linköping Hockey Analytics Conference - LINHAC 2022 | Given the event data, generate findings/patterns related to sequences of events leading up to a particular outcome.
Microsoft-Azure-Medallion-Data-pipeline
In this project we create an end-to-end data platform covering data ingestion, data transformation, data loading, and reporting.
Neural-Networks-and-Learning-Systems
Solving problems using different machine learning algorithms. Machine learning, classification, pattern recognition and high-dimensional data analysis.
Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using the open-source Apache Airflow orchestration tool on an AWS EC2 instance.
Snowflake-data-ingestion-hands-on-tutorial
This repo covers the two most widely used and recommended file-based data ingestion approaches: COPY INTO and Snowpipe.
Youtube-video-data-analytics-using-AWS
This project aims to leverage Amazon Web Services to create an analytics service for trending YouTube videos. The project contains data engineering, data analysis, and data science parts.
chayansraj's Repositories
chayansraj/Data-Pipeline-with-dbt-using-Airflow-on-GCP
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data into BigQuery. The tools used in this project include Astro, dbt, GCP, Airflow, and Metabase.
chayansraj/Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using the open-source Apache Airflow orchestration tool on an AWS EC2 instance.
chayansraj/Youtube-video-data-analytics-using-AWS
This project aims to leverage Amazon Web Services to create an analytics service for trending YouTube videos. The project contains data engineering, data analysis, and data science parts.
chayansraj/LINHAC-2022-Data-Science-Student-Competition
Linköping Hockey Analytics Conference - LINHAC 2022 | Given the event data, generate findings/patterns related to sequences of events leading up to a particular outcome.
chayansraj/Microsoft-Azure-Medallion-Data-pipeline
In this project we create an end-to-end data platform covering data ingestion, data transformation, data loading, and reporting.
chayansraj/End-to-end-machine-learning
The idea of this project is to apply statistical methods learned in university lectures to find patterns in the data and to use machine learning to solve a supervised classification problem.
chayansraj/Advanced-Regression-Techniques-for-Ames-housing-data-prediction
Prediction of Ames house prices using advanced regression techniques and ML algorithms.
chayansraj/GPT-and-LangChain-for-Data-analysis
LangChain is a framework for developing applications powered by language models. It connects a language model to sources of context (prompt instructions, content to ground its response in, etc.) and relies on the language model to reason (about how to answer based on the provided context, what actions to take, etc.).
chayansraj/Neural-Networks-and-Learning-Systems
Solving problems using different machine learning algorithms. Machine learning, classification, pattern recognition and high-dimensional data analysis.
chayansraj/Snowflake-data-ingestion-hands-on-tutorial
This repo covers the two most widely used and recommended file-based data ingestion approaches: COPY INTO and Snowpipe.
chayansraj/Vision-Transformer-Research
The purpose of this research project is to compare traditional CNNs to vision transformers: can transformers give a higher AUC when classifying Atypical Femoral Fracture / Normal Femoral Fracture?
chayansraj/CIFAR10-competition_model
Keras implementation of a model for the in-class competition in the Neural Networks and Learning Systems course at Linköping University, spring 2022.
chayansraj/dbt_learn_fundamentals
dbt is a SQL-first transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices like modularity, portability, CI/CD, and documentation.
chayansraj/Prediction-of-used-car-prices-using-various-regression-techniques
This project tackles a regression problem in which the objective is to predict the prices of used cars given a number of features/predictors about them.
chayansraj/h3-py-notebooks
Jupyter notebooks for h3-py, a hierarchical hexagonal geospatial indexing system
chayansraj/text-mining
Notebooks for the course 732A92/TDDE16 Text Mining
chayansraj/Text-Mining-Research
Text Mining develops methods for accessing information in and extracting knowledge from large volumes of text. The purpose of this repository is to research the classification of fake/real news on a dataset obtained from Kaggle.
chayansraj/chayansraj
May the code be with you!
chayansraj/ML-Clustering-soft-drink-preferences
The aim is to perform cluster analysis to group cases in the data based on the similarity of shares.
chayansraj/SQL-Data-Analysis-Projects
The goal is to learn and practice hands-on SQL by analyzing different types of datasets. All the SQL queries are written in Google BigQuery, which is a serverless, distributed SQL query engine. The projects range from basic to advanced level.