kpradyumna095
Data Scientist, Machine Learning Engineer 👋 Hi, I’m Pradyumna! 📊 I am currently an Analytics and Data Strategy Senior Associate.
Pinned Repositories
Big-Data-Training-Tutorial
Learn from Tutorials
Chatbot
A machine learning project in which a user can interact with a chatbot.
Clustering---Airline-Dataset-Using-Python
Perform clustering (both hierarchical and k-means) on the airlines data to find the optimal number of clusters, then draw inferences from the clusters obtained.
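A minimal sketch of one common way to pick the number of clusters for k-means, the elbow method: run k-means for several values of k and look for the point where the within-cluster sum of squares (inertia) stops dropping sharply. This is not taken from the repository. The toy points, the `kmeans` helper, and the deterministic farthest-first initialization are all assumptions made for illustration, and hierarchical clustering is not shown.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, iters=100):
    """Plain Lloyd's algorithm; returns (centers, inertia)."""
    # Assumption: deterministic farthest-first initialization, so the
    # example is reproducible without a random seed.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            clusters[nearest].append(p)
        # Recompute each center as the mean of its cluster (keep it if empty).
        new_centers = [
            tuple(sum(coord) / len(cl) for coord in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    inertia = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, inertia

# Hypothetical toy data: three well-separated groups of four points each.
offsets = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
corners = [(0.0, 0.0), (10.0, 10.0), (20.0, 0.0)]
points = [(cx + dx, cy + dy) for cx, cy in corners for dx, dy in offsets]

inertias = {k: kmeans(points, k)[1] for k in range(1, 6)}
for k, value in inertias.items():
    print(f"k={k}  inertia={value:.2f}")
```

On data like this, the inertia falls steeply up to k = 3 and only slightly afterwards, which is the "elbow" that suggests three clusters.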
Clustering-Crime-Dataset-Using-R
Perform clustering on the crime data, identify the number of clusters formed, and draw inferences.
Decision-Tree---Company-Dataset-Using-Python
Build a decision tree with Sales as the target variable.
hands-on-nltk-tutorial
The hands-on NLTK tutorial for NLP in Python
Logistics-Regression-Extra-Marital-Affairs-Dataset-Using-R
I have a dataset of family information on married couples, with around 10 variables and 600+ observations. The independent variables include gender, age, years married, children, religion, etc. The single response variable is the number of extramarital affairs. I want to know which factors influence the chance of an extramarital affair. Treating the outcome as binary (a person either has an affair or does not), we can fit a logistic regression model to predict the probability of an extramarital affair.
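A minimal sketch of the idea described above, written in Python rather than the R used by the repository: the affair count is first converted to a binary outcome, and a one-feature logistic regression is then fitted by plain gradient descent. The toy rows, the single `years_married` feature, and the learning-rate and step settings are assumptions for illustration only, not the actual dataset or the repository's method.

```python
import math

# Hypothetical toy rows: (years_married, number_of_affairs).
rows = [(1, 0), (2, 0), (3, 0), (4, 1), (5, 0), (6, 2), (7, 3), (8, 1)]

xs = [float(years) for years, _ in rows]
# Binarize the count: 1 if the person had at least one affair, else 0.
ys = [1 if affairs > 0 else 0 for _, affairs in rows]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, steps=5000):
    """Fit p(y=1 | x) = sigmoid(w*x + b) by gradient descent on the log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

w, b = fit_logistic(xs, ys)

def predict_proba(years):
    """Estimated probability of an affair for a given number of years married."""
    return sigmoid(w * years + b)

for years in (1, 4, 8):
    print(f"years_married={years}  p(affair)={predict_proba(years):.3f}")
```

The sign of the fitted coefficient `w` indicates the direction in which the feature moves the predicted probability; with several predictors, the same reading applies to each coefficient.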
Multilinear-Regression-Startup-Dataset-Using-Python
Prepare a prediction model for profit using the 50_startups data. Apply transformations to improve the profit predictions, and build a table of the R^2 value for each model prepared.
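A minimal sketch of the "table of R^2 per model" idea, simplified to a single predictor fitted by ordinary least squares so that it needs no external libraries. The `spend` and `profit` values are synthetic (generated from a log relationship, so the log transform wins by construction) and the three transformations are assumptions for illustration. They are not the 50_startups data or the repository's models.

```python
import math

def r_squared(xs, ys):
    """Fit y = a + b*x by ordinary least squares and return R^2."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

# Synthetic, hypothetical data: profit grows with the log of spend.
spend = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0]
profit = [1.0 + 2.0 * math.log(s) for s in spend]

# Candidate transformations of the predictor, one model per entry.
transforms = {"identity": lambda v: v, "sqrt": math.sqrt, "log": math.log}

r2_table = {
    name: r_squared([f(s) for s in spend], profit)
    for name, f in transforms.items()
}

for name, r2 in r2_table.items():
    print(f"{name:<10} R^2 = {r2:.4f}")
```

With several predictors, the same comparison applies: fit one model per transformation and collect each model's R^2 in the table.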
Multilinear-Regression-Toyota-Corolla-Dataset-Using-Python
Consider only the columns below and prepare a model to predict Price: Corolla<-Corolla[c("Price","Age_08_04","KM","HP","cc","Doors","Gears","Quarterly_Tax","Weight")]
R-Code-for-visualisation-Plots
kpradyumna095's Repositories
kpradyumna095/360_assgn
kpradyumna095/AAAMLP-CN
Chinese translation of Approaching (Almost) Any Machine Learning Problem. Online documentation: https://ytzfhqs.github.io/AAAMLP-CN/
kpradyumna095/ai-notes
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.
kpradyumna095/airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
kpradyumna095/awesome-gemini
A collection of awesome things regarding the gemini protocol ecosystem.
kpradyumna095/awesome-mlops
A curated list of references for MLOps
kpradyumna095/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
kpradyumna095/aws-insurancelake-etl
This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, AWS Glue for data transformation, and AWS CDK Pipelines. It is originally based on the AWS blog "Deploy data lake ETL jobs using CDK Pipelines" and complements the InsuranceLake Infrastructure project.
kpradyumna095/code-pilot
A GitHub bot that automatically responds to issues.
kpradyumna095/ColossalAI
Making large AI models cheaper, faster and more accessible
kpradyumna095/Complete-Life-Cycle-of-a-Data-Science-Project
Complete-Life-Cycle-of-a-Data-Science-Project
kpradyumna095/consulting-handbook
A guide for technical professionals looking to start consulting
kpradyumna095/copilot-gpt4-service
Convert GitHub Copilot to ChatGPT, with free use of the GPT-4 model
kpradyumna095/generative-models
Generative Models by Stability AI
kpradyumna095/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
kpradyumna095/gold-miner
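🥇 Juejin Translation Project, possibly the world's largest and best English-to-Chinese technical translation community, and the translation platform that best understands readers and translators: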
kpradyumna095/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
kpradyumna095/Linkedin_post
kpradyumna095/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
kpradyumna095/LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
kpradyumna095/Medium_My_Blog
Links to blog posts
kpradyumna095/outlines
Guided Text Generation
kpradyumna095/pandas-ai
PandasAI is a Python library that integrates generative AI into pandas, making data analysis conversational
kpradyumna095/polyaxon
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
kpradyumna095/project-based-learning
Curated list of project-based tutorials
kpradyumna095/pydantic
Data validation using Python type hints
kpradyumna095/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
kpradyumna095/Snowflake_ML_Intro
Introduction to performing Machine Learning on Snowflake
kpradyumna095/sqlchat
Chat-based SQL Client and Editor for the next decade
kpradyumna095/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"