data-scientists

There are 78 repositories under the data-scientists topic.

  • academic/awesome-datascience

    :memo: An awesome Data Science repository to learn and apply for real world problems.

  • ujjwalkarn/DataSciencePython

    Common data analysis and machine learning tasks using Python

    Language:Python5.6k34851.5k
  • oegedijk/explainerdashboard

    Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.

    Language:Python2.4k22238347
  • PizzaDeDados/datascience-pizza

    🍕 Repository for gathering information on study materials in data analysis and related fields, companies that work with data, and a dictionary of concepts

  • ashishpatel26/Amazing-Feature-Engineering

    Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

    Language:Jupyter Notebook746151267
  • interpretml/interpret-text

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.

    Language:Python426185868
  • ClimbsRocks/machineJS

    [UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml.

    Language:Python4023417563
  • storieswithsiva/Data-Science-Resources

    👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

    Language:Jupyter Notebook20718164
  • Ashton-Sidhu/aethos

    Automated Data Science and Machine Learning library to optimize workflow.

    Language:Python1046318
  • fuseml/fuseml

    FuseML aims to provide an MLOps framework as the medium dynamically integrating together the AI/ML tools of your choice. It's an extensible tool built through collaboration, where Data Engineers and DevOps Engineers can come together and contribute with reusable integration code.

    Language:Go8571559
  • prashanthbasani/Awesome-DataScience-Cheatsheets

    Collection of cheatsheets for data science, machine learning and deep learning :).

  • yusufarist/Data-Science-Learning-Path

    A learning path and comprehensive list of materials for Data Science

    Language:Python472010
  • AllanCameron/PDFR

    An R package to extract text from pdf.

    Language:C++40373
  • aliarslanansari/Data-Science-Study

    This repository contains Jupyter notebooks and other resources I made while learning Data Science

    Language:Jupyter Notebook388129
  • giocoal/minimal-portfolio

    Minimal Portfolio for my Data Science projects. Jekyll minimal template hosted on GitHub Pages.

    Language:HTML25106
  • dhaitz/data-science-links

    A curated list of links to great data science articles, videos, ...

  • ucbrise/jarvis

    Build, configure, and track workflows with Jarvis.

    Language:Python1310138
  • Correia-jpv/fucking-awesome-datascience

    📝 An awesome Data Science repository to learn and apply for real world problems. With repository stars⭐ and forks🍴

  • Vedant-S/MLOps-Project

    Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.

    Language:Jupyter Notebook12008
  • JCardenasRdz/SpringBoard

    Notes, Ideas, and Projects related to my Springboard data science career track

    Language:Jupyter Notebook11307
  • prajaktaag/Credit-Card-Fraud-Detection

    Machine learning models to automatically predict credit card frauds

    Language:Python11002
  • exajobs/data-engineering-collection

    A collection of awesome software, libraries, Learning Tutorials, documents, books, resources and interesting stuff about Big Data Science & Engineering

  • ajaymache/bingo

    Simulation of an advanced algorithmic and probabilistic bingo game

    Language:Jupyter Notebook7106
  • malastare-ai/Malastare.ai

    Malastare.ai is a startup Analytics Consulting Firm. Based in Texas, USA. We combine deep industry knowledge with specialized expertise in analytics, strategy, operations, and risk management. We leverage our clients' real-world experience, industry best practices and technology best practices to enable them to succeed in their big data projects.

    Language:HTML7303
  • VladimirNikiforov/netology-ds

    Data Scientist Specialization in Netology

    Language:Jupyter Notebook7203
  • mikeroyal/R-Guide

    R Guide

    Language:R6202
  • 2KRISHNAYADAV/Amazon-USA-Data-Financial-Insights-Across-All-Stateslog-normalization

    Amazon USA Financial Data : Insights across all states, focusing on R&D, marketing, campaigns, and profit. Features log normalization to stabilize skewed data and aligns with Gaussian distribution for better analysis. Free Excel sheet available for practice.

    Language:Jupyter Notebook5100
  • javier-arango/gainesville-rentals

    This project will be focused on helping students who are looking to rent an off-campus apartment, but this could help anyone who is looking to rent an apartment in Gainesville.

    Language:Jupyter Notebook5100
  • RottenFruits/Analyticker

    Analyticker is a Docker-based analytics environment for data scientists and data analysts.

    Language:Dockerfile5200
  • arasgungore/awesome-datascience

    :memo: An awesome Data Science repository to learn and apply for real world problems.

  • Rahulkumarr2080/Comcast-Telecom-Consumer-Complaints

    Comcast is an American global telecommunication company. The firm has been providing terrible customer service. They continue to fall short despite repeated promises to improve. Only last month (October 2016) the authority fined them $2.3 million, after receiving over 1000 consumer complaints. The existing database will serve as a repository of public customer complaints file.

    Language:HTML3200
  • Anouk2311/indeed-job-listings

    This repository contains the entire workflow for our Online Data Collection & Management and Data Preparation & Workflow Management group projects (group 3).

    Language:R22203
  • chrimaho/chrimaho

    My Personal Repository

  • leestott/ResponsibleAI

    Microsoft Ignite - Getting started on your health-tech journey using responsible AI

    Language:Jupyter Notebook2003
  • mariapushkareva/Kaggle-DS-and-ML-Survey-2020

    This year, 20,036 Kaggle users told us how they learn and level up, which tools they’re using, and what they recommend. The results include raw numbers about who is working with data, what’s happening with machine learning in different industries, and the best ways for new data scientists to break into the field.

    Language:Jupyter Notebook2100
  • yomazini/Reddit-Scraper

    Comprehensive Python toolkit for extracting, analyzing, and exporting Reddit data across multiple dimensions.

    Language:Python2100