Pinned Repositories
adfdataflowdocs
Azure Data Factory Data Flow Documentation
adflab
Azure Data Factory hands-on lab, self-paced. Learn how to lift & shift SSIS packages to the Cloud with ADF. Build new ETL pipelines in ADF, transform data at scale, load Azure Data Warehouse data marts. Also walks through operationalizing ADF pipelines with scheduling and monitoring modules.
app-ask-craig
Ask Craig application
app-consumer-loan
app-malicious-domains
Domain name classifier looking for good vs. possibly malicious providers
AWS_Openguide
caffe
Caffe: a fast open framework for deep learning.
CDM
The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.
cdm-azure-data-services-integration
Tutorials and sample code for integrating CDM folders with Azure Data Services
connectors
Connectors for Delta Lake
scottsunsh's Repositories
scottsunsh/AWS_Openguide
scottsunsh/adfdataflowdocs
Azure Data Factory Data Flow Documentation
scottsunsh/adflab
Azure Data Factory hands-on lab, self-paced. Learn how to lift & shift SSIS packages to the Cloud with ADF. Build new ETL pipelines in ADF, transform data at scale, load Azure Data Warehouse data marts. Also walks through operationalizing ADF pipelines with scheduling and monitoring modules.
scottsunsh/caffe
Caffe: a fast open framework for deep learning.
scottsunsh/CDM
The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.
scottsunsh/cdm-azure-data-services-integration
Tutorials and sample code for integrating CDM folders with Azure Data Services
scottsunsh/connectors
Connectors for Delta Lake
scottsunsh/cs224u
Code for Stanford CS224u
scottsunsh/data
scottsunsh/data-governance
ODPi Egeria's Guidance on Governance - simplifying governance for the enterprise
scottsunsh/data-pipelines-with-apache-airflow
Code for Data Pipelines with Apache Airflow
scottsunsh/databricks
Repository of sample Databricks notebooks
scottsunsh/docker-python
Kaggle Python docker image
scottsunsh/Enterprise-Scale
The Enterprise-Scale architecture provides prescriptive guidance coupled with Azure best practices, and it follows design principles across the critical design areas for organizations to define their Azure architecture
scottsunsh/eventsim
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
scottsunsh/FEDOT
Automated modeling and machine learning framework FEDOT
scottsunsh/GitHub-Chinese-Top-Charts
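:cn: GitHub Chinese Top Charts: helps you discover highly rated, excellent Chinese-language projects and absorb the best experience and achievements of Chinese developers more efficiently. The charts are updated weekly, so stay tuned! (Stay strong, Wuhan! Stay strong, **! Stay strong, world!)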
scottsunsh/gridstudio
Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
scottsunsh/h2o-meetups
Presentations from H2O meetups & conferences by the H2O.ai team
scottsunsh/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
scottsunsh/Microsoft-Power-BI-Performance-Best-Practices
Microsoft Power BI Performance Best Practices, published by Packt
scottsunsh/neo4j-etl
Data import from relational databases to Neo4j.
scottsunsh/pai
Resource scheduling and cluster management for AI
scottsunsh/papermill
📚 Parameterize, execute, and analyze notebooks
scottsunsh/procfwk
A cross tenant metadata driven processing framework for Azure Data Factory and Azure Synapse Analytics achieved by coupling orchestration pipelines with a SQL database and a set of Azure Functions.
scottsunsh/qlik-py-tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
scottsunsh/RippleNet
A TensorFlow implementation of RippleNet
scottsunsh/spark-cdm-connector
scottsunsh/spark-standalone-cluster-on-docker
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
scottsunsh/WSO2-Training