Pinned Repositories
Apache-Sedona-Spatial-Analytics
An overview of using Apache Sedona for geospatial analytics
CDE_Tour_ACE_HOL
CML_LLM_HOL_Workshop
A hands on lab for Cloudera machine learning
CML_MLOps_ACE_Workshop
CSA2CML
dask_distributed_quickstart_cml
MLOps
Oozie2CDE_Migration
Spark3_Iceberg_CML
Using_CDE_Airflow
pdefusco's Repositories
pdefusco/CML_DataGen_Utils
Datagen Utility for dbldatagen in CML
pdefusco/CML_MLOps_Logistics_DEV
DEV Project for CML Logistics HOL
pdefusco/CML_MLOps_Telco_MLFlow
CML Demo focusing on MLOps in the Telco Industry including MLFlow, Spark, Iceberg, and XGBoost
pdefusco/CDE_121_HOL
Cloudera Data Engineering Hands on Lab based on Version 1.21
pdefusco/CDE_122_HOL
Cloudera Data Engineering Hands on Lab based on Version 1.22
pdefusco/cde_3rdp_airflow_providers
A Repository Containing Examples of 3rd Party Airflow Providers in CDE
pdefusco/cde_airflow_python_envs
An example of a Python Environment used by a CDE Airflow Job
pdefusco/cde_cdepy_articles
Collection of Articles focused on CDEPY in Cloudera Data Engineering
pdefusco/CDE_CLI_Articles
Some useful commands you must know when you are using the CDE CLI
pdefusco/CDE_Demo_Auto_Deploy
An Automated, Dockerized CDE Demo including Spark, Airflow, and Iceberg.
pdefusco/cde_git_repo
A sample git repo containing code to demonstrate cde git integration available in version 1.20
pdefusco/cde_iceberg_articles
Collection of Articles focused on Apache Iceberg in Cloudera Data Engineering
pdefusco/CDE_Spark_Bucketing
Spark Bucketing Examples
pdefusco/cdepy
pdefusco/CML_afrank_sko_demo
pdefusco/CML_AMP-Chromadb-rest-api
A rest api server to upsert document into chromadb
pdefusco/CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2
The prototype deploys an Application in CML using a Llama2 model from Hugging Face to answer questions augmented with knowledge extracted from the website. This prototype introduces Pinecone as a database for storing vectors for semantic search.
pdefusco/CML_MLops_Banking_MLFlow
Banking MLOps Demo with MLFlow
pdefusco/CML_MLOps_Healthcare_MLFlow
A demo of MLOps in Healthcare in CML with MLFlow, XGBoost, Spark and Iceberg
pdefusco/CML_MLOps_Healthcare_PRD
Artifacts to manage Registry Models in a PRD Workspace
pdefusco/CML_MLOps_Logistics_HOL
Hands On Labs of Cloudera Machine Learning in Logistics with Time Series Data
pdefusco/CML_MLOps_Logistics_Mlflow
Third and final piece of Logistics MLOps HOL focused on MLFLOW
pdefusco/CML_MLOps_Logistics_PRD
PRD Project for Logistics CML HOL
pdefusco/CML_MLOps_Telco_PRD
pdefusco/CML_Serving_Demo
Demo of CML Serving
pdefusco/flink-tutorials
pdefusco/gpu-optimization-workshop
Slides, notes, and materials for the workshop
pdefusco/marketing-campaign-pyspark-cdsw
pdefusco/Ray_on_CML_QuickStart_AMP
pdefusco/sparknlp
SparkNLP Quickstart