iemejia
Committer and PMC member of Apache Beam and Apache Avro. Free education and Open Source enthusiast. Distributed Systems practitioner (victim?).
Microsoft
Pinned Repositories
avro
Apache Avro is a data serialization system.
beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
flink-docker
Docker packaging for Apache Flink
coursera-dl
Script for downloading Coursera.org videos and naming them.
edx-dl
A simple tool to download video lectures from edx.org (and other openedx sites)
catho
A file catalog utility inspired by Robert Vasicek's awesome Cathy project. Or my excuse to hack on something that I really need.
dotfiles
A repo to keep a copy of my dotfiles
formation-bigdata
playlistr
playlistr is a util to export and import playlists from/to streaming services
streamingcolombia
This project contains different utilities to watch streaming media from Colombia
iemejia's Repositories
iemejia/dotfiles
A repo to keep a copy of my dotfiles
iemejia/atlas
Apache Atlas
iemejia/avro
Mirror of Apache Avro
iemejia/azure-search-openai-demo
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
iemejia/azuredatastudio
Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases. Azure Data Studio supports Windows, macOS, and Linux, with immediate capability to connect to Azure SQL and SQL Server. Browse the extension library for more database support options including MySQL, PostgreSQL, and MongoDB.
iemejia/continuous-delivery-azure
Create two deployment workflows using GitHub Actions and Microsoft Azure.
iemejia/data-science
iemejia/data-science-minimal
iemejia/docker-japi-compliance-checker
Docker image for Java API Compliance Checker
iemejia/duckdb
DuckDB is an in-process SQL OLAP Database Management System
iemejia/fabric-playground
A repository with Microsoft Fabric related resources
iemejia/Fabric-Readiness
A collection of useful materials for presenters interested in topics related to Microsoft Fabric
iemejia/fabric-samples
Samples and data for Microsoft Fabric Learn content
iemejia/FLAML
A fast library for AutoML and tuning.
iemejia/handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
iemejia/incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
iemejia/jslogo
Logo in JavaScript
iemejia/moaw
Grab-and-go resources to help you learn new skills, with all the tools you need to create, host and share your own workshop
iemejia/notebooks
Easy to run notebooks with devcontainers
iemejia/openjdk-docker
Repository of Container Images for the official MSFT Build of OpenJDK
iemejia/parquet-mr
Apache Parquet
iemejia/practical-statistics-for-data-scientists
Code repository for O'Reilly book
iemejia/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
iemejia/spark
Apache Spark
iemejia/spark-website
Apache Spark Website
iemejia/sqltoolsservice
SQL Tools API service that provides SQL Server data management capabilities.
iemejia/SynapseML
Simple and Distributed Machine Learning
iemejia/template-template
<<Not a course>> A template to make course templates. Search and replace "TBD".
iemejia/WhatTheHack
A collection of challenge-based hackathons including student guide, coach guide, lecture presentations, sample/instructional code and templates. Please visit the What The Hack website at: https://aka.ms/wth
iemejia/www-site
The ASF Website