Pinned Repositories
dagcheck
advanced_arrays
Implementation of extendible array, and hash table structures in C++
awesome-beam
A curated list of awesome resources for Apache Beam
beam-covid-example
An example using Beam to analyze COVID-19 data for Beam Learning Month!
beam-workshop
Repository with utilities for a Beam workshop based on the Mobile Gaming Example Edit Add topics
beam_utils
A repo with a few tiny Apache Beam utilities that I've coded.
dagwhat
A local testing framework for Airflow DAGs
hanja-graph
Playing around with Sino-Korean words
ScopusScrapus
A few small routines to scrape the data from Elsevier's Scopus API
pabloem's Repositories
pabloem/awesome-beam
A curated list of awesome resources for Apache Beam
pabloem/dagwhat
A local testing framework for Airflow DAGs
pabloem/Zookeeper-example
A Zookeeper emulation
pabloem/beam
Mirror of Apache Beam
pabloem/beam-javascript
A Javascript SDK for Beam
pabloem/beam-learning-month
pabloem/beam-site
Mirror of Apache Beam Site
pabloem/beam-summit-website
Beam Summit website code
pabloem/chat-langchain
pabloem/chris_investigacion_ray
pabloem/click-to-deploy-repos
Source for Google Click to Deploy solutions listed on Google Cloud Marketplace.
pabloem/DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
pabloem/incubator-airflow
Apache Airflow (Incubating)
pabloem/langchain
⚡ Building applications with LLMs through composability ⚡
pabloem/logseq-nov2022
pabloem/opentelemetry-collector-contrib
Contrib repository for the OpenTelemetry Collector
pabloem/pabloem.github.io
Build a Jekyll blog in minutes, without touching the command line.
pabloem/pgmq
A lightweight distributed message queue. Like AWS SQS and RSMQ but on Postgres.
pabloem/postgresml
PostgresML is an AI application database. Download open source models from Huggingface, or train your own, to create and index LLM embeddings, generate text, or make online predictions using only SQL.
pabloem/ProyectoZookeeper
pabloem/pykep
PyKEP is a scientific library providing basic tools for research in interplanetary trajectory design.
pabloem/python-aiplatform
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.
pabloem/qdrant
Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/
pabloem/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
pabloem/ray_beam_runner
Ray-based Apache Beam runner
pabloem/rules
Falco rule repository
pabloem/spark
Apache Spark - A unified analytics engine for large-scale data processing
pabloem/teavm
Compiler of Java bytecode to JavaScript
pabloem/test-beam-bq
pabloem/vector
A high-performance observability data pipeline.