Pinned Repositories
beam
Mirror of Apache Beam (Incubating)
mini-dev-cluster
Mini YARN/DFS cluster for developing and testing YARN-based applications (e.g., Tez)
pyspark_gcs
GCS connector batteries for pyspark
simple-s3-scio-example
spark-schema-utils
spark-schema-utils
gce-github-runner
Ephemeral GCE/GCP GitHub self-hosted runner
sgkit
Scalable genetics toolkit
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
scio-idea-plugin
Scio IDEA plugin
ravwojdyla's Repositories
ravwojdyla/pyspark_gcs
GCS connector batteries for pyspark
ravwojdyla/spark-schema-utils
spark-schema-utils
ravwojdyla/awesome-runners
A curated list of awesome self-hosted GitHub Action runners in a large comparison matrix
ravwojdyla/bio2zarr
Convert bioinformatics file formats to Zarr
ravwojdyla/cpython
The Python programming language
ravwojdyla/duckdb-wasm
WebAssembly version of DuckDB
ravwojdyla/duckdbwasm-vitebrowser
Barebones example of querying with duckdb-wasm using Vite and just the browser (no front-end framework). No dataset file is loaded; the data is created using the generate_series function.
ravwojdyla/elver
Package python app in a Docker with ease
ravwojdyla/ensembl-genes
Extract the Ensembl genes catalog to simple tables
ravwojdyla/fastobo-py
Faultless AST for Open Biomedical Ontologies in Python.
ravwojdyla/icml2022
This repository contains code for the ICML 2022 submission: "Meaningfully Debugging Model Mistakes Using Conceptual Counterfactual Explanations"
ravwojdyla/inverting-proxy
Reverse proxy that inverts the direction of traffic
ravwojdyla/jupyterlite_test
ravwojdyla/langchain
⚡ Building applications with LLMs through composability ⚡
ravwojdyla/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
ravwojdyla/openai-python
The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language.
ravwojdyla/parquet-mr
Apache Parquet
ravwojdyla/pronto
A Python frontend to (Open Biomedical) Ontologies.
ravwojdyla/python-annoy-feedstock
A conda-smithy repository for python-annoy.
ravwojdyla/quadfeather
ravwojdyla/scikit-learn
scikit-learn: machine learning in Python
ravwojdyla/scio-idea-plugin-1
Scio IDEA plugin
ravwojdyla/sgkit
Statistical genetics toolkit
ravwojdyla/shap
A game theoretic approach to explain the output of any machine learning model.
ravwojdyla/spark
Apache Spark - A unified analytics engine for large-scale data processing
ravwojdyla/spectral_workshop
ravwojdyla/sqlfluff
A SQL linter and auto-formatter for Humans
ravwojdyla/visx
🐯 visx | visualization components
ravwojdyla/xarray
N-D labeled arrays and datasets in Python
ravwojdyla/zarr-python
An implementation of chunked, compressed, N-dimensional arrays for Python.