Pinned Repositories
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
awesome-data-engineering
A curated list of data engineering tools for software developers
fork-stats
Get statistics about who forked a Github repository
itmatch
Connect students with similar IT interets
job-scraper
Scraping jobs data from Stack Overflow Careers
theseus
Data Mining Thesis Topics in Finland
valohai-fasttext-example
Production Machine Learning Pipeline for Text Classification with fastText
scrapingbee-python
A Python library to interact with ScrapingBee's API for headless browsers and proxy rotation
scrapy-scrapingbee
JavaScript support and proxy rotation for Scrapy with ScrapingBee.
airflow-valohai-plugin
:shark: Airflow plugin to scale machine learning tasks with Valohai and get automatic version control
arimbr's Repositories
arimbr/valohai-fasttext-example
Production Machine Learning Pipeline for Text Classification with fastText
arimbr/theseus
Data Mining Thesis Topics in Finland
arimbr/awesome-data-engineering
A curated list of data engineering tools for software developers
arimbr/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
arimbr/airflow
Apache Airflow
arimbr/arxiv-sanity-preserver
Web interface for browsing, search and filtering recent arxiv submissions
arimbr/Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
arimbr/awesome-notebooks
😎 Awesome list of public Jupyter Notebooks on all kinds of interesting topics, published by the community.
arimbr/bonobo
ALPHA - Extract Transform Load for Python 3.5+
arimbr/boto3
AWS SDK for Python
arimbr/charts
Curated applications for Kubernetes
arimbr/data-diff
Efficiently diff data in or across relational databases
arimbr/datafold-docs
Datafold documentation including Overviews, Guides, APIs, Examples, & FAQs
arimbr/django-rest-framework
Web APIs for Django. ⚡️
arimbr/docs
arimbr/docusaurus
Easy to maintain open source documentation websites.
arimbr/elementary
Open-source data observability for analytics engineers.
arimbr/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
arimbr/literal-docs
arimbr/modern-data-stack-demo
arimbr/modern-data-stack-demo-dbt
arimbr/pm-ai
arimbr/pygsheets
Google Sheets Python API v4
arimbr/scrapingbee-python
A Python library to interact with ScrapingBee's API for headless browsers and proxy rotation
arimbr/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
arimbr/scrapy-history-middleware
Scrapy middleware for building historical S3 cache for crawled web resources.
arimbr/scrapy-scrapingbee
JavaScript support and proxy rotation for Scrapy with ScrapingBee.
arimbr/strapi-cloud-template-blog-523454d056
arimbr/valohai-cli
:heavy_dollar_sign: Command line client for Valohai
arimbr/wikidata
several scripts for exploration of Wikidata