eddyzhow

Bangkok, Thailand

eddyzhow's Stars

tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ 60.9k 1.7k 2.6k 9.4k
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Language: Scala 39.3k 2k 0 28.2k
mingrammer/diagrams
:art: Diagram as Code for prototyping cloud system architectures
Language: Python 36.8k 400 4962.4k
getredash/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Language: Python 25.9k 577 2.5k 4.3k
EthicalML/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
17.3k 402 752.2k
twintproject/twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Language: Python 15.7k 328 1.2k 2.7k
aio-libs/aiohttp
Asynchronous HTTP client/server framework for asyncio and Python
Language: Python 14.9k 216 3k 2k
cayleygraph/cayley
An open-source graph database
Language: Go 14.8k 576 4901.3k
codelucas/newspaper
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
Language: Python 14.1k 387 6742.1k
quickwit-oss/tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
Language: Rust 11.8k 141 998651
postgresml/postgresml
Postgres with GPUs for ML/AI apps.
Language: Rust 5.9k 53 228292
vespa-engine/vespa
AI + Data, online. https://vespa.ai
Language: Java 5.6k 160 976589
Azure/azure-sdk-for-python
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
Language: Python 4.5k 436 10.1k 2.8k
ckan/ckan
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
Language: Python 4.4k 194 3.4k 2k
amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Language: Python 4.4k 234 685954
pyinfra-dev/pyinfra
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
Language: Python 3.8k 37 761373
adilkhash/Data-Engineering-HowTo
A list of useful resources to learn Data Engineering from scratch
3.4k 102 2495
indradb/indradb
A graph database written in rust
Language: Rust 2.2k 37 114112
gunnarmorling/awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
2k 56 8320
strapdata/elassandra
Elassandra = Elasticsearch + Apache Cassandra
Language: Java 1.7k 88 389198
jodal/pykka
🌀 Pykka makes it easier to build concurrent Python applications.
Language: Python 1.2k 34 78107
mateusz-brainhub/awesome-cto-resources
:bulb: A community-curated list of awesome resources to help you grow as a CTO
831 44 199
Machine-Learning-Tokyo/papers-with-annotations
Research papers with annotations, illustrations and explanations
830 84 275
Hydrospheredata/mist
Serverless proxy for Spark cluster
Language: Scala 327 39 17968
thespianpy/Thespian
Python Actor concurrency library
Language: Python 315 37 6562
Fitblip/wsstat
Websocket stress testing made beautiful
Language: Python 175 9 1224
MLBazaar/BTB
A simple, extensible library for developing AutoML systems
Language: Python 171 23 11641
o19s/hello-ltr
Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch
Language: Jupyter Notebook 163 18 4962
MLBazaar/MLBlocks
A library for composing end-to-end tunable machine learning pipelines.
Language: Python 114 14 7335
arsenvlad/docker-presto-adls-wasb
Example of a single node Presto with Azure Data Lake Store (ADLS) and Azure Storage Blob (WASB) access via Hive metastore
Language: Dockerfile 18 2 416
