vishalkhondre

python, oracle SQL, PL/SQL, Google Cloud

Pune, India

vishalkhondre's Stars

malloydata/malloy
Malloy is an experimental language for describing data relationships and transformations.
Language:TypeScript2k77
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
Language:Java9.4k1.9k
run-llama/llama-hub
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
Language:Jupyter Notebook3.5k737
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook96.3k15.7k
great-expectations/great_expectations
Always know what to expect from your data.
Language:Python10.1k1.6k
logicalclocks/hopsworks
Hopsworks - Data-Intensive AI platform with a Feature Store
Language:Java1.2k145
rilldata/rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Language:Go1.8k122
julianhyde/sqlline
Shell for issuing SQL to relational databases via JDBC
Language:Java626148
linkedin/Hoptimator
Multi-hop declarative data pipelines
Language:Java10212
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Language:Python8k781
tobymao/sqlglot
Python SQL Parser and Transpiler
Language:Python6.9k730
elyase/geotext
Geotext extracts country and city mentions from text
Language:Python13748
apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Language:Java13.1k4.7k
apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Language:Java8.1k1.9k
superstreamlabs/memphis
Memphis.dev is a highly scalable and effortless data streaming platform
Language:Go3.3k219
Breaka84/Spooq
Language:Python81
aws/aws-cdk
The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code
Language:TypeScript11.7k4k
Beuth-Erdelt/Benchmark-Experiment-Host-Manager
This python tool helps managing DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It enables users to configure hardware / software setups for easily repeating tests over varying configurations.
Language:Jupyter Notebook6
Beuth-Erdelt/DBMS-Benchmarker
DBMS-Benchmarker is a Python-based application-level blackbox benchmark tool for Database Management Systems (DBMS). It connects to a given list of DBMS (via JDBC) and runs a given list of parametrized and randomized (SQL) benchmark queries. Evaluations are available via a Python interface and on an interactive multi-dimensional dashboard.
Language:HTML133
Swiple/swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
Language:Python8111
petl-developers/petl
Python Extract Transform and Load Tables of Data
Language:Python1.3k193
frictionlessdata/frictionless-py
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data
Language:Python726148
frictionlessdata/datapackage
Data Package is a standard consisting of a set of simple yet extensible specifications to describe datasets, data files and tabular data. It is a data definition language (DDL) and data API that facilitates findability, accessibility, interoperability, and reusability (FAIR) of data.
Language:MDX502114
papers-we-love/papers-we-love
Papers from the computer science community to read and discuss.
Language:Shell88.8k5.8k
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
Language:Rust2k162
zsvoboda/ngods
New generation opensource data stack
Language:Dockerfile628
edornd/clidantic
Typed Command Line Interfaces powered by Click and Pydantic
Language:Python243
apache/iceberg
Apache Iceberg
Language:Java6.6k2.3k
zsvoboda/dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Language:Python572
zsvoboda/ngods-stocks
New Generation Opensource Data Stack Demo
Language:Jupyter Notebook41595

vishalkhondre

vishalkhondre's Stars

malloydata/malloy

StarRocks/starrocks

run-llama/llama-hub

langchain-ai/langchain

great-expectations/great_expectations

logicalclocks/hopsworks

rilldata/rill

julianhyde/sqlline

linkedin/Hoptimator

mage-ai/mage-ai

tobymao/sqlglot

elyase/geotext

apache/dolphinscheduler

apache/seatunnel

superstreamlabs/memphis

Breaka84/Spooq

aws/aws-cdk

Beuth-Erdelt/Benchmark-Experiment-Host-Manager

Beuth-Erdelt/DBMS-Benchmarker

Swiple/swiple

petl-developers/petl

frictionlessdata/frictionless-py

frictionlessdata/datapackage

papers-we-love/papers-we-love

sfu-db/connector-x

zsvoboda/ngods

edornd/clidantic

apache/iceberg

zsvoboda/dbd

zsvoboda/ngods-stocks