data-analytics
There are 3039 repositories under the data-analytics topic.
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
oxnr/awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
javascriptdata/danfojs
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
lightdash/lightdash
Self-serve BI to 10x your data team ⚡️
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, with more integrations coming.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
diffgram/diffgram
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
running-elephant/datart
Datart is a next generation Data Visualization Open Platform
brimdata/zui
Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.
pretzelai/pretzelai
The modern replacement for Jupyter Notebooks
datageartech/datagear
Data visualization and analytics platform; freely build any data dashboard you want
dremio/dremio-oss
Dremio - the missing link in modern data
mining/mining
Business Intelligence (BI) in Python, OLAP
traildb/traildb
TrailDB is an efficient tool for storing and querying series of events
mariusandra/insights
Open Source Self-Hosted Business Intelligence Platform
arbox/data-science-with-ruby
Practical Data Science with Ruby based tools.
program-spiritual/DataAnalysisInAction
(Finished) Geek Time's 45-lecture practical data analysis course: detailed notes with Markdown, images, mind maps, code, and data that can be read directly, plus code tests
abixen/abixen-platform
Abixen Platform is a microservices-based software platform for building enterprise applications. It delivers functionality by creating individual microservices and integrating them through the provided CMS.
latitude-dev/latitude
Developer-first embedded analytics
elmoallistair/google-data-analytics
Google Data Analytics Professional Certificate
Squarespace/datasheets
Read data from, write data to, and modify the formatting of Google Sheets
arx-deidentifier/arx
ARX is a comprehensive open-source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks, and well-known privacy models such as k-anonymity, l-diversity, t-closeness and differential privacy.
mrankitgupta/Data-Analyst-Roadmap
I am sharing my journey into data analytics through Ken Jee's #66daysofdata challenge
essandess/isp-data-pollution
ISP Data Pollution to Protect Private Browsing History with Obfuscation
unytics/bigfunctions
Supercharge BigQuery with BigFunctions
BCG-X-Official/facet
Human-explainable AI.
metatron-app/metatron-discovery
A powerful and easy way to do big data discovery
gchq/stroom
Stroom is a highly scalable data storage, processing and analysis platform.
girder/girder
A data management platform for the web, developed by Kitware
blockchain-etl/bitcoin-etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
blockchain-etl/ethereum-etl-airflow
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee
RandomFractals/geo-data-viewer
Geo Data Analytics tool for VSCode IDE with kepler.gl support to generate and view maps 🗺️ without any Python 🐍, IPyWidgets ⚙️, pandas 🐼, Jupyter notebooks 📚, or ReactJS ⚛️ app code.
ActivitySchema/ActivitySchema
Repository for the ActivitySchema spec and supporting materials
aiguofer/gspread-pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
cathytanimura/sql_book
Code repository for the book SQL for Data Analysis
Desbordante/desbordante-core
Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.