data-analytics

There are 3039 repositories under the data-analytics topic.

  • superset

    apache/superset

    Apache Superset is a Data Visualization and Data Exploration Platform

    Language:TypeScript59.7k1.5k10.3k12.8k
  • oxnr/awesome-bigdata

    A curated list of awesome big data frameworks, resources and other awesomeness.

  • danfojs

    javascriptdata/danfojs

    Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.

    Language:TypeScript4.7k32372207
  • lightdash/lightdash

    Self-serve BI to 10x your data team ⚡️

    Language:TypeScript3.5k265k368
  • lancedb/lance

    Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.

    Language:Rust3.4k38799175
  • pathwaycom/pathway

    Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

    Language:Python2.3k205386
  • diffgram

    diffgram/diffgram

    The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

    Language:Python1.8k29827118
  • running-elephant/datart

    Datart is a next-generation Data Visualization Open Platform

    Language:TypeScript1.8k481.1k549
  • brimdata/zui

    Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

    Language:TypeScript1.7k291k129
  • pretzelai/pretzelai

    The modern replacement for Jupyter Notebooks

    Language:TypeScript1.5k938104
  • datageartech/datagear

    Data visualization and analysis platform; freely build any data dashboard you want

    Language:Java1.3k3023318
  • dremio/dremio-oss

    Dremio - the missing link in modern data

    Language:Java1.3k880426
  • mining/mining

    Business Intelligence (BI) in Python, OLAP

    Language:Python1.3k118184232
  • traildb/traildb

    TrailDB is an efficient tool for storing and querying series of events

    Language:C1.1k785676
  • mariusandra/insights

    Open Source Self-Hosted Business Intelligence Platform

    Language:JavaScript1.1k291170
  • arbox/data-science-with-ruby

    Practical Data Science with Ruby based tools.

    Language:Ruby69540251
  • program-spiritual/DataAnalysisInAction

    (Finished) Notes for Geek Time's 45-lecture practical data analysis course: detailed Markdown notes with images, mind maps, code, and data that can be read directly; the code is tested

    Language:Python6943515277
  • abixen-platform

    abixen/abixen-platform

    Abixen Platform is a microservices-based software platform for building enterprise applications. Functionality is delivered by creating individual microservices and integrating them through the provided CMS.

    Language:Java679102776211
  • latitude-dev/latitude

    Developer-first embedded analytics

    Language:TypeScript658510225
  • elmoallistair/google-data-analytics

    Google Data Analytics Professional Certificate

    Language:Jupyter Notebook655201302
  • Squarespace/datasheets

    Read data from, write data to, and modify the formatting of Google Sheets

    Language:Python618361159
  • arx-deidentifier/arx

    ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

    Language:Java60134192212
  • mrankitgupta/Data-Analyst-Roadmap

    I am sharing my 66DaysofData journey into data analytics as a participant in Ken Jee's #66daysofdata challenge

  • essandess/isp-data-pollution

    ISP Data Pollution to Protect Private Browsing History with Obfuscation

    Language:Python582432953
  • bigfunctions

    unytics/bigfunctions

    Supercharge BigQuery with BigFunctions

    Language:Python53376546
  • facet

    BCG-X-Official/facet

    Human-explainable AI.

    Language:Jupyter Notebook500123046
  • metatron-app/metatron-discovery

    Powerful & Easy way for big data discovery

    Language:TypeScript432232.7k108
  • gchq/stroom

    Stroom is a highly scalable data storage, processing and analysis platform.

    Language:Java423313.1k56
  • girder/girder

    A data management platform for the web, developed by Kitware

    Language:Python421391.1k173
  • blockchain-etl/bitcoin-etl

    ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ

    Language:Python3883143113
  • blockchain-etl/ethereum-etl-airflow

    Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee

    Language:Python3871652180
  • geo-data-viewer

    RandomFractals/geo-data-viewer

    Geo Data Analytics tool for VSCode IDE with kepler.gl support to generate and view maps 🗺️ without any Python 🐍, IPyWidgets ⚙️, pandas 🐼, Jupyter notebooks 📚, or ReactJS ⚛️ app code.

    Language:HTML3861415341
  • ActivitySchema/ActivitySchema

    Repository for the ActivitySchema spec and supporting materials

  • gspread-pandas

    aiguofer/gspread-pandas

    A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.

    Language:Python384136052
  • cathytanimura/sql_book

    Code repository for the book SQL for Data Analysis

  • desbordante-core

    Desbordante/desbordante-core

    Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also supports running data cleaning scenarios with these algorithms. Desbordante has a console version and an easy-to-use web application.

    Language:C++36087161