DonYum's Stars
fireducks-dev/fireducks
Create an issue on FireDucks
rlabbe/filterpy
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoothers, and more. Has companion book 'Kalman and Bayesian Filters in Python'.
pykalman/pykalman
Kalman Filter, Smoother, and EM Algorithm for Python
feast-dev/feast
The Open Source Feature Store for Machine Learning
apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
almond-sh/almond
A Scala kernel for Jupyter
jupyter-incubator/sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
apache/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
microsoft/monaco-editor
A browser based code editor
huggingface/dataset-viewer
Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
sdv-dev/SDV
Synthetic data generation for tabular data
Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
optuna/optuna
A hyperparameter optimization framework
ResidentMario/missingno
Missing data visualization module for Python.
louisnw01/lightweight-charts-python
Python framework for TradingView's Lightweight Charts JavaScript library.
PAIR-code/facets
Visualizations for machine learning datasets
bokeh/jupyter_bokeh
An extension for rendering Bokeh content in JupyterLab notebooks
jupyterlite/jupyterlite
Wasm powered Jupyter running in the browser 💡
tradingview/lightweight-charts
Performant financial charts built with HTML5 canvas
hackingthemarkets/tradekit
a collection of open source server components and Python libraries for financial data projects and automated trading
twopirllc/pandas-ta
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
igrigorik/gharchive.org
GH Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis.
coyzeng/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
FXLP/MarkTool
DoTAT 是一款基于web、面向领域的通用文本标注工具,支持大规模实体标注、关系标注、事件标注、文本分类、基于字典匹配和正则匹配的自动标注以及用于实现归一化的标准名标注,同时也支持迭代标注、嵌套实体标注和嵌套事件标注。标注规范可自定义且同类型任务中可“一次创建多次复用”。通过分级实体集合扩大了实体类型的规模,并设计了全新高效的标注方式,提升了用户体验和标注效率。此外,本工具增加了审核环节,可对多人的标注结果进行一致性检验、自动合并和手动调整,提高了标注结果的准确率。
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs.