Pinned Repositories
pyrobuf
A Cython alternative to Google's Python Protobuf library
bfs_in_parallel
breadth-first search in parallel
cgg
a pl0 parser implemented by Python
HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
pandasticsearch
An Elasticsearch client exposing DataFrame API
PDF-Chatbot-Local-LLM-Embeddings
Prompt_Engineering_with_Qwen
Qwen 提示词工程 & 最佳实践
weakpoint
static slideshow generator
sqlfluff
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
onesuper's Repositories
onesuper/pandasticsearch
An Elasticsearch client exposing DataFrame API
onesuper/HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
onesuper/PDF-Chatbot-Local-LLM-Embeddings
onesuper/Prompt_Engineering_with_Qwen
Qwen 提示词工程 & 最佳实践
onesuper/vivid_schemer
REPL for The Little Schemer
onesuper/awesome-java-cn
Java资源大全中文版,包括开发库、开发工具、网站、博客、微信、微博等,由伯乐在线持续更新。
onesuper/daydayup
Daily report authoring tool for baixingers
onesuper/a-little-java
onesuper/airflow
Apache Airflow
onesuper/codelab-mindstorms
CodeLab Mindstorms关注编程教育, 计划翻译和解读编程教育领域优秀的探索者所做的工作。
onesuper/httpie-hmac-auth
Auth plugin for debugging ODPS API with httpie
onesuper/inference
Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models.
onesuper/awesome-business-intelligence
Actively curated list of awesome BI tools. PRs welcome!
onesuper/community-supported-connectors
onesuper/datapane
Datapane makes it simple to build shareable reports from Python.
onesuper/dbt_stripe_source
Fivetran's Stripe source dbt package
onesuper/docs.getdbt.com
The code behind docs.getdbt.com
onesuper/ecosystem-projects
onesuper/frameless
Expressive types for Spark.
onesuper/image-hosting
onesuper/jedis-cluster-ext
extend jedis cluster to support pipeline
onesuper/lightdash
An open source alternative to Looker built using dbt. Made for analysts ❤️
onesuper/Orestes-Bloomfilter
Library of different Bloom filters in Java with optional Redis-backing, counting and many hashing options.
onesuper/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
onesuper/spark-janelia
scripts for using spark on janelia's cluster
onesuper/spark-jobserver
REST job server for Apache Spark
onesuper/Stream-Framework
Stream Framework is a Python library, which allows you to build newsfeed and notification systems using Cassandra and/or Redis.
onesuper/tellery
Tellery helps analysts organize analyses and narrate them in one place. As easy as to use a notebook. As powerful as a data modeling tool.
onesuper/xorbits
Scalable Python data science, in an API compatible & lightning fast way.
onesuper/yellowbase