feng-tao
Engineering Leadership @databricks on Data Catalog && Data Lineage | @apache Airflow PMC and committer | @amundsen-io co-creator
@databricks @amundsen-io @apache San Francisco
Pinned Repositories
amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
AlgoPython
All Algorithms implemented in Python
awesome-business-intelligence
Actively curated list of awesome BI tools. PRs welcome!
awesome-flink
😎 A curated list of amazingly awesome Flink and Flink ecosystem resources
dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
engineer-manager
A list of engineering manager resource links.
free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍,欢迎投稿
Leetcode-4
Complete solutions to Leetcode problems; updated daily.
feng-tao's Repositories
feng-tao/parquet-format
Apache Parquet
feng-tao/actual-server
Actual's server
feng-tao/data-diff
Efficiently diff rows across two different databases.
feng-tao/databricks-cli
Command Line Interface for Databricks
feng-tao/databricks-ml-examples
feng-tao/dbt-databricks
A dbt adapter for Databricks.
feng-tao/dh-actions
DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.
feng-tao/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
feng-tao/elementary
Open-source data observability for analytics engineers.
feng-tao/ESL-CN
The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。
feng-tao/feng-tao
feng-tao/feng-tao.github.io
💎 🐳 A super customizable Jekyll theme for personal site, team site, blog, project, documentation, etc.
feng-tao/googleapis
Public interface definitions of Google APIs.
feng-tao/gorilla
Gorilla: An API store for LLMs
feng-tao/jaffle_shop
A self-contained dbt project for testing purposes (test dbt databricks lineage)
feng-tao/langchain
⚡ Building applications with LLMs through composability ⚡
feng-tao/lucene
Apache Lucene open-source search software
feng-tao/Luotuo-Chinese-LLM
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
feng-tao/metricflow
MetricFlow allows you to define, build, and maintain metrics in code.
feng-tao/minbpe
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
feng-tao/mlflow
Open source platform for the machine learning lifecycle
feng-tao/naarad
Naarad is a highly configurable system analysis tool that parses and plots timeseries data for better visual correlation. Naarad was built to help in performance analysis and investigations.
feng-tao/opsreview
Compile a report of recent PagerDuty alerts for a single escalation policy.
feng-tao/pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
feng-tao/patterns-of-distributed-systems
《Patterns of Distributed Systems》中文版
feng-tao/pycaret
An open-source, low-code machine learning library in Python
feng-tao/recap
Recap tracks and transform schemas across your whole application.
feng-tao/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
feng-tao/ToDo
Manage your ToDos by Github Issues and Projects
feng-tao/unitycatalog
Open, Multi-modal Catalog for Data & AI