Pinned Repositories
hudi
Upserts, Deletes And Incremental Processing on Big Data.
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
deduplicate-text-datasets
rasa-nlu-benchmark
Collection of datasets and corresponding benchmarks for Rasa NLU
data-deduplication
Deduplication of big-data documents with MinHash
rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
sdyxl-auto-script
TencentDocs-smartcanvas2markdown
Exports Tencent Docs smart documents as Markdown files, and downloads their images into corresponding numbered folders for use by other software.
zengyangjie's Repositories
zengyangjie/TencentDocs-smartcanvas2markdown
Exports Tencent Docs smart documents as Markdown files, and downloads their images into corresponding numbered folders for use by other software.
zengyangjie/data-deduplication
Deduplication of big-data documents with MinHash
zengyangjie/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
zengyangjie/sdyxl-auto-script