Pinned Repositories
aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
advanced-java
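😮 A complete primer on advanced knowledge for Internet Java engineers: covers high concurrency, distributed systems, high availability, microservices, massive data processing, and more. A must-read for back-end developers, and front-end developers can learn from it too.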
all-docs
"All Docs" is a tool that enables online previewing, storage, and sharing of documents such as Word, Excel, PowerPoint, PDF, and images (Pic). It supports full-text search for all document information. “全文档”(All Docs),Word, Excel, PPT, PDF, Pic等文档在线预览、存储、共享的工具,并且支持全文搜索的所有的文档信息。
ambari-elasticsearch-service
Ambari Elasticsearch plugins
ambari-flink-service
Flink integration for Ambari
ambari-hue-service
Integrates Hue 4.11.0 with Ambari 2.7.5 and HDP 3.1.5
Ambari-Spark3
ambari_es
A plugin for adding an Elasticsearch cluster service to Ambari
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Best_AI_paper_2020
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
kxg916361108's Repositories
kxg916361108/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code.
kxg916361108/elasticsearch
Open Source, Distributed, RESTful Search Engine
kxg916361108/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
kxg916361108/dinky
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
kxg916361108/mlflow
Open source platform for the machine learning lifecycle
kxg916361108/kubeflow
Machine Learning Toolkit for Kubernetes
kxg916361108/datax-web
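A visual web UI for DataX: select a data source to generate a data synchronization task in one click. Supports RDBMS, Hive, HBase, ClickHouse, MongoDB, and other data sources; batch creation of RDBMS synchronization tasks; integration with an open-source scheduling system; distributed execution; incremental data synchronization; real-time viewing of run logs; monitoring of executor resources; killing running processes; and encryption of data source information.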
kxg916361108/gpt4free
The official gpt4free repository | a collection of various powerful language models
kxg916361108/EasySpider
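A visual no-code/code-free web crawler/spider. EasySpider (易采集) is a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.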
kxg916361108/vnpy
A Python-based open-source framework for developing quantitative trading platforms
kxg916361108/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
kxg916361108/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
kxg916361108/ChatPaper
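Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.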
kxg916361108/langchain
🦜🔗 Build context-aware reasoning applications
kxg916361108/MoneyPrinterTurbo
Generate short videos in one click using large models
kxg916361108/datavines
Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.
kxg916361108/MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
kxg916361108/ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
kxg916361108/spark
Apache Spark - A unified analytics engine for large-scale data processing
kxg916361108/crawlab
Distributed web crawler admin platform for spider management, regardless of languages and frameworks.
kxg916361108/onedev
Git Server with CI/CD, Kanban, and Packages. Ultra Easy to Set Up and Maintain.
kxg916361108/all-docs
"All Docs" is a tool that enables online previewing, storage, and sharing of documents such as Word, Excel, PowerPoint, PDF, and images (Pic). It supports full-text search for all document information. “全文档”(All Docs),Word, Excel, PPT, PDF, Pic等文档在线预览、存储、共享的工具,并且支持全文搜索的所有的文档信息。
kxg916361108/everyone-can-use-english
Everyone Can Use English
kxg916361108/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
kxg916361108/Doris-On-Ambari
Doris integration for Ambari
kxg916361108/Rath
Next-generation platform for automated exploratory data analysis and visualization.
kxg916361108/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
kxg916361108/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
kxg916361108/GPTs
Leaked prompts of GPTs (a prompt collection project)
kxg916361108/best_AI_papers_2023
A curated list of the latest breakthroughs in AI (in 2023) by release date with a clear video explanation, link to a more in-depth article, and code.