pipeline

There are 4883 repositories under pipeline topic.

  • jina

    jina-ai/jina

    ☁️ Build multimodal AI applications with cloud-native stack

    Language:Python20.2k2081.9k2.2k
  • vector

    vectordotdev/vector

    A high-performance observability data pipeline.

    Language:Rust16.7k1497.4k1.4k
  • argoproj/argo-cd

    Declarative Continuous Deployment for Kubernetes

    Language:Go16.3k1827.6k4.9k
  • PrefectHQ/prefect

    Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

    Language:Python14.8k1595k1.5k
  • airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Language:Python14.3k17813.7k3.7k
  • great-expectations/great_expectations

    Always know what to expect from your data.

    Language:Python9.5k821.8k1.5k
  • prql

    PRQL/prql

    PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

    Language:Rust9.5k44932204
  • kedro

    kedro-org/kedro

    Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

    Language:Python9.4k1041.8k871
  • Avaiga/taipy

    Turns Data and AI algorithms into production-ready web applications in no time.

    Language:Python8.8k59532643
  • tektoncd/pipeline

    A cloud-native Pipeline resource.

    Language:Go8.3k1302.8k1.8k
  • proposal-pipeline-operator

    tc39/proposal-pipeline-operator

    A proposal for adding a useful pipe operator to JavaScript.

    Language:HTML7.4k254230109
  • mage-ai/mage-ai

    🧙 Build, run, and manage data pipelines for integrating and transforming data.

    Language:Python7.2k61646650
  • projectdiscovery/httpx

    httpx is a fast and multi-purpose HTTP toolkit that allows running multiple probes using the retryablehttp library.

    Language:Go6.9k79549768
  • brunch/brunch

    :fork_and_knife: Web applications made easy. Since 2011.

    Language:JavaScript6.8k1341.3k436
  • kestra

    kestra-io/kestra

    Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

    Language:Java6.6k611.7k377
  • nteract/papermill

    📚 Parameterize, execute, and analyze notebooks

    Language:Python5.7k89396416
  • gaia-pipeline/gaia

    Build powerful pipelines in any programming language.

    Language:Go5.2k107161242
  • iam-veeramalla/Jenkins-Zero-To-Hero

    Install Jenkins, configure Docker as slave, set up cicd, deploy applications to k8s using Argo CD in GitOps way.

    Language:Python5k279159k
  • jx

    jenkins-x/jx

    Jenkins X provides automated CI+CD for Kubernetes with Preview Environments on Pull Requests using Cloud Native pipelines from Tekton

    Language:Go4.5k1194.2k783
  • GameDevMind

    gonglei007/GameDevMind

    最全面的游戏开发技术图谱。帮助游戏开发者们在已知问题上节省时间,省出更多的精力投入到更有创造性的工作中去。

    Language:Shell4.5k622486
  • marimo-team/marimo

    A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

    Language:Python4.3k21318118
  • Production-Level-Deep-Learning

    alirezadir/Production-Level-Deep-Learning

    A guideline for building practical production-level deep learning systems to be deployed in real world applications.

  • kubeflow/pipelines

    Machine Learning Pipelines for Kubeflow

    Language:Python3.5k1033.7k1.6k
  • towhee-io/towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Language:Python3k29653240
  • EpicGamesExt/BlenderTools

    Blender addons that improve the game development workflow between Blender and Unreal.

    Language:Python2.6k23450514
  • cube-studio

    tencentmusic/cube-studio

    cube studio开源云原生一站式机器学习/深度学习AI平台,支持sso登录,多租户/多项目组,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

    Language:Jupyter Notebook2.6k68141485
  • nextflow

    nextflow-io/nextflow

    A DSL for data-driven computational pipelines

    Language:Groovy2.6k853.1k594
  • yuanzhoulvpi2017/zero_nlp

    中文nlp解决方案(大模型、数据、模型、训练、推理)

    Language:Python2.5k30171324
  • alibaba/pipcook

    Machine learning platform for Web developers

    Language:TypeScript2.5k49267204
  • EntilZha/PyFunctional

    Python library for creating data pipelines with chain functional programming

    Language:Python2.3k48133130
  • mara/mara-pipelines

    A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

    Language:Python2.1k5634102
  • firecow/gitlab-ci-local

    Tired of pushing to test your .gitlab-ci.yml?

    Language:TypeScript1.9k16453109
  • instill-core

    instill-ai/instill-core

    🔮 Instill Core is an open-source no-/low-code data, model and pipeline orchestration platform, providing a full-stack solution for AI-first applications

    Language:Makefile1.9k29080
  • hu17889/go_spider

    [爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

    Language:Go1.8k15424474
  • edyoda/data-science-complete-tutorial

    For extensive instructor led learning

    Language:Jupyter Notebook1.8k665764
  • apptension/saas-boilerplate

    SaaS Boilerplate - Open Source and free SaaS stack that lets you build SaaS products faster in React, Django and AWS. Focus on essential business logic instead of coding repeatable features!

    Language:TypeScript1.8k15153185