pipeline

There are 5210 repositories under pipeline topic.

  • serve

    jina-ai/serve

    ☁️ Build multimodal AI applications with cloud-native stack

    Language:Python21.1k2131.9k2.2k
  • vector

    vectordotdev/vector

    A high-performance observability data pipeline.

    Language:Rust18.2k1547.7k1.6k
  • argoproj/argo-cd

    Declarative Continuous Deployment for Kubernetes

    Language:Go18k1808.4k5.5k
  • PrefectHQ/prefect

    Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

    Language:Python17.5k1665.7k1.6k
  • airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Language:Python16.2k18714.7k4.1k
  • taipy

    Avaiga/taipy

    Turns Data and AI algorithms into production-ready web applications in no time.

    Language:Python15.5k778731.9k
  • kestra

    kestra-io/kestra

    :zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

    Language:Java13.1k1612.6k1.1k
  • kedro

    kedro-org/kedro

    Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

    Language:Python10k1092k906
  • great-expectations/great_expectations

    Always know what to expect from your data.

    Language:Python10k851.9k1.5k
  • prql

    PRQL/prql

    PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

    Language:Rust10k451k218
  • tektoncd/pipeline

    A cloud-native Pipeline resource.

    Language:Go8.5k1302.9k1.8k
  • mage-ai/mage-ai

    🧙 Build, run, and manage data pipelines for integrating and transforming data.

    Language:Python8k63877774
  • marimo-team/marimo

    A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

    Language:Python7.9k40774278
  • projectdiscovery/httpx

    httpx is a fast and multi-purpose HTTP toolkit that allows running multiple probes using the retryablehttp library.

    Language:Go7.8k80635845
  • proposal-pipeline-operator

    tc39/proposal-pipeline-operator

    A proposal for adding a useful pipe operator to JavaScript.

    Language:HTML7.6k255234108
  • brunch/brunch

    :fork_and_knife: Web applications made easy. Since 2011.

    Language:JavaScript6.8k1341.3k430
  • iam-veeramalla/Jenkins-Zero-To-Hero

    Install Jenkins, configure Docker as slave, set up cicd, deploy applications to k8s using Argo CD in GitOps way.

    Language:Python6.6k3271812.4k
  • nteract/papermill

    📚 Parameterize, execute, and analyze notebooks

    Language:Python6k86403428
  • gaia-pipeline/gaia

    Build powerful pipelines in any programming language.

    Language:Go5.2k105161245
  • GameDevMind

    gonglei007/GameDevMind

    最全面的游戏开发技术图谱。帮助游戏开发者们在已知问题上节省时间,省出更多的精力投入到更有创造性的工作中去。

    Language:Shell5.1k672547
  • jx

    jenkins-x/jx

    Jenkins X provides automated CI+CD for Kubernetes with Preview Environments on Pull Requests using Cloud Native pipelines from Tekton

    Language:Go4.6k1214.2k788
  • Production-Level-Deep-Learning

    alirezadir/Production-Level-Deep-Learning

    A guideline for building practical production-level deep learning systems to be deployed in real world applications.

  • cube-studio

    tencentmusic/cube-studio

    cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

    Language:Jupyter Notebook3.7k73146652
  • kubeflow/pipelines

    Machine Learning Pipelines for Kubeflow

    Language:Python3.6k1013.9k1.6k
  • towhee-io/towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Language:Python3.2k29665253
  • yuanzhoulvpi2017/zero_nlp

    中文nlp解决方案(大模型、数据、模型、训练、推理)

    Language:Jupyter Notebook3k30197368
  • EpicGamesExt/BlenderTools

    Blender addons that improve the game development workflow between Blender and Unreal.

    Language:Python2.8k23851461
  • nextflow

    nextflow-io/nextflow

    A DSL for data-driven computational pipelines

    Language:Groovy2.8k893.3k631
  • alibaba/pipcook

    Machine learning platform for Web developers

    Language:TypeScript2.5k48267209
  • EntilZha/PyFunctional

    Python library for creating data pipelines with chain functional programming

    Language:Python2.4k49138132
  • firecow/gitlab-ci-local

    Tired of pushing to test your .gitlab-ci.yml?

    Language:TypeScript2.4k17524135
  • apptension/saas-boilerplate

    SaaS Boilerplate - Open Source and free SaaS stack that lets you build SaaS products faster in React, Django and AWS. Focus on essential business logic instead of coding repeatable features!

    Language:TypeScript2.3k18174272
  • instill-core

    instill-ai/instill-core

    🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications

    Language:Makefile2.2k30519107
  • mara/mara-pipelines

    A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

    Language:Python2.1k5634103
  • go-streams

    reugn/go-streams

    A lightweight stream processing library for Go

    Language:Go1.9k2832157
  • hu17889/go_spider

    [爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

    Language:Go1.8k15424471