Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Principle:Ucbepic Docetl Pipeline Assembly And Execution

From Leeroopedia
Revision as of 18:18, 16 February 2026 by Admin (talk | contribs) (Auto-imported from principles/Ucbepic_Docetl_Pipeline_Assembly_And_Execution.md)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)


Knowledge Sources
Domains Data_Engineering, Pipeline_Orchestration
Last Updated 2026-02-08 01:40 GMT

Overview

A programmatic pipeline orchestration principle that assembles datasets, operations, and steps into a runnable Pipeline object via the Python API.

Description

Pipeline Assembly and Execution combines dataset objects, operation schemas, pipeline steps, and output configuration into a single Pipeline object that can be run, optimized, or exported to YAML. The Pipeline class internally converts to a YAML-equivalent dict and delegates execution to DSLRunner.

Usage

Use this principle for programmatic pipeline construction when YAML configuration is insufficient (e.g., dynamic operation generation, conditional pipelines, notebook-based workflows).

Theoretical Basis

Programmatic pipeline composition:

  1. Object Construction: Create typed Dataset, Operation, Step, and Output objects
  2. Assembly: Compose into a Pipeline object with named datasets
  3. Conversion: Pipeline internally converts to dict format matching YAML schema
  4. Delegation: Execution delegated to DSLRunner.load_run_save()

Related Pages

Implemented By

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment