Principle:Ucbepic Docetl Optimized Pipeline Review
| Knowledge Sources | |
|---|---|
| Domains | Optimization, Decision_Making |
| Last Updated | 2026-02-08 01:40 GMT |
Overview
A decision-making principle for selecting and deploying the best optimized pipeline variant from a Pareto frontier of cost-accuracy tradeoffs.
Description
After MCTS search completes, the Pareto frontier contains multiple pipeline variants, each offering a different tradeoff between LLM cost and output accuracy. Optimized Pipeline Review is the process of:
- Examining the Pareto frontier visualization (cost vs accuracy scatter plot)
- Comparing optimized variants against the baseline
- Selecting the variant that best matches deployment constraints (budget, quality threshold)
- Running the selected optimized pipeline on the full dataset
Usage
Apply this principle after MOAR optimization completes. Review the Pareto frontier plots in the save_dir/graph/ directory, select the best pipeline variant, and run it with docetl run.
Theoretical Basis
Multi-objective decision making on Pareto frontiers:
- Dominance: A plan dominates another if it is better on at least one objective and no worse on all others
- Frontier: The set of non-dominated plans forms the Pareto frontier
- Selection: Choose based on deployment constraints (cost budget, minimum accuracy)
- Validation: Run selected variant on full dataset to confirm performance