Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Principle:Marker Inc Korea AutoRAG Deployment Mode Selection

From Leeroopedia
Revision as of 17:28, 16 February 2026 by Admin (talk | contribs) (Auto-imported from principles/Marker_Inc_Korea_AutoRAG_Deployment_Mode_Selection.md)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)


Knowledge Sources
Domains Deployment, API_Design
Last Updated 2026-02-08 06:00 GMT

Overview

A deployment pattern that provides multiple execution modes for running an optimized RAG pipeline: code, API server, or web interface.

Description

After initialization, AutoRAG pipelines can be deployed in three modes: Code Runner (programmatic access via Runner.run), API Server (REST endpoints via ApiRunner), and Web Interface (interactive chat via GradioRunner or Streamlit). Code mode is for integration into Python applications. API mode exposes /v1/run, /v1/retrieve, and /v1/stream endpoints. Web mode provides a chat-like interface for end users. Each mode uses the same underlying module chain but with different input/output interfaces.

Usage

Choose the deployment mode based on the use case: code mode for batch processing or embedding in applications, API mode for microservice architectures, or web mode for demos and user-facing applications.

Theoretical Basis

All deployment modes share the same execution pattern:

  1. Create a pseudo QA DataFrame from the user query
  2. Sequentially run each module instance on the previous result
  3. Merge module outputs into the growing result DataFrame
  4. Extract the final output from the specified result column

The difference lies only in the input/output interface, not the pipeline execution.

Related Pages

Implemented By

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment