Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Langchain ai Langchain Chat Model Invocation
- Workflow:Avhz RustQuant Model Calibration
- Workflow:Truera Trulens Snowflake Observability Pipeline
- Workflow:Huggingface Open r1 Reasoning Data Generation
- Workflow:Heibaiying BigData Notes Spark SQL Data Analysis
- Workflow:Apache Flink File Sink Pipeline
- Workflow:CARLA simulator Carla Building from Source
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
- Workflow:ClickHouse ClickHouse Packaging For Distribution
- Workflow:Microsoft Semantic kernel Agent Conversation And Orchestration
Principles
- Principle:Ggml org Ggml GGUF File Creation
- Principle:Allenai Open instruct Causal LM Loading
- Principle:Tensorflow Serving Resource Tracking
- Principle:Lucidrains X transformers Variational Latent Language Modeling
- Principle:Triton inference server Server Engine Validation
- Principle:Spotify Luigi Spark Resource Configuration
- Principle:Huggingface Datasets TensorFlow Formatting
- Principle:Iterative Dvc Dataset Resolution
- Principle:Sail sg LongSpec Math Equivalence Evaluation
- Principle:Mistralai Client python OCR Result Processing
Implementations
- Implementation:TobikoData Sqlmesh CopyButton
- Implementation:Sgl project Sglang Triton Character Generation
- Implementation:Microsoft DeepSpeedExamples DSPipeline Text Generation
- Implementation:Helicone Helicone Mitmproxy Mac
- Implementation:Apache Druid QueryParametersDialog
- Implementation:Helicone Helicone BaseProvider
- Implementation:Openai Openai python Response Input Audio Param
- Implementation:Apache Paimon DataTypeJsonParser
- Implementation:Ucbepic Docetl VisualizationBuilder
- Implementation:CARLA simulator Carla Client Start Recorder
Heuristics
- Heuristic:Datajuicer Data juicer Batch Size Adaptation
- Heuristic:Interpretml Interpret Memory Budget Heuristic
- Heuristic:Apache Shardingsphere Worker ID Reservation Strategy
- Heuristic:Speechbrain Speechbrain Score Normalization Tips
- Heuristic:Astronomer Astronomer cosmos Dbt Invocation Mode Selection
- Heuristic:Langchain ai Langchain Warning Deprecated Langchain Classic
- Heuristic:Mbzuai oryx Awesome LLM Post training Excel Sheet Name Truncation
- Heuristic:Neuml Txtai Memory Streaming Optimization
- Heuristic:Predibase Lorax Quantization Backend Selection
- Heuristic:Rapidsai Cuml Batch Size Memory Tradeoff
Environments
- Environment:ArroyoSystems Arroyo Object Storage
- Environment:MaterializeInc Materialize Dbt Materialize Runtime
- Environment:Mistralai Client python GCP Deployment Environment
- Environment:Truera Trulens OpenAI Provider Environment
- Environment:Apache Dolphinscheduler Netty Runtime
- Environment:Mlc ai Web llm Node Build Toolchain
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:Huggingface Trl DeepSpeed Environment
- Environment:Kubeflow Kubeflow Kubectl Kustomize CLI Environment
- Environment:Apache Kafka Committer Tools Environment