Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Google research Deduplicate text datasets Wiki40B TFDS deduplication
- Workflow:Intel Ipex llm Pipeline Parallel Inference
- Workflow:Astronomer Astronomer cosmos Kubernetes dbt execution
- Workflow:Microsoft LoRA NLU GLUE Finetuning
- Workflow:Vllm project Vllm Offline Text Generation
- Workflow:Nautechsystems Nautilus trader Backtest with BacktestNode
- Workflow:Haosulab ManiSkill Custom Task Development
- Workflow:Vibrantlabsai Ragas Experiment Driven Development
- Workflow:CARLA simulator Carla Simulation Recording and Replay
- Workflow:Apache Beam Twister2 Batch Execution
Principles
- Principle:Unstructured IO Unstructured Strategy Selection
- Principle:AUTOMATIC1111 Stable diffusion webui Latent Diffusion Pipeline
- Principle:Guardrails ai Guardrails Validator Logic Implementation
- Principle:Google deepmind Mujoco Object Name Lookup
- Principle:Datajuicer Data juicer Pipeline Monitoring and Checkpointing
- Principle:Mlfoundations Open flamingo Distributed Checkpointing
- Principle:Duckdb Duckdb Benchmark Discovery
- Principle:FMInference FlexLLMGen Tokenizer Loading
- Principle:BerriAI Litellm Training Data Preparation
- Principle:Langgenius Dify Frontend Container Runtime
Implementations
- Implementation:TobikoData Sqlmesh PlanOptions
- Implementation:Microsoft Playwright Server Instrumentation
- Implementation:AUTOMATIC1111 Stable diffusion webui Network apply weights
- Implementation:Axolotl ai cloud Axolotl Setup Script
- Implementation:Huggingface Diffusers Check Repo Quality
- Implementation:Confident ai Deepeval FaithfulnessMetric
- Implementation:FlowiseAI Flowise ExportAsTemplateDialog
- Implementation:Truera Trulens UX Components
- Implementation:Vllm project Vllm RequestOutput VLM Access
- Implementation:Apache Paimon FormatTable
Heuristics
- Heuristic:Mlc ai Mlc llm BLAS Dispatch Decision
- Heuristic:Sdv dev SDV Version Compatibility
- Heuristic:ChenghaoMou Text dedup Mersenne Prime Backward Compatibility
- Heuristic:Ggml org Llama cpp Warning Deprecated Legacy Converters
- Heuristic:Isaac sim IsaacGymEnvs GPU Pipeline Selection
- Heuristic:CrewAIInc CrewAI Context Window Management
- Heuristic:Datahub project Datahub Validation Cross API
- Heuristic:Protectai Llm guard Fail Fast Early Exit
- Heuristic:Lance format Lance Vector Index Tuning
- Heuristic:TobikoData Sqlmesh Forward Only Safety
Environments
- Environment:Intel Ipex llm Pipeline Parallel Environment
- Environment:Duckdb Duckdb Code Generation Tools
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:Heibaiying BigData Notes Storm 1 2 Environment
- Environment:Bitsandbytes foundation Bitsandbytes Build From Source Environment
- Environment:ThreeSR Awesome Inference Time Scaling Semantic Scholar API Environment
- Environment:ARISE Initiative Robosuite GPU Rendering
- Environment:Evidentlyai Evidently Grafana Monitoring Environment
- Environment:Intel Ipex llm Portable Environment
- Environment:Heibaiying BigData Notes Java 8 Maven Environment