Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Predibase Lorax Single LoRA Inference
- Workflow:Hiyouga LLaMA Factory Model Export and Merging
- Workflow:Huggingface Open r1 GRPO Reasoning Training
- Workflow:Vibrantlabsai Ragas RAG Evaluation
- Workflow:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Data Ingestion
- Workflow:Openai Openai python Fine Tuning Job Management
- Workflow:Mage ai Mage ai Building a New Source Connector
- Workflow:Webdriverio Webdriverio WDIO Testrunner Setup
- Workflow:Deepseek ai Janus Rectified Flow Image Generation
- Workflow:Farama Foundation Gymnasium RL Agent Training Loop
Principles
- Principle:Marker Inc Korea AutoRAG Quality Filtering And Export
- Principle:Huggingface Diffusers CI Test Reporting
- Principle:Langfuse Langfuse Dataset Item Processing
- Principle:FlowiseAI Flowise Lead Capture
- Principle:Openai Openai node Fine Tuning Job Creation
- Principle:Sgl project Sglang Multimodal Prompt Construction
- Principle:Scikit learn Scikit learn Metric Visualization
- Principle:Webdriverio Webdriverio BrowserStack Accessibility Testing
- Principle:OpenGVLab InternVL Dynamic Resolution Preprocessing
- Principle:Microsoft DeepSpeedExamples PPO Training
Implementations
- Implementation:CrewAIInc CrewAI Crew Constructor
- Implementation:Run llama Llama index IngestionPipeline Persist
- Implementation:Microsoft Playwright TraceModel Loader
- Implementation:Gretelai Gretel synthetics Generate Text
- Implementation:Avhz RustQuant Asay82 Model
- Implementation:InternLM Lmdeploy Impl 884
- Implementation:Mlc ai Mlc llm Process chat completion request
- Implementation:Neuml Txtai Texts Data
- Implementation:Huggingface Datatrove BaseInferenceServer
- Implementation:Tensorflow Serving Static Storage Path Source
Heuristics
- Heuristic:Protectai Modelscan Nested Zip Not Supported
- Heuristic:Apache Druid Cluster Health Diagnostic Thresholds
- Heuristic:Tensorflow Tfjs Memory Management With Tidy
- Heuristic:Axolotl ai cloud Axolotl Gradient Checkpointing Reentrant Rules
- Heuristic:Vibrantlabsai Ragas Temperature Sampling Strategy
- Heuristic:Apache Dolphinscheduler Gzip Compression Threshold
- Heuristic:Deepset ai Haystack Pipeline Max Runs Safety Limit
- Heuristic:Guardrails ai Guardrails Async Vs Sync Validation Mode
- Heuristic:Infiniflow Ragflow Citation Threshold Decay
- Heuristic:Snorkel team Snorkel LabelModel Mu Eps Clamping
Environments
- Environment:Intel Ipex llm XPU Inference Environment
- Environment:Evidentlyai Evidently Python Core Environment
- Environment:SeleniumHQ Selenium Grid Deployment Environment
- Environment:Huggingface Datatrove Inference GPU Environment
- Environment:Apache Paimon Python Core Runtime
- Environment:ContextualAI HALOs CUDA 12 1 Training Environment
- Environment:Mlc ai Mlc llm TVM Runtime Environment
- Environment:Google deepmind Dm control GLFW Desktop Rendering
- Environment:Tencent Ncnn Vulkan Environment
- Environment:Apache Beam Portable Runner Environment