Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mlflow Mlflow Prompt Management
- Workflow:Snorkel team Snorkel Data Augmentation
- Workflow:Microsoft Autogen Studio Team Deployment
- Workflow:CrewAIInc CrewAI Crew Training And Testing
- Workflow:Ray project Ray Remote Task Execution
- Workflow:Kubeflow Pipelines Standalone Deployment
- Workflow:Microsoft Semantic kernel Agent Conversation And Orchestration
- Workflow:Speechbrain Speechbrain Speech Enhancement Training
- Workflow:Datajuicer Data juicer Dataset Quality Analysis
- Workflow:Guardrails ai Guardrails Streaming Validation
Principles
- Principle:Onnx Onnx Shape Inference
- Principle:Google deepmind Dm control Robot Configuration
- Principle:Speechbrain Speechbrain Dataset Pipeline Construction
- Principle:DistrictDataLabs Yellowbrick Error Handling
- Principle:Apache Flink Source Connector Framework
- Principle:Ollama Ollama OpenAI Route Registration
- Principle:LMCache LMCache Prefiller Instance Launch
- Principle:OWASP Www project top 10 for large language model applications Real World Incident Cross Reference
- Principle:Recommenders team Recommenders SAR Model Training
- Principle:Arize ai Phoenix Application Instrumentation
Implementations
- Implementation:Iamhankai Forest of Thought ToT Task Run
- Implementation:CARLA simulator Carla Actor API Spec
- Implementation:Mlflow Mlflow Build Packages
- Implementation:Datajuicer Data juicer VideoAspectRatioFilter
- Implementation:Arize ai Phoenix Legacy GeminiModel
- Implementation:Elevenlabs Elevenlabs python WidgetConfigResponse
- Implementation:Openai Openai python Response Code Interpreter Tool Call Param
- Implementation:CARLA simulator Carla Python AdRss Bindings
- Implementation:Mbzuai oryx Awesome LLM Post training Collection Config Variables
- Implementation:TobikoData Sqlmesh Toggle
Heuristics
- Heuristic:Marker Inc Korea AutoRAG Batch Size Tuning
- Heuristic:Iamhankai Forest of Thought UCB Exploration Constant
- Heuristic:Mlc ai Mlc llm Metal KV Cache Capacity Limit
- Heuristic:AUTOMATIC1111 Stable diffusion webui NaN Detection And Precision Fixes
- Heuristic:Huggingface Optimum Device Offload Constraints
- Heuristic:Kornia Kornia Lazy Loading Optional Deps
- Heuristic:Ggml org Llama cpp Quantization Quality Tips
- Heuristic:ARISE Initiative Robomimic HDF5 Cache Mode Selection
- Heuristic:Astronomer Astronomer cosmos Memory Optimised Imports
- Heuristic:CrewAIInc CrewAI LLM Provider Message Workarounds
Environments
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:Testtimescaling Testtimescaling github io Semantic Scholar API
- Environment:Datahub project Datahub Java 17 Backend Environment
- Environment:Promptfoo Promptfoo Node Runtime
- Environment:DataTalksClub Data engineering zoomcamp PySpark Batch Environment
- Environment:BerriAI Litellm Docker Deployment
- Environment:Ollama Ollama GPU Runtime
- Environment:Arize ai Phoenix Frontend Node 22
- Environment:Unstructured IO Unstructured Libmagic
- Environment:Intel Ipex llm NPU Environment