Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Vespa engine Vespa Logging framework initialization
- Workflow:PacktPublishing LLM Engineers Handbook Digital Data ETL
- Workflow:Apache Flink Async Sink Lifecycle
- Workflow:Danijar Dreamerv3 Evaluation Only
- Workflow:Turboderp org Exllamav2 Text Generation
- Workflow:Cohere ai Cohere python Model Finetuning
- Workflow:Facebookresearch Audiocraft MusicGen Text To Music Inference
- Workflow:Hiyouga LLaMA Factory Full Parameter SFT
- Workflow:PeterL1n BackgroundMattingV2 Realtime webcam matting
- Workflow:Huggingface Datasets Dataset Preprocessing
Principles
- Principle:Mlfoundations Open flamingo WebDataset Data Pipeline
- Principle:FMInference FlexLLMGen DeepSpeed Initialization
- Principle:FMInference FlexLLMGen CUDA Type Conversion
- Principle:Kserve Kserve Controller Deployment
- Principle:Cohere ai Cohere python Streaming Chat Request
- Principle:Online ml River Iterative Progressive Validation
- Principle:Nautechsystems Nautilus trader Strategy Registration
- Principle:Mage ai Mage ai HTTP Client Request
- Principle:Ggml org Llama cpp Evaluation Dataset Acquisition
- Principle:Googleapis Python genai Live Music Generation
Implementations
- Implementation:Vespa engine Vespa IndexingProcessor ErrorHandling
- Implementation:Kubeflow Pipelines Taxi Utils
- Implementation:DevExpress Testcafe CI Npm Install Pattern
- Implementation:Microsoft Playwright ArtifactDispatcher
- Implementation:BerriAI Litellm CyberArk Secret Manager
- Implementation:FlowiseAI Flowise SSOConfig
- Implementation:Run llama Llama index FunctionCallingLLM
- Implementation:DataExpert io Data engineer handbook Do team vertex transformation
- Implementation:Googleapis Python genai AutomaticFunctionCallingConfig Setup
- Implementation:Infiniflow Ragflow UseMcpRequest Hooks
Heuristics
- Heuristic:Cohere ai Cohere python HTTP Retry Backoff Strategy
- Heuristic:Hpcaitech ColossalAI Warmup Steps Heuristic
- Heuristic:Vespa engine Vespa Warning Deprecated Cloud API Constructors
- Heuristic:Datahub project Datahub Gradle Formatting Over Direct Tools
- Heuristic:Cleanlab Cleanlab Object Detection Scoring Constants
- Heuristic:Princeton nlp SimPO Multi Seed Diversity
- Heuristic:Getgauge Taiko Navigation Wait Strategy
- Heuristic:Microsoft BIPIA OpenAI Rate Limit Retry
- Heuristic:Huggingface Datatrove Thundering Herd Prevention
- Heuristic:ChenghaoMou Text dedup Mersenne Prime Backward Compatibility
Environments
- Environment:ARISE Initiative Robomimic PyTorch CUDA Environment
- Environment:Togethercomputer Together python API Credentials
- Environment:Interpretml Interpret Python Core Environment
- Environment:Apache Dolphinscheduler Java Runtime
- Environment:InternLM Lmdeploy Build From Source
- Environment:Mlflow Mlflow Docker Container Environment
- Environment:Haotian liu LLaVA OpenAI API Evaluation Environment
- Environment:Scikit learn Scikit learn OpenMP Thread Configuration
- Environment:OpenHands OpenHands Integration Credentials
- Environment:Langfuse Langfuse ClickHouse Analytics