Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Peft LoRA Causal LM Finetuning
- Workflow:Neuml Txtai Model Training
- Workflow:MaterializeInc Materialize Docker Image Build
- Workflow:Openai Openai node Chat Completion
- Workflow:Iterative Dvc Data Tracking
- Workflow:Roboflow Rf detr Roboflow Deployment
- Workflow:Heibaiying BigData Notes Storm Topology Development
- Workflow:Datahub project Datahub Java SDK V2 Entity Management
- Workflow:NVIDIA TransformerEngine Comm GEMM Overlap Training
- Workflow:Microsoft Playwright Trace recording and debugging
Principles
- Principle:Eric mitchell Direct preference optimization Hydra Configuration
- Principle:Haosulab ManiSkill Domain Randomization
- Principle:Dotnet Machinelearning SIMD Vector Math
- Principle:Ollama Ollama Server Initialization
- Principle:Onnx Onnx Model Serialization
- Principle:Wandb Weave Release Push
- Principle:Apache Hudi Write Operation Configuration
- Principle:Online ml River Online Decision Trees
- Principle:Heibaiying BigData Notes Kafka Rebalancing and Shutdown
- Principle:Astronomer Astronomer cosmos Graph Entity Model
Implementations
- Implementation:Run llama Llama index TransformRetriever
- Implementation:Farama Foundation Gymnasium Continuous MountainCarEnv
- Implementation:Pola rs Polars Sink Operations
- Implementation:Kserve Kserve OpenVINO Runtime
- Implementation:FlowiseAI Flowise InviteUsersDialog
- Implementation:Online ml River Metrics Base
- Implementation:Hpcaitech ColossalAI Eval Utilities
- Implementation:Open compass VLMEvalKit Build Judge
- Implementation:Apache Paimon RestApi
- Implementation:Apache Flink Pool
Heuristics
- Heuristic:ContextualAI HALOs TF32 Matmul Acceleration
- Heuristic:Kserve Kserve VLLM GPU Memory Utilization
- Heuristic:Mlc ai Mlc llm Engine Mode Selection
- Heuristic:Apache Dolphinscheduler Load Balancer Strategy Selection
- Heuristic:Facebookresearch Habitat lab Force Single Threaded PyTorch
- Heuristic:Gretelai Gretel synthetics Mixed Precision Training Tradeoff
- Heuristic:NVIDIA NeMo Aligner Adam State Offloading Tip
- Heuristic:Huggingface Open r1 vLLM GPU Allocation
- Heuristic:ThreeSR Awesome Inference Time Scaling Empty Venue Default Tip
- Heuristic:Openai CLIP JIT Vs Non JIT Loading
Environments
- Environment:Openai Openai agents python Voice Dependencies
- Environment:Huggingface Trl DeepSpeed Environment
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:Togethercomputer Together python Fine Tuning Data Requirements
- Environment:Iamhankai Forest of Thought OpenAI API Credentials
- Environment:LMCache LMCache CUDA GPU Runtime
- Environment:Apache Hudi Flink Runtime Environment
- Environment:Norrrrrrr lyn WAInjectBench External Repos Dependencies
- Environment:Kubeflow Kubeflow Python KFP SDK Environment
- Environment:DataExpert io Data engineer handbook PostgreSQL Docker Environment