Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Unslothai Unsloth QLoRA SFT Finetuning
- Workflow:Obss Sahi COCO Evaluation
- Workflow:Haotian liu LLaVA Benchmark Evaluation
- Workflow:Sail sg LongSpec Speculative Decoding Inference
- Workflow:LaurentMazare Tch rs JIT Model Inference
- Workflow:Triton inference server Server LLM Deployment With TRT LLM
- Workflow:ArroyoSystems Arroyo UDF Development
- Workflow:Rapidsai Cuml Multi GPU Distributed ML
- Workflow:Microsoft Agent framework Graph Based Workflow Execution
- Workflow:Microsoft Onnxruntime Train Convert Predict
Principles
- Principle:Apache Kafka Trogdor Invocation
- Principle:Ggml org Llama cpp ComputeGraph
- Principle:NVIDIA NeMo Curator Connected Component Analysis
- Principle:Huggingface Datatrove Pipeline Failure Diagnosis
- Principle:Spotify Luigi Hadoop Pipeline Execution
- Principle:MaterializeInc Materialize Release Commit and Tagging
- Principle:Google deepmind Mujoco Model Loading
- Principle:Kubeflow Pipelines Component Definition
- Principle:Huggingface Transformers Documentation Metadata Management
- Principle:Elevenlabs Elevenlabs python Text Source Preparation
Implementations
- Implementation:Online ml River Stats AutoCorr
- Implementation:FlagOpen FlagEmbedding Reinforced IR Generate Universal Query
- Implementation:OpenBMB UltraFeedback Annotation Data Loading
- Implementation:Deepset ai Haystack Sidebars Navigation Config
- Implementation:DistrictDataLabs Yellowbrick MissingValuesBar
- Implementation:TobikoData Sqlmesh Layout Worker Help
- Implementation:FMInference FlexLLMGen DeepSpeed Autotuning Utils
- Implementation:Treeverse LakeFS External Scheduler Configuration
- Implementation:CrewAIInc CrewAI Tool Usage And Hooks
- Implementation:Duckdb Duckdb Generate Auxiliary
Heuristics
- Heuristic:ChenghaoMou Text dedup SimHash Optimization Ceiling
- Heuristic:Apache Spark Partition Sizing Tips
- Heuristic:Axolotl ai cloud Axolotl Memory Optimization Tips
- Heuristic:Unstructured IO Unstructured Chunk Size Tuning
- Heuristic:Facebookresearch Habitat lab Force Single Threaded PyTorch
- Heuristic:Mlc ai Web llm Multi Round KV Cache Reuse
- Heuristic:Tencent Ncnn FP16 Precision Selection
- Heuristic:TA Lib Ta lib python NaN Propagation Behavior
- Heuristic:Facebookresearch Audiocraft Audio Normalization Strategies
- Heuristic:Huggingface Datatrove VLLM Startup Optimization
Environments
- Environment:Helicone Helicone Wrangler CLI
- Environment:Princeton nlp Tree of thought llm Python OpenAI
- Environment:Huggingface Trl Python Core Dependencies
- Environment:Sgl project Sglang CPU Runtime
- Environment:Huggingface Peft GPU Hardware Detection
- Environment:InternLM Lmdeploy Python Dependencies
- Environment:LLMBook zh LLMBook zh github io VLLM Inference Environment
- Environment:Spotify Luigi AWS S3 Storage
- Environment:Run llama Llama index Fsspec Remote Storage
- Environment:Predibase Lorax Python Server Dependencies