Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Open compass VLMEvalKit Video Benchmark Evaluation
- Workflow:PeterL1n BackgroundMattingV2 Training pipeline
- Workflow:DataExpert io Data engineer handbook PySpark Iceberg Job Execution
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:Apache Spark Kubernetes Deployment
- Workflow:LaurentMazare Tch rs LLM Text Generation
- Workflow:ClickHouse ClickHouse Building From Source
- Workflow:Guardrails ai Guardrails Structured Data Generation
- Workflow:ARISE Initiative Robosuite Domain Randomization Training
- Workflow:Lm sys FastChat Vicuna SFT Finetuning
Principles
- Principle:Mit han lab Llm awq AWQ HuggingFace Export
- Principle:Truera Trulens Feedback Display Formatting
- Principle:Volcengine Verl RLHF Data Preparation
- Principle:Zai org CogVideo Parallel Video Generation
- Principle:Hpcaitech ColossalAI Tokenizer Vocabulary Expansion
- Principle:FlowiseAI Flowise Role Based Access Control
- Principle:Protectai Llm guard Input Scanner Factory Pattern
- Principle:Rapidsai Cuml Support Vector Machines
- Principle:PacktPublishing LLM Engineers Handbook Query Expansion
- Principle:Puppeteer Puppeteer Dynamic Content Waiting
Implementations
- Implementation:Datajuicer Data juicer Multimodal Utils
- Implementation:Openai Openai python Response Input Param
- Implementation:Hiyouga LLaMA Factory LongLoRA
- Implementation:OpenGVLab InternVL LlavaMptForCausalLM
- Implementation:Haosulab ManiSkill RoboCasaStove
- Implementation:Roboflow Rf detr RFDETR Size Variants
- Implementation:Mlc ai Mlc llm Auto Config
- Implementation:Huggingface Datasets HDF5 Builder
- Implementation:Protectai Llm guard Output Language
- Implementation:Trailofbits Fickling Create Malicious Dataset
Heuristics
- Heuristic:ChenghaoMou Text dedup Mersenne Prime Backward Compatibility
- Heuristic:Trailofbits Fickling Force Flag Bypass
- Heuristic:Openai Whisper Temperature Fallback Strategy
- Heuristic:Ggml org Ggml Gradient Accumulation Batch Sizing
- Heuristic:Apache Spark Memory Tuning Tips
- Heuristic:Hiyouga LLaMA Factory LoRA DDP Configuration
- Heuristic:Rapidsai Cuml Float64 Kernel Stability
- Heuristic:FlagOpen FlagEmbedding Temperature Scaling Tip
- Heuristic:OpenRLHF OpenRLHF Value Head ZeRO3 Init Tip
- Heuristic:Togethercomputer Together python Repetition Penalty Conflict
Environments
- Environment:Sgl project Sglang ROCm
- Environment:CarperAI Trlx DeepSpeed Multi GPU
- Environment:Romsto Speculative Decoding CUDA PyTorch
- Environment:Mistralai Client python Azure Deployment Environment
- Environment:Openai Openai agents python LiteLLM Dependencies
- Environment:Openai CLIP PyTorch CUDA Runtime
- Environment:Anthropics Anthropic sdk python AWS Bedrock Environment
- Environment:Guardrails ai Guardrails OpenTelemetry Tracing
- Environment:NVIDIA DALI CMake Build Environment
- Environment:Lance format Lance Python Environment