Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:NVIDIA NeMo Aligner RLHF PPO Training
- Workflow:Nightwatchjs Nightwatch Custom Commands And Assertions
- Workflow:PeterL1n BackgroundMattingV2 Video matting inference
- Workflow:Testtimescaling Testtimescaling github io GitHub Pages Course Progression
- Workflow:Risingwavelabs Risingwave CDC Data Replication
- Workflow:Lm sys FastChat ShareGPT Data Pipeline
- Workflow:Roboflow Rf detr ONNX Export
- Workflow:Shiyu coder Kronos Single Series Prediction
- Workflow:AUTOMATIC1111 Stable diffusion webui Image postprocessing and upscaling
- Workflow:Lm sys FastChat MT Bench Evaluation
Principles
- Principle:Microsoft Semantic kernel Function Choice Behavior
- Principle:Lance format Lance Tag Management
- Principle:Spotify Luigi Database Connection Configuration
- Principle:OpenRLHF OpenRLHF Unpaired Preference Dataset Construction
- Principle:Lance format Lance Version Cleanup
- Principle:Openai Evals Dataset Preparation
- Principle:Langchain ai Langgraph Entrypoint Configuration
- Principle:Openai Whisper Single Segment Decoding
- Principle:Mlflow Mlflow Development Environment Setup
- Principle:Intel Ipex llm Training With HF Trainer LoRA
Implementations
- Implementation:Lance format Lance MemWalIndex
- Implementation:OpenGVLab InternVL ScienceQA Inference
- Implementation:Online ml River FeatureExtraction Agg
- Implementation:Mlflow Mlflow Genai Evaluate
- Implementation:ARISE Initiative Robomimic TrainUtils run epoch
- Implementation:Promptfoo Promptfoo Logger Browser
- Implementation:Langgenius Dify UseLog
- Implementation:Pyro ppl Pyro MarkovMessenger
- Implementation:Apache Paimon DeltaVarintCompressor
- Implementation:Microsoft DeepSpeedExamples VisProjection
Heuristics
- Heuristic:Kornia Kornia Avoid Inplace Ops Compile
- Heuristic:Deepseek ai Janus Bfloat16 Dtype Selection
- Heuristic:LMCache LMCache Chunk Size And Default Config
- Heuristic:Shiyu coder Kronos Sampling Temperature Tuning
- Heuristic:Vespa engine Vespa RPM Zstd Compression Settings
- Heuristic:SeldonIO Seldon core Model Scheduling Preference Tip
- Heuristic:Scikit learn Scikit learn Warning Deprecated PassiveAggressive
- Heuristic:ArroyoSystems Arroyo Stateful Operator TTL
- Heuristic:Apache Airflow Task Dependency Isolation
- Heuristic:Google research Deduplicate text datasets Variable Width Pointer Optimization
Environments
- Environment:Apache Dolphinscheduler ZooKeeper Registry
- Environment:Explodinggradients Ragas Python Runtime Environment
- Environment:Apache Kafka Gradle Build Environment
- Environment:Testtimescaling Testtimescaling github io Python 3 Runtime
- Environment:OWASP Www project top 10 for large language model applications Pre Commit Hooks Environment
- Environment:Fede1024 Rust rdkafka Rust Librdkafka Build Environment
- Environment:Google deepmind Dm control EGL Headless Rendering
- Environment:FMInference FlexLLMGen CUDA GPU
- Environment:OpenGVLab InternVL DeepSpeed
- Environment:Sgl project Sglang Multimodal