Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Scikit learn contrib Imbalanced learn Imbalanced Model Evaluation
- Workflow:OpenHands OpenHands GitHub Webhook Event Processing
- Workflow:Dagster io Dagster ETL Pipeline
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:OpenGVLab InternVL LoRA Finetuning
- Workflow:Microsoft Autogen Studio Team Deployment
- Workflow:Mlflow Mlflow Experiment Tracking
- Workflow:ARISE Initiative Robosuite Gymnasium RL Integration
- Workflow:Truera Trulens Snowflake Observability Pipeline
- Workflow:Ggml org Llama cpp Model Perplexity Evaluation
Principles
- Principle:Pytorch Serve Label Mapping
- Principle:MaterializeInc Materialize Release Commit and Tagging
- Principle:Pyro ppl Pyro Full Rank Variational Inference
- Principle:Farama Foundation Gymnasium Generalized Advantage Estimation
- Principle:ClickHouse ClickHouse Remote Syslog Logging
- Principle:Mlflow Mlflow Local Model Serving
- Principle:Openai Openai python Response Processing
- Principle:Hiyouga LLaMA Factory Distributed Training
- Principle:Dotnet Machinelearning Hybrid Dense Sparse Storage
- Principle:Webdriverio Webdriverio Documentation Infrastructure
Implementations
- Implementation:AUTOMATIC1111 Stable diffusion webui Soft Inpainting
- Implementation:SeleniumHQ Selenium Closure Testing StackTrace
- Implementation:Huggingface Datatrove GopherRepetitionFilter
- Implementation:Facebookresearch Habitat lab VER PreemptionDecider
- Implementation:LLMBook zh LLMBook zh github io Get Data DPO
- Implementation:Cleanlab Cleanlab Get Active Learning Scores
- Implementation:Unslothai Unsloth GEMM Forward Kernel
- Implementation:Helicone Helicone FilterDefs
- Implementation:Openai Openai agents python Lifecycle Hooks Pattern
- Implementation:Rapidsai Cuml Dependencies
Heuristics
- Heuristic:Spotify Luigi Batch Parameter Aggregation
- Heuristic:MaterializeInc Materialize Pipeline Test Trimming Rules
- Heuristic:Snorkel team Snorkel DataParallel Default Behavior
- Heuristic:Langchain ai Langchain Pydantic V2 Configuration Tips
- Heuristic:Huggingface Datasets Parquet Shard Sizing
- Heuristic:Unslothai Unsloth Padding Free Packing
- Heuristic:Apache Shardingsphere Shadow Routing Hint First Fallback
- Heuristic:Gretelai Gretel synthetics Memory Chunking For Normalization
- Heuristic:ClickHouse ClickHouse Banned Functions Thread Safety
- Heuristic:Apache Airflow Variable Access Pattern
Environments
- Environment:Nautechsystems Nautilus trader Python Cython Rust Runtime
- Environment:Protectai Modelscan Python Core Runtime
- Environment:Ggml org Ggml CUDA GPU Environment
- Environment:Evidentlyai Evidently Node Frontend Environment
- Environment:Deepset ai Haystack OpenAI API Environment
- Environment:Huggingface Datasets Audio Video Dependencies
- Environment:CARLA simulator Carla Simulation Runtime
- Environment:Spotify Luigi Tornado Web Server
- Environment:Apache Flink Node Build Environment
- Environment:Sgl project Sglang Python Dependencies