Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Heibaiying BigData Notes Flink Kafka Streaming Pipeline
- Workflow:Dagster io Dagster ML Pipeline
- Workflow:Cohere ai Cohere python AWS Bedrock Deployment
- Workflow:Huggingface Diffusers Checkpoint Conversion
- Workflow:Sktime Pytorch forecasting TFT Demand Forecasting
- Workflow:Ucbepic Docetl Long Document Chunking
- Workflow:ClickHouse ClickHouse Contributing Pull Request
- Workflow:Neuml Txtai Model Training
- Workflow:Datajuicer Data juicer LLM Powered Data Generation
- Workflow:Guardrails ai Guardrails Custom Validator Development
Principles
- Principle:Helicone Helicone Cache Abstraction
- Principle:ArroyoSystems Arroyo UDF Compilation
- Principle:DistrictDataLabs Yellowbrick Residual Analysis
- Principle:Promptfoo Promptfoo Vulnerability Grading
- Principle:Langchain ai Langchain Distribution Building
- Principle:Webdriverio Webdriverio Type Safety
- Principle:Bigscience workshop Petals Output Decoding
- Principle:Treeverse LakeFS Import Progress Monitoring
- Principle:Huggingface Trl Reward Preference Dataset Loading
- Principle:Huggingface Datasets JSON Export
Implementations
- Implementation:Haosulab ManiSkill AllegroHandTouch
- Implementation:Alibaba MNN Protobuf Generated TCTable Impl H
- Implementation:Axolotl ai cloud Axolotl MultipackBatchSampler
- Implementation:Open compass VLMEvalKit OCRBench V2 Eval
- Implementation:Ggml org Llama cpp Memory Recurrent
- Implementation:Apache Paimon BatchTableWrite Write Pandas
- Implementation:Alibaba ROLL RLVRConfig
- Implementation:Ollama Ollama Convert Olmo
- Implementation:Zai org CogVideo SAT VideoDataset
- Implementation:Nightwatchjs Nightwatch Multi Page Flow Pattern
Heuristics
- Heuristic:Ray project Ray NaN Score Filtering In PBT
- Heuristic:Online ml River Hoeffding Tree Grace Period Tuning
- Heuristic:Treeverse LakeFS Warning Deprecated InternalApi Methods
- Heuristic:Nightwatchjs Nightwatch Safari Parallel Limitation
- Heuristic:Alibaba MNN GPU Tuning Modes
- Heuristic:Apache Beam Warning Deprecated Twister2 Runner
- Heuristic:Openai Evals Eval Resumption Strategy
- Heuristic:Interpretml Interpret Categorical Float Conversion Gotcha
- Heuristic:Roboflow Rf detr Layer Wise LR Decay
- Heuristic:Farama Foundation Gymnasium Sync Vs Async VectorEnv Selection
Environments
- Environment:Huggingface Transformers BitsAndBytes Quantization Env
- Environment:Astronomer Astronomer cosmos Cloud Provider Dependencies
- Environment:MaterializeInc Materialize Docker Compose Runtime
- Environment:LLMBook zh LLMBook zh github io Bitsandbytes Quantization Environment
- Environment:Huggingface Datatrove Inference GPU Environment
- Environment:Neuml Txtai Python Core Environment
- Environment:Microsoft Onnxruntime CPU Training Environment
- Environment:Open compass VLMEvalKit Python Runtime Environment
- Environment:Datahub project Datahub Docker Quickstart Environment
- Environment:EvolvingLMMs Lab Lmms eval Server Mode Environment