Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DataExpert io Data engineer handbook PySpark Iceberg Job Execution
- Workflow:Googleapis Python genai Multi Turn Chat
- Workflow:Microsoft Playwright Codegen test recording
- Workflow:Farama Foundation Gymnasium RL Agent Training Loop
- Workflow:Deepseek ai Janus Autoregressive Image Generation
- Workflow:Trailofbits Fickling Pickle Safety Analysis
- Workflow:Huggingface Optimum Accelerated Inference Pipeline
- Workflow:NVIDIA NeMo Curator Video Curation Pipeline
- Workflow:NVIDIA NeMo Curator Text Curation Pipeline
- Workflow:Axolotl ai cloud Axolotl Multimodal Vision Finetuning
Principles
- Principle:Sktime Pytorch forecasting Series Decomposition
- Principle:Cypress io Cypress Test Suite Execution
- Principle:Trailofbits Fickling Benchmark Dataset Construction
- Principle:ClickHouse ClickHouse PCG Random Number Generation
- Principle:LLMBook zh LLMBook zh github io BPE Tokenization
- Principle:Apache Druid Spec Management Troubleshooting
- Principle:Huggingface Datasets XML Dataset Building
- Principle:Ggml org Llama cpp Response Generation
- Principle:Ggml org Llama cpp Memory Management
- Principle:Langfuse Langfuse API Authentication and Rate Limiting
Implementations
- Implementation:Mlflow Mlflow ML Package Versions Data
- Implementation:Fede1024 Rust rdkafka FutureProducer Send Result
- Implementation:Avhz RustQuant Time Utilities
- Implementation:Sktime Pytorch forecasting DataEmbedding Inverted
- Implementation:Microsoft Onnxruntime OnnxRuntime Java
- Implementation:FlagOpen FlagEmbedding EvalDenseRetriever Call
- Implementation:FlagOpen FlagEmbedding LLM Embedder BM25
- Implementation:Huggingface Datatrove KenlmModel
- Implementation:CARLA simulator Carla BlueprintLibrary Find For Sensors
- Implementation:AUTOMATIC1111 Stable diffusion webui UI Builder
Heuristics
- Heuristic:Apache Flink Async Sink Timeout And Backpressure Defaults
- Heuristic:Truera Trulens Trace Compression Token Limits
- Heuristic:Openai Whisper Log Probability Threshold
- Heuristic:Elevenlabs Elevenlabs python Audio Buffer Sizes
- Heuristic:Apache Druid Capability Detection Strategy
- Heuristic:Elevenlabs Elevenlabs python TTS Model Selection
- Heuristic:Microsoft Autogen Model Context Limiting
- Heuristic:Openai Openai node RunTools Loop Limit
- Heuristic:Online ml River HST Feature Scaling Requirement
- Heuristic:Avhz RustQuant Learning Rate Tuning
Environments
- Environment:Heibaiying BigData Notes HBase Environment
- Environment:DevExpress Testcafe Docker Xvfb Container
- Environment:Junyanz Pytorch CycleGAN and pix2pix DDP Multi GPU
- Environment:Apache Druid Web Console Development
- Environment:NVIDIA NeMo Curator Video Codec Stack
- Environment:Sail sg LongSpec Training Environment
- Environment:Microsoft Onnxruntime Sklearn Conversion Environment
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:EvolvingLMMs Lab Lmms eval Server Mode Environment
- Environment:Vllm project Vllm CUDA GPU Runtime