Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lucidrains X transformers Encoder Decoder Sequence to Sequence
- Workflow:Duckdb Duckdb Code Generation Pipeline
- Workflow:Alibaba ROLL Agentic RL Training Pipeline
- Workflow:Deepset ai Haystack Hybrid Document Search
- Workflow:Microsoft Autogen Studio Team Deployment
- Workflow:Pola rs Polars Streaming Large Dataset Processing
- Workflow:Deepspeedai DeepSpeed Hybrid Engine RLHF Training
- Workflow:Heibaiying BigData Notes HBase Java CRUD Operations
- Workflow:Norrrrrrr lyn WAInjectBench Text Prompt Injection Detection
- Workflow:NVIDIA NeMo Aligner RLHF PPO Training
Principles
- Principle:Langchain ai Langchain Pre Release Validation
- Principle:OWASP Www project top 10 for large language model applications Vulnerability Testing
- Principle:Helicone Helicone Environment Configuration
- Principle:Langchain ai Langgraph Edge Configuration
- Principle:NVIDIA DALI TensorFlow Training Integration
- Principle:Deepspeedai DeepSpeed Tensor Parallel Training
- Principle:Interpretml Interpret Data Preparation And Validation
- Principle:Wandb Weave Version Bump Release
- Principle:Huggingface Datatrove Statistics Merging
- Principle:Rapidsai Cuml Genetic Programming
Implementations
- Implementation:AnswerDotAI RAGatouille RAGTrainer Train
- Implementation:OpenBMB UltraFeedback GPT4 Critique Annotator
- Implementation:Ggml org Llama cpp Llama Model Quantize Params
- Implementation:AUTOMATIC1111 Stable diffusion webui DDPM V1 Diffusion Model
- Implementation:Microsoft Autogen DatabaseManager Operations
- Implementation:AUTOMATIC1111 Stable diffusion webui Sample hr pass
- Implementation:SeleniumHQ Selenium Closure SafeUrl
- Implementation:Huggingface Datasets Value
- Implementation:Unslothai Unsloth GEMM Forward Kernel
- Implementation:Elevenlabs Elevenlabs python AudioNativeClient
Heuristics
- Heuristic:Mlc ai Mlc llm Optimization Level Selection
- Heuristic:Unstructured IO Unstructured Warning Deprecated Staging Base
- Heuristic:DataExpert io Data engineer handbook SparkSession Singleton Pattern
- Heuristic:Kubeflow Pipelines Resource Sizing For Components
- Heuristic:Rapidsai Cuml Float64 Kernel Stability
- Heuristic:Ray project Ray Autoscaling Delay Tuning
- Heuristic:Huggingface Diffusers LoRA Safe Fusing
- Heuristic:Vespa engine Vespa Config Polling Timeout Tuning
- Heuristic:Openai Openai node RunTools Loop Limit
- Heuristic:Apache Hudi Compaction Scheduling Safety
Environments
- Environment:Deepseek ai Janus JanusFlow Diffusers Environment
- Environment:Langgenius Dify Credentials And Env Vars
- Environment:Duckdb Duckdb Extension Distribution Env
- Environment:Apache Kafka JVM Runtime Environment
- Environment:Arize ai Phoenix OpenTelemetry SDK
- Environment:Onnx Onnx Python Runtime Environment
- Environment:Junyanz Pytorch CycleGAN and pix2pix DDP Multi GPU
- Environment:Tencent Ncnn Build Environment
- Environment:Run llama Llama index Fsspec Remote Storage
- Environment:Mlfoundations Open flamingo WebDataset Training Dependencies