Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomously building AI/ML software
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Farama Foundation Gymnasium Policy Gradient Training
- Workflow:Ggml org Ggml MNIST Training And Evaluation
- Workflow:Tensorflow Tfjs GPT2 Text Generation
- Workflow:Datahub project Datahub Python Metadata Emission
- Workflow:Webdriverio Webdriverio Cloud Service Integration
- Workflow:Ucbepic Docetl Python API Pipeline
- Workflow:Bentoml BentoML Bento Build And Containerization
- Workflow:OpenRLHF OpenRLHF Math Reasoning Training
- Workflow:Pola rs Polars DataFrame Aggregation and Grouping
- Workflow:Datahub project Datahub Java SDK Metadata Emission
Principles
- Principle:Pyro ppl Pyro Evidence Lower Bound
- Principle:Treeverse LakeFS Repository Creation
- Principle:Haosulab ManiSkill Robot Agent Definition
- Principle:Cleanlab Cleanlab CleanLearning Initialization
- Principle:AUTOMATIC1111 Stable diffusion webui Memory Efficient Loading
- Principle:EvolvingLMMs Lab Lmms eval Environment Setup
- Principle:Sdv dev SDV CTGAN Synthesis
- Principle:FlowiseAI Flowise Docker Deployment
- Principle:FlowiseAI Flowise Chat Widget Embedding
- Principle:Openai CLIP Text Tokenization
Implementations
- Implementation:Avhz RustQuant FractionalOrnsteinUhlenbeck
- Implementation:Tencent Ncnn Blob
- Implementation:Treeverse LakeFS Java SDK Model RepositoryCreation
- Implementation:Langgenius Dify UseShare
- Implementation:Microsoft Semantic kernel InvokePromptAsync With KernelArguments
- Implementation:Facebookresearch Habitat lab ResetArmSkill
- Implementation:Explodinggradients Ragas Collections ContextRelevance Metric
- Implementation:Kserve Kserve LocalModelNode CRD
- Implementation:Elevenlabs Elevenlabs python UnitTestToolCallEvaluationModelInput
- Implementation:Langfuse Langfuse StorageService UploadWithSignedUrl
Heuristics
- Heuristic:NVIDIA DALI Last Batch Policy Selection
- Heuristic:FlowiseAI Flowise Tool Ordering Convention
- Heuristic:Sktime Pytorch forecasting Encoder Decoder Length Limits
- Heuristic:Togethercomputer Together python Retry Backoff Strategy
- Heuristic:Facebookresearch Audiocraft Chroma Conditioning Cache Requirement
- Heuristic:Google deepmind Mujoco Thread Pool Configuration
- Heuristic:Microsoft LoRA Scaling Factor Alpha Over R
- Heuristic:Huggingface Diffusers Memory Offloading Strategy
- Heuristic:Huggingface Datatrove Gopher Quality Thresholds
- Heuristic:CrewAIInc CrewAI RAG Search Defaults
Environments
- Environment:Huggingface Optimum Tensor Parallelization Environment
- Environment:Teamcapybara Capybara Selenium WebDriver Environment
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:Apache Hudi Docker Demo Environment
- Environment:Snorkel team Snorkel PyTorch
- Environment:Pola rs Polars Python Runtime Environment
- Environment:LMCache LMCache VLLM Serving Engine
- Environment:Huggingface Datasets TensorFlow Integration
- Environment:Unstructured IO Unstructured All Docs
- Environment:Unstructured IO Unstructured OpenAI API