Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Online ml River Online Clustering
- Workflow:Facebookresearch Habitat lab Agent Benchmarking
- Workflow:Openai Openai agents python Human In The Loop Approval
- Workflow:Run llama Llama index OpenAI LLM Finetuning
- Workflow:Heibaiying BigData Notes Hadoop MapReduce Word Count
- Workflow:AnswerDotAI RAGatouille ColBERT Training
- Workflow:Langfuse Langfuse Otel ingestion pipeline
- Workflow:Langchain ai Langgraph ReAct Agent Creation
- Workflow:LLMBook zh LLMBook zh github io Supervised Finetuning
- Workflow:NVIDIA NeMo Curator Text Curation Pipeline
Principles
- Principle:Apache Flink Compacted Committable Emission
- Principle:TobikoData Sqlmesh Forward Only Change Handling
- Principle:Kserve Kserve Pool Tuning
- Principle:Huggingface Trl PPO Training Loop
- Principle:Microsoft DeepSpeedExamples SuperOffload Environment
- Principle:Diagram of thought Diagram of thought Prompt Validation Testing
- Principle:Microsoft Autogen Message Filtering
- Principle:PacktPublishing LLM Engineers Handbook Chunking And Embedding
- Principle:Ggml org Llama cpp Apple Platform Build
- Principle:AUTOMATIC1111 Stable diffusion webui Postprocessing input selection
Implementations
- Implementation:ArroyoSystems Arroyo Arrow Operators
- Implementation:NVIDIA TransformerEngine JAX Softmax
- Implementation:Treeverse LakeFS UploadObject
- Implementation:Open compass VLMEvalKit SlideVQA
- Implementation:Deepseek ai Janus Apply Sft Template JanusFlow
- Implementation:Confident ai Deepeval FaithfulnessMetric
- Implementation:Apache Flink HybridSourceReader PollNext
- Implementation:Infiniflow Ragflow MetadataManageModal Hooks
- Implementation:Openai Openai agents python ComputerTool Pattern
- Implementation:Mlc ai Mlc llm Encoding Header
Heuristics
- Heuristic:Iamhankai Forest of Thought UCB Exploration Constant
- Heuristic:Shiyu coder Kronos Gradient Clipping Strategy
- Heuristic:Pytorch Serve Torch Compile Best Practices
- Heuristic:Openai Openai node Retry Backoff Configuration
- Heuristic:BerriAI Litellm SSL Cipher Optimization
- Heuristic:Recommenders team Recommenders Test Timing Budgets
- Heuristic:Nautechsystems Nautilus trader Order Rate Limiting Configuration
- Heuristic:Protectai Llm guard PyTorch Compile Warmup
- Heuristic:Vespa engine Vespa Log Level Inheritance Polling
- Heuristic:Bigscience workshop Petals Batch Splitting Threshold
Environments
- Environment:Arize ai Phoenix OpenTelemetry SDK
- Environment:ArroyoSystems Arroyo Python UDF Runtime
- Environment:Explodinggradients Ragas Google Drive Backend Environment
- Environment:Google deepmind Mujoco MJX Warp CUDA Environment
- Environment:Nightwatchjs Nightwatch Selenium WebDriver 4
- Environment:Speechbrain Speechbrain PyTorch CUDA Runtime
- Environment:HKUDS AI Trader Python LangChain Runtime
- Environment:Intel Ipex llm NPU Environment
- Environment:Iterative Dvc Remote Storage Backends
- Environment:NVIDIA TransformerEngine CUDA Toolkit Requirements