Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Open r1 SFT Distillation
- Workflow:ARISE Initiative Robosuite Teleoperation
- Workflow:DataExpert io Data engineer handbook PySpark Iceberg Job Execution
- Workflow:Nautechsystems Nautilus trader Backtest with BacktestNode
- Workflow:Microsoft Playwright Trace recording and debugging
- Workflow:Langchain ai Langchain Streaming Responses
- Workflow:Recommenders team Recommenders News Recommendation NRMS
- Workflow:Groq Groq python Chat Completion
- Workflow:Mlflow Mlflow LLM Tracing
- Workflow:Mage ai Mage ai API Source Extraction
Principles
- Principle:Fastai Fastbook Neural Collaborative Filtering
- Principle:Langgenius Dify Service Layer Architecture
- Principle:DevExpress Testcafe Test File Compilation
- Principle:DistrictDataLabs Yellowbrick Dataset Loading
- Principle:Unstructured IO Unstructured File Type Detection
- Principle:Liu00222 Open Prompt Injection Text Segmentation
- Principle:Ggml org Ggml Detection Post Processing
- Principle:Hpcaitech ColossalAI GRPO Policy Loss
- Principle:Bentoml BentoML Service Class Definition
- Principle:Microsoft BIPIA Model Loading
Implementations
- Implementation:Langgenius Dify Service Common
- Implementation:Ucbepic Docetl OperationComponents
- Implementation:Mage ai Mage ai Couchbase Source
- Implementation:Marker Inc Korea AutoRAG Make Basic Gen Gt
- Implementation:Datahub project Datahub RestEmitter
- Implementation:Google deepmind Mujoco Render Context Header
- Implementation:Infiniflow Ragflow Admin Login Page
- Implementation:Neuml Txtai GGML ANN
- Implementation:Openai Openai python Request Transform Utils
- Implementation:FlagOpen FlagEmbedding MiniCPM Reranker Layerwise
Heuristics
- Heuristic:Snorkel team Snorkel NLP Preprocessor Memoization
- Heuristic:Run llama Llama index Finetuning Warmup Steps
- Heuristic:Danijar Dreamerv3 Symlog TwoHot Prediction
- Heuristic:Ucbepic Docetl Validation Retry Strategy
- Heuristic:CarperAI Trlx Delta Rewards
- Heuristic:OpenGVLab InternVL Packed Training Buffer Management
- Heuristic:ARISE Initiative Robomimic Data Worker Tuning By Modality
- Heuristic:Google deepmind Mujoco Mesh Quality For Collision
- Heuristic:NVIDIA NeMo Aligner Higher Stability Log Probs
- Heuristic:NVIDIA NeMo Aligner Warning Deprecated Repository
Environments
- Environment:Kornia Kornia ONNX Runtime Environment
- Environment:Alibaba MNN Python Export Environment
- Environment:Intel Ipex llm Windows Environment
- Environment:EvolvingLMMs Lab Lmms eval Python Runtime Environment
- Environment:Neuml Txtai API Server Configuration
- Environment:Dotnet Machinelearning OneDal Acceleration
- Environment:PacktPublishing LLM Engineers Handbook Python 3 11 Poetry Environment
- Environment:OpenHands OpenHands SaaS Server Environment
- Environment:NVIDIA NeMo Curator Ray Cluster
- Environment:Datahub project Datahub Python Ingestion