Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Infiniflow Ragflow Document Processing Pipeline
- Workflow:Datahub project Datahub Protobuf Schema Ingestion
- Workflow:Vespa engine Vespa Logging framework initialization
- Workflow:Apache Beam Dataflow Streaming Execution
- Workflow:Allenai Open instruct Tulu3 Full Post Training
- Workflow:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Data Ingestion
- Workflow:Iamhankai Forest of Thought Game24 Forest Solving
- Workflow:Huggingface Open r1 SFT Distillation
- Workflow:Ray project Ray Serve Deployment
- Workflow:Vespa engine Vespa Linguistics text processing pipeline
Principles
- Principle:Googleapis Python genai Pagination
- Principle:DataExpert io Data engineer handbook DataFrame Write To Table
- Principle:Apache Beam Heartbeat and Refresh
- Principle:Sdv dev SDV Gaussian Copula Synthesis
- Principle:Scikit learn contrib Imbalanced learn One Sided Selection
- Principle:Facebookresearch Audiocraft Training Checkpoint Resolution
- Principle:Langchain ai Langgraph Interrupt Stream Execution
- Principle:Online ml River Estimator Base Architecture
- Principle:ARISE Initiative Robosuite Simulation Loop
- Principle:Microsoft Onnxruntime Distributed Checkpoint Management
Implementations
- Implementation:Mlc ai Mlc llm Top P Pivot
- Implementation:LMCache LMCache Connector V1
- Implementation:Microsoft LoRA Download GLUE Data
- Implementation:Datajuicer Data juicer Visualize App
- Implementation:Sgl project Sglang CUTLASS Epilogue Scale
- Implementation:Tensorflow Tfjs RandomHeight Layer
- Implementation:Microsoft Playwright WebKit Protocol Types
- Implementation:NVIDIA NeMo Aligner Train SteerLM2
- Implementation:Truera Trulens TruApp Wrapper
- Implementation:Googleapis Python genai Sphinx EnglishStemmer
Heuristics
- Heuristic:Sdv dev SDV CTGAN Column Performance
- Heuristic:Mlc ai Web llm Service Worker Keep Alive
- Heuristic:Vibrantlabsai Ragas Warning Deprecated V1 Metrics
- Heuristic:PacktPublishing LLM Engineers Handbook DPO Training Configuration
- Heuristic:Nightwatchjs Nightwatch Node 17 DNS IPv4 Workaround
- Heuristic:Huggingface Transformers Gradient Checkpointing Memory Tradeoff
- Heuristic:CARLA simulator Carla Traffic Manager Sync Mode
- Heuristic:Eric mitchell Direct preference optimization FSDP Mixed Precision BFloat16
- Heuristic:Groq Groq python Retry Backoff Strategy
- Heuristic:Bigscience workshop Petals NF4 Quantization Default On CUDA
Environments
- Environment:Shiyu coder Kronos Qlib Data Environment
- Environment:OpenGVLab InternVL PyTorch CUDA
- Environment:Mit han lab Llm awq Flash Attention Environment
- Environment:Intel Ipex llm RAG LangChain Environment
- Environment:Cypress io Cypress Browser Requirements
- Environment:ContextualAI HALOs CUDA 12 1 Training Environment
- Environment:Huggingface Datatrove IO Dependencies
- Environment:Huggingface Open r1 vLLM Server
- Environment:Zai org CogVideo Diffusers Inference Environment
- Environment:Helicone Helicone Wrangler CLI