Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:ThreeSR Awesome Inference Time Scaling Manual Paper Contribution
- Workflow:Pola rs Polars Data IO and Format Conversion
- Workflow:Fastai Fastbook NLP Text Classification
- Workflow:ArroyoSystems Arroyo Local Pipeline Execution
- Workflow:DataExpert io Data engineer handbook AB Experimentation Server
- Workflow:Recommenders team Recommenders ALS Spark Recommendation
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:Intel Ipex llm vLLM Serving
- Workflow:Facebookresearch Habitat lab Agent Benchmarking
- Workflow:Microsoft Semantic kernel Vector Store RAG Pipeline
Principles
- Principle:Tensorflow Serving Target Interface
- Principle:Datajuicer Data juicer Pipeline Monitoring and Checkpointing
- Principle:Helicone Helicone Cost Rate Lookup
- Principle:LaurentMazare Tch rs Generated Tensor Operations
- Principle:Allenai Open instruct SFT Dataset Preparation
- Principle:Dagster io Dagster Documentation URL Redirect Management
- Principle:FlowiseAI Flowise Vector Store Query
- Principle:Pytorch Serve Recommendation Model Serving
- Principle:Haotian liu LLaVA LoRA Training
- Principle:Isaac sim IsaacGymEnvs Skeletal Motion Representation
Implementations
- Implementation:Microsoft Autogen DatabaseManager Operations
- Implementation:TobikoData Sqlmesh FilterableList
- Implementation:SeleniumHQ Selenium CacheLookup Annotation
- Implementation:InternLM Lmdeploy SamplingTopkKernels
- Implementation:Getgauge Taiko Intercept
- Implementation:Evidentlyai Evidently SDK Artifacts
- Implementation:Openai Evals SelfConsistencySolver
- Implementation:Recommenders team Recommenders Spark Random Split
- Implementation:Puppeteer Puppeteer Browsers Main
- Implementation:Protectai Llm guard Output EmotionDetection
Heuristics
- Heuristic:FMInference FlexLLMGen Weight Compression 4bit
- Heuristic:Mlfoundations Open flamingo Gradient Clipping Max Norm
- Heuristic:Zai org CogVideo Frame Count and Resolution Constraints
- Heuristic:Mlfoundations Open flamingo KV Cache Classification Optimization
- Heuristic:Spotify Luigi PySpark Task Serialization
- Heuristic:CrewAIInc CrewAI MCP Timeout And Retry Strategy
- Heuristic:Huggingface Transformers FSDP Activation Checkpointing Tip
- Heuristic:Kubeflow Pipelines Cache Staleness In Recursive Pipelines
- Heuristic:TA Lib Ta lib python Compatibility Mode Switching
- Heuristic:Marker Inc Korea AutoRAG Batch Size Tuning
Environments
- Environment:Huggingface Peft Optional Quantization Backends
- Environment:Truera Trulens Python Core Environment
- Environment:LMCache LMCache NIXL Transfer Library
- Environment:NVIDIA DALI CUDA GPU Environment
- Environment:CARLA simulator Carla Simulation Runtime
- Environment:Openai Openai agents python Memory Extensions Dependencies
- Environment:Alibaba MNN GPU CUDA Environment
- Environment:Turboderp org Exllamav2 CUDA GPU Runtime
- Environment:VainF Torch Pruning LLM Pruning Dependencies
- Environment:OWASP Www project top 10 for large language model applications Pydantic Invoice Agent Runtime