Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Kafka Topic Management
- Workflow:Apache Kafka Broker Startup
- Workflow:Run llama Llama index Embedding Finetuning
- Workflow:Haosulab ManiSkill Sim2Real Deployment
- Workflow:Confident ai Deepeval End to End LLM Evaluation
- Workflow:Cohere ai Cohere python Semantic Search With Rerank
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:Vllm project Vllm Vision Language Inference
- Workflow:Getgauge Taiko Form Interaction Testing
- Workflow:Microsoft Playwright Codegen test recording
Principles
- Principle:Eric mitchell Direct preference optimization Checkpoint Saving
- Principle:Microsoft Semantic kernel OpenAPI Plugin Import
- Principle:Princeton nlp Tree of thought llm Thought Generation
- Principle:Google research Deduplicate text datasets Duplicate Range Collection
- Principle:FMInference FlexLLMGen GEMM Performance Testing
- Principle:Apache Airflow Docker Image Preparation
- Principle:Sail sg LongSpec Multi Stage Training
- Principle:Mlflow Mlflow API Compatibility Checking
- Principle:Alibaba ROLL Reward Flow Configuration
- Principle:Neuml Txtai Training Data Preparation
Implementations
- Implementation:Apache Hudi CompactionCommitSink Invoke
- Implementation:Ggml org Llama cpp Pydantic Grammar Examples
- Implementation:Volcengine Verl Datasets Load Dataset
- Implementation:Microsoft Playwright ChannelOwner
- Implementation:OpenGVLab InternVL Wrap Backbone LoRA
- Implementation:Deepspeedai DeepSpeed DeepSpeedEngine Checkpoint
- Implementation:Teamcapybara Capybara CSSBuilder
- Implementation:Ollama Ollama Llama Context Header
- Implementation:Google deepmind Mujoco User Flexcomp Header
- Implementation:DistrictDataLabs Yellowbrick TSNEVisualizer
Heuristics
- Heuristic:Protectai Modelscan Unknown Opcodes Assume Critical
- Heuristic:Anthropics Anthropic sdk python Warning Deprecated LegacyAPIResponse
- Heuristic:Openai Openai agents python GPT 5 Reasoning Settings
- Heuristic:Microsoft BIPIA LLAMA Pad Token Workaround
- Heuristic:OpenHands OpenHands Redis Distributed Locking
- Heuristic:Snorkel team Snorkel Binary Only Slicing
- Heuristic:Helicone Helicone ClickHouse ReplacingMergeTree FINAL
- Heuristic:Haosulab ManiSkill Physics Solver Tuning
- Heuristic:Google deepmind Dm control Warning Deprecated Legacy Base Walker
- Heuristic:Evidentlyai Evidently Drift Detection Thresholds
Environments
- Environment:Apache Airflow Database Backend Environment
- Environment:Explodinggradients Ragas LLM Provider Environment
- Environment:Spotify Luigi Tornado Web Server
- Environment:SeldonIO Seldon core GPU Inference Environment
- Environment:Bitsandbytes foundation Bitsandbytes ROCm AMD Environment
- Environment:Facebookresearch Audiocraft AudioCraft Environment Variables
- Environment:Kornia Kornia ONNX Runtime Environment
- Environment:Heibaiying BigData Notes Storm 1 2 Environment
- Environment:Pyro ppl Pyro Funsor Backend
- Environment:Online ml River Build Toolchain