Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Scikit learn Scikit learn Supervised Classification
- Workflow:Mlc ai Web llm Chrome Extension Integration
- Workflow:Openai Openai agents python Human In The Loop Approval
- Workflow:PeterL1n BackgroundMattingV2 Video matting inference
- Workflow:Wandb Weave Tracing Setup
- Workflow:Huggingface Trl PPO RLHF Training
- Workflow:OpenHands OpenHands Conversation Lifecycle Management
- Workflow:Speechbrain Speechbrain Whisper ASR Finetuning
- Workflow:Evidentlyai Evidently Data Drift Monitoring
- Workflow:OWASP Www project top 10 for large language model applications Vulnerability Translation
Principles
- Principle:Promptfoo Promptfoo Documentation Site
- Principle:Farama Foundation Gymnasium Passive Environment Validation
- Principle:Deepspeedai DeepSpeed Pipeline Module Construction
- Principle:Huggingface Datasets TF Dataset Creation
- Principle:Vllm project Vllm Server Metrics Monitoring
- Principle:ClickHouse ClickHouse Lightweight JSON Parsing
- Principle:Cypress io Cypress Component Mounting
- Principle:Cypress io Cypress CI Environment Configuration
- Principle:Marker Inc Korea AutoRAG Serve And Monitor
- Principle:EvolvingLMMs Lab Lmms eval Model Inference
Implementations
- Implementation:Open compass VLMEvalKit Video Holmes
- Implementation:Run llama Llama index Node Recency Postprocessors
- Implementation:Mlflow Mlflow Prompt Version Entity
- Implementation:Treeverse LakeFS PrepareGarbageCollectionCommits
- Implementation:Apache Paimon TableType
- Implementation:Haosulab ManiSkill ActorBuilder TableSceneBuilder
- Implementation:Ggml org Ggml Magika inference
- Implementation:BerriAI Litellm Weights Biases Logger
- Implementation:Interpretml Interpret Harmonize Tensor
- Implementation:Apache Spark SparkAppHandle
Heuristics
- Heuristic:Zai org CogVideo Memory Optimization Strategies
- Heuristic:Facebookresearch Habitat lab Mini Batch Environment Divisibility
- Heuristic:Avdvg InjectGuard Sim K Threshold Tuning
- Heuristic:Kornia Kornia Morphology Engine Selection
- Heuristic:Promptfoo Promptfoo Warning Deprecated Cache Migration
- Heuristic:Predibase Lorax Warning Deprecated BitsAndBytes 8bit
- Heuristic:Microsoft LoRA Fan In Fan Out Transpose
- Heuristic:Marker Inc Korea AutoRAG Passage Filter Safety Minimum
- Heuristic:Huggingface Trl QLoRA BF16 Adapter Casting
- Heuristic:Puppeteer Puppeteer Navigation Race Condition Avoidance
Environments
- Environment:Anthropics Anthropic sdk python Python SDK Core Environment
- Environment:DataTalksClub Data engineering zoomcamp PySpark Batch Environment
- Environment:PacktPublishing LLM Engineers Handbook API Credentials
- Environment:Gretelai Gretel synthetics PyTorch CUDA Environment
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:Google research Deduplicate text datasets Python TFDS Environment
- Environment:Intel Ipex llm NPU Cpp Environment
- Environment:Scikit learn contrib Imbalanced learn Keras TensorFlow
- Environment:Pytorch Serve DeepSpeed Environment
- Environment:Diagram of thought Diagram of thought LLM API