Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Nightwatchjs Nightwatch Page Object Pattern
- Workflow:Ray project Ray Actor Lifecycle Management
- Workflow:Kubeflow Pipelines Pipeline Control Flow
- Workflow:Lm sys FastChat LoRA QLoRA Finetuning
- Workflow:Datajuicer Data juicer Distributed Ray Processing
- Workflow:Ggml org Llama cpp Speculative Decoding
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:Treeverse LakeFS Write Audit Publish With Hooks
- Workflow:Lance format Lance Vector Search Pipeline
- Workflow:EvolvingLMMs Lab Lmms eval Custom Task Creation
Principles
- Principle:Neuml Txtai Tool Assembly
- Principle:Risingwavelabs Risingwave CDC Snapshot Verification
- Principle:Farama Foundation Gymnasium Video Frame Saving
- Principle:Elevenlabs Elevenlabs python Text Chunking
- Principle:Huggingface Transformers Adapter Weight Saving
- Principle:Openai Whisper Audio Padding And Trimming
- Principle:Deepspeedai DeepSpeed AutoTP Configuration
- Principle:Eric mitchell Direct preference optimization Model Loading
- Principle:Scikit learn Scikit learn Score Distribution Analysis
- Principle:Nightwatchjs Nightwatch Page Commands
Implementations
- Implementation:Neuml Txtai Embeddings Index
- Implementation:Microsoft LoRA DART Train Dataset
- Implementation:Microsoft Agent framework Declarative Tool Function Pattern
- Implementation:Helicone Helicone XAI Model Definitions
- Implementation:Microsoft Onnxruntime OnnxSequence
- Implementation:Openai Openai python Response Input Audio Param
- Implementation:NVIDIA TransformerEngine JAX Attention
- Implementation:Scikit learn Scikit learn ComputeClassWeight
- Implementation:BerriAI Litellm Core Helpers
- Implementation:Dotnet Machinelearning OneDalAlgorithms
Heuristics
- Heuristic:TobikoData Sqlmesh Execution Time Caching
- Heuristic:Unslothai Unsloth LoRA Rank Selection
- Heuristic:OpenHands OpenHands Streamable HTTP Over SSE
- Heuristic:NVIDIA DALI Distributed Sharding Strategy
- Heuristic:PacktPublishing LLM Engineers Handbook Dataset Generation Quality Filters
- Heuristic:Langgenius Dify Credential Sanitization In API Responses
- Heuristic:Spcl Graph of thoughts Four Bit Quantization For Local LLMs
- Heuristic:DataTalksClub Data engineering zoomcamp CSV Chunk Size Optimization
- Heuristic:SeldonIO Seldon core Over Commit Memory Tip
- Heuristic:Openai CLIP JIT Vs Non JIT Loading
Environments
- Environment:Microsoft Agent framework API Credentials
- Environment:NVIDIA DALI FFmpeg Environment
- Environment:Marker Inc Korea AutoRAG VLLM Environment
- Environment:Hpcaitech ColossalAI CUDA GPU Environment
- Environment:Datajuicer Data juicer GPU CUDA Environment
- Environment:CrewAIInc CrewAI Python Runtime Environment
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:Microsoft Autogen Studio Server Environment
- Environment:Cohere ai Cohere python Python SDK Runtime
- Environment:AUTOMATIC1111 Stable diffusion webui GPU Compute Backend