Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Haifengl Smile Nearest Neighbor Search
- Workflow:NVIDIA DALI Object Detection Training TensorFlow
- Workflow:Volcengine Verl Vision Language Model RL Training
- Workflow:Ggml org Ggml GPT2 Text Generation
- Workflow:Risingwavelabs Risingwave CDC Data Replication
- Workflow:Deepspeedai DeepSpeed Inference Engine Optimization
- Workflow:Rapidsai Cuml GPU Clustering
- Workflow:Google deepmind Dm control Multi Agent Soccer Setup
- Workflow:FlowiseAI Flowise Chatbot Deployment
- Workflow:Huggingface Datatrove Summary Statistics
Principles
- Principle:Ggml org Llama cpp Multimodal Language Model Loading
- Principle:Romsto Speculative Decoding Rejection Sampling Adjustment
- Principle:Apache Hudi Streaming Write Execution
- Principle:Apache Spark PR Merge Workflow
- Principle:Diagram of thought Diagram of thought Iterative Propose and Critique
- Principle:Mlflow Mlflow Prompt Template Design
- Principle:Iterative Dvc Dependency Graph Construction
- Principle:FMInference FlexLLMGen Model Compression Configuration
- Principle:Anthropics Anthropic sdk python Streaming Structured Output
- Principle:Promptfoo Promptfoo CLI Utilities
Implementations
- Implementation:NVIDIA NeMo Aligner Process Anthropic HH Chat Prompt
- Implementation:InternLM Lmdeploy KvCacheUtils
- Implementation:Run llama Llama index CrossEncoderFinetuneEngine
- Implementation:ArroyoSystems Arroyo Redis Lookup
- Implementation:Speechbrain Speechbrain AMI Diarization Experiment
- Implementation:Haosulab ManiSkill Articulation
- Implementation:Microsoft Onnxruntime CUDA ResizeGrad
- Implementation:Hiyouga LLaMA Factory KTransformers Integration
- Implementation:Facebookresearch Audiocraft MultiPeriodDiscriminator
- Implementation:Openai Whisper DTW
Heuristics
- Heuristic:OpenRLHF OpenRLHF Gradient Checkpointing Memory Tip
- Heuristic:Cypress io Cypress V8 Snapshot Memory
- Heuristic:Guardrails ai Guardrails RAIL Argument Parsing Security
- Heuristic:Vibrantlabsai Ragas Warning Deprecated Legacy LLM Wrappers
- Heuristic:Openai Whisper SDPA Disabling For Attention Extraction
- Heuristic:Microsoft Autogen Warning Deprecated JSON Env Files
- Heuristic:Facebookresearch Habitat lab VER Tuning Guidelines
- Heuristic:Unstructured IO Unstructured Multi Python Matrix
- Heuristic:Openai Openai node Stream Usage Interruption
- Heuristic:Spotify Luigi Streaming MapReduce Processing
Environments
- Environment:Mage ai Mage ai Singer SDK And Joblib Runtime
- Environment:Kornia Kornia ONNX Runtime Environment
- Environment:Kubeflow Kubeflow Kubernetes Cluster Environment
- Environment:Spcl Graph of thoughts Local LLaMA GPU Inference
- Environment:VainF Torch Pruning CUDA GPU Benchmarking
- Environment:Alibaba MNN Python Export Environment
- Environment:Datahub project Datahub Python Ingestion
- Environment:Tencent Ncnn PyTorch Environment
- Environment:Protectai Llm guard ONNX Runtime Acceleration
- Environment:Huggingface Datatrove IO Dependencies