Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Truera Trulens Snowflake Observability Pipeline
- Workflow:Cleanlab Cleanlab Multiannotator Consensus
- Workflow:PacktPublishing LLM Engineers Handbook Dataset Generation
- Workflow:Rapidsai Cuml Dimensionality Reduction
- Workflow:Deepspeedai DeepSpeed ZeRO Distributed Training
- Workflow:Open compass VLMEvalKit Adding Custom Benchmark
- Workflow:Togethercomputer Together python Batch Inference
- Workflow:Haifengl Smile Model Serving Pipeline
- Workflow:Ollama Ollama Model Registry Operations
- Workflow:InternLM Lmdeploy LLM Offline Batch Inference
Principles
- Principle:Trailofbits Fickling Hook Deactivation
- Principle:Bentoml BentoML Deployment Lifecycle Management
- Principle:Scikit learn contrib Imbalanced learn Adaptive Synthetic Sampling
- Principle:OWASP Www project top 10 for large language model applications Automated Vulnerability Scanning
- Principle:Apache Spark Dependency Bundling
- Principle:Huggingface Transformers Adapter Training
- Principle:DevExpress Testcafe Element Selection
- Principle:Sgl project Sglang Streaming Response Handling
- Principle:Scikit learn Scikit learn Density Estimation
- Principle:BerriAI Litellm Custom Logger Development
Implementations
- Implementation:Ucbepic Docetl Directive Base
- Implementation:Datahub project Datahub DataHubRestEmitter Init
- Implementation:Neuml Txtai CrossEncoder
- Implementation:ArroyoSystems Arroyo Udf Common Types
- Implementation:Mlc ai Mlc llm Convert weight
- Implementation:Deepspeedai DeepSpeed TPTrainingConfig Init
- Implementation:Sail sg LongSpec Math Equivalence Engine
- Implementation:Spcl Graph of thoughts SortingParser
- Implementation:CrewAIInc CrewAI Oxylabs Amazon Product Tool
- Implementation:Datajuicer Data juicer LLMPerplexityFilter
Heuristics
- Heuristic:Mlc ai Mlc llm Metal KV Cache Capacity Limit
- Heuristic:Vespa engine Vespa RPM Zstd Compression Settings
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Instance Norm for Multi GPU
- Heuristic:Deepspeedai DeepSpeed Vocabulary Tensor Core Alignment
- Heuristic:Protectai Llm guard Lazy Scanner Loading
- Heuristic:FlowiseAI Flowise Heap Memory Configuration
- Heuristic:ThreeSR Awesome Inference Time Scaling Date Parsing Fallback Tip
- Heuristic:Open compass VLMEvalKit Judge Model Selection By Dataset
- Heuristic:PacktPublishing LLM Engineers Handbook RAG Retrieval Parameters
- Heuristic:NVIDIA TransformerEngine FP8 Checkpoint Compatibility
Environments
- Environment:Neuml Txtai GPU Accelerator Detection
- Environment:Togethercomputer Together python Python SDK Runtime
- Environment:Intel Ipex llm CPU Finetuning Environment
- Environment:Tensorflow Tfjs Browser Runtime
- Environment:NVIDIA TransformerEngine GPU Compute Capability
- Environment:Deepspeedai DeepSpeed CPU Environment
- Environment:Langgenius Dify Redis And Celery Environment
- Environment:Huggingface Alignment handbook DeepSpeed Multi Node
- Environment:Allenai Open instruct vLLM Inference
- Environment:ArroyoSystems Arroyo Python UDF Runtime