Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Puppeteer Puppeteer Page Screenshot Capture
- Workflow:Mistralai Client python Text Embeddings
- Workflow:Deepspeedai DeepSpeed Pipeline Parallel Training
- Workflow:Heibaiying BigData Notes Flink Kafka Streaming Pipeline
- Workflow:Danijar Dreamerv3 Evaluation Only
- Workflow:PrefectHQ Prefect Asset Based Data Pipeline
- Workflow:Truera Trulens RAG With Guardrails
- Workflow:MaterializeInc Materialize Upgrade Testing
- Workflow:Langgenius Dify Workflow Builder and Execution
- Workflow:Datahub project Datahub Java SDK Metadata Emission
Principles
- Principle:Openai CLIP Contrastive Similarity Prediction
- Principle:Trailofbits Fickling Hook Deactivation
- Principle:Axolotl ai cloud Axolotl Vision Language Model Loading
- Principle:Apache Spark K8s Prerequisites Verification
- Principle:Treeverse LakeFS Merge
- Principle:ClickHouse ClickHouse TCP Server Startup
- Principle:Protectai Llm guard JSON Output Validation
- Principle:Kserve Kserve Model Promotion Rollback
- Principle:Haosulab ManiSkill Asset Download Loading
- Principle:Togethercomputer Together python Sandboxed Code Interpreter
Implementations
- Implementation:Vibrantlabsai Ragas Annotated Data Sample
- Implementation:Datahub project Datahub RequiresMutable
- Implementation:Cohere ai Cohere python BatchesClient
- Implementation:Spotify Luigi DataprocTask
- Implementation:Apache Shardingsphere StandaloneContextManagerBuilder Build
- Implementation:Bentoml BentoML Runner Container
- Implementation:SeldonIO Seldon core Seldon Pipeline CRD
- Implementation:Open compass VLMEvalKit CreationMMBenchDataset
- Implementation:Fede1024 Rust rdkafka Tokio Spawn Blocking
- Implementation:LMCache LMCache Base Cache Policy
Heuristics
- Heuristic:Google research Deduplicate text datasets HACKSIZE Overlap Buffer
- Heuristic:Huggingface Alignment handbook QLoRA Learning Rate Scaling
- Heuristic:OpenBMB UltraFeedback Score 10 Anomaly Correction
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:LLMBook zh LLMBook zh github io Deduplication Ngram Threshold
- Heuristic:Kserve Kserve Server Side Apply For CRDs
- Heuristic:Huggingface Peft RSLoRA Scaling
- Heuristic:Open compass VLMEvalKit API Retry With Random Delay
- Heuristic:Openai Openai python Retry Backoff Strategy
- Heuristic:Datajuicer Data juicer Operator Fusion Rules
Environments
- Environment:Apache Kafka Committer Tools Environment
- Environment:Bitsandbytes foundation Bitsandbytes CUDA GPU Runtime
- Environment:Vllm project Vllm ROCm
- Environment:Speechbrain Speechbrain HuggingFace Transformers
- Environment:Deepspeedai DeepSpeed CPU Environment
- Environment:Scikit learn Scikit learn Python Runtime Environment
- Environment:Nightwatchjs Nightwatch Android Mobile Testing
- Environment:Togethercomputer Together python API Credentials
- Environment:TobikoData Sqlmesh Dbt Compatibility
- Environment:Facebookresearch Habitat lab CUDA GPU Training Environment