Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft LoRA NLU GLUE Finetuning
- Workflow:Pola rs Polars DataFrame Aggregation and Grouping
- Workflow:Mage ai Mage ai SQL Database Source Extraction
- Workflow:Guardrails ai Guardrails Custom Validator Development
- Workflow:Danijar Dreamerv3 Train And Evaluate
- Workflow:Sgl project Sglang Frontend Language Multi Turn Chat
- Workflow:InternLM Lmdeploy LLM Offline Batch Inference
- Workflow:Haosulab ManiSkill Sim2Real Deployment
- Workflow:Hiyouga LLaMA Factory PPO RLHF Training
- Workflow:Confident ai Deepeval Component Level LLM Evaluation
Principles
- Principle:Risingwavelabs Risingwave Iceberg Sink Integration
- Principle:Allenai Open instruct vLLM Weight Sync
- Principle:Princeton nlp Tree of thought llm DFS Tree Search
- Principle:Huggingface Datatrove C4 Quality Filtering
- Principle:Gretelai Gretel synthetics Training Configuration
- Principle:Helicone Helicone Feature Gating
- Principle:Protectai Llm guard Prompt Injection Detection
- Principle:Run llama Llama index Ingestion Pipeline Construction
- Principle:LaurentMazare Tch rs Seq2Seq Dataset Loading
- Principle:Cleanlab Cleanlab Token Issue Display
Implementations
- Implementation:SeldonIO Seldon core Operator Main
- Implementation:Ucbepic Docetl TopKOperation Execute
- Implementation:CarperAI Trlx NeMo ILQL Trainer
- Implementation:Ucbepic Docetl ResizableDataTable
- Implementation:Apache Shardingsphere DataSourceUnitPersistService Persist
- Implementation:Cohere ai Cohere python ConnectorsClient
- Implementation:Farama Foundation Gymnasium JaxToNumpy
- Implementation:Openai Openai python Microphone Helper
- Implementation:Openai Openai python Web Search Preview Tool
- Implementation:Google deepmind Mujoco Platform UI Adapter
Heuristics
- Heuristic:Lm sys FastChat Conversation Splitting Token Buffer
- Heuristic:AUTOMATIC1111 Stable diffusion webui NaN Detection And Precision Fixes
- Heuristic:Deepset ai Haystack Document Splitting Defaults
- Heuristic:Shiyu coder Kronos Two Stage Finetuning Strategy
- Heuristic:Treeverse LakeFS Warning Deprecated InternalApi Methods
- Heuristic:FMInference FlexLLMGen Sequence Length Alignment
- Heuristic:LaurentMazare Tch rs CuDNN Benchmark Mode
- Heuristic:Microsoft DeepSpeedExamples RLHF Hyperparameter Guide
- Heuristic:Fede1024 Rust rdkafka Manual Offset Store Pattern
- Heuristic:Eric mitchell Direct preference optimization FSDP Batch Size Per GPU
Environments
- Environment:Obss Sahi Python Pycocotools
- Environment:Vllm project Vllm Buildkite
- Environment:Microsoft BIPIA Python CUDA GPU Environment
- Environment:Ollama Ollama CGo Runtime
- Environment:Intel Ipex llm Portable Environment
- Environment:Getgauge Taiko Node Runtime
- Environment:Unstructured IO Unstructured Profiling Tools
- Environment:Lakeraai Pint benchmark Python 310 With Pandas
- Environment:Alibaba ROLL Megatron Training Environment
- Environment:ClickHouse ClickHouse OpenSSL Runtime