Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1,000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Transformers Model Benchmarking
- Workflow:Predibase Lorax OpenAI Chat Completion
- Workflow:TobikoData Sqlmesh Github CICD automation
- Workflow:Microsoft Playwright Browser automation CLI
- Workflow:FMInference FlexLLMGen HELM Benchmark Evaluation
- Workflow:Apache Paimon Vector Similarity Search
- Workflow:Triton inference server Server Custom Container Build
- Workflow:Hiyouga LLaMA Factory Full Parameter SFT
- Workflow:Microsoft DeepSpeedExamples RLHF Training Pipeline
- Workflow:Sgl project Sglang Structured Output Generation
Principles
- Principle:Recommenders team Recommenders Benchmark Prediction Generation
- Principle:Openai Openai python Response Creation
- Principle:Huggingface Transformers Output Postprocessing
- Principle:Huggingface Peft AdaLoRA Rank Allocation
- Principle:Pytorch Serve GRPC Communication
- Principle:Nautechsystems Nautilus trader Catalog Data Querying
- Principle:Openai Openai agents python Tool Output Guardrail Definition
- Principle:Iamhankai Forest of Thought Answer Extraction
- Principle:Openclaw Openclaw Credential Acquisition
- Principle:Sail sg LongSpec Metrics Collection
Implementations
- Implementation:Treeverse LakeFS Java SDK ObjectsApi
- Implementation:Avhz RustQuant Currency
- Implementation:Huggingface Peft C3AModel
- Implementation:OpenGVLab InternVL Optimizer Builder
- Implementation:Huggingface Peft GraloraConfig
- Implementation:Vibrantlabsai Ragas SemanticSimilarityV2
- Implementation:Cohere ai Cohere python ChatMessage Model
- Implementation:Microsoft Onnxruntime JsPackageLock
- Implementation:Mage ai Mage ai Tableau Streams
- Implementation:NVIDIA DALI Cpplint
Heuristics
- Heuristic:Scikit learn contrib Imbalanced learn Sampling Before Split Leakage
- Heuristic:Iterative Dvc Path Performance Optimization
- Heuristic:Danijar Dreamerv3 Percentile Return Normalization
- Heuristic:Mbzuai oryx Awesome LLM Post training Paper Deduplication Via Dict
- Heuristic:Zai org CogVideo Memory Optimization Strategies
- Heuristic:Vespa engine Vespa Maven Parallel Build Optimization
- Heuristic:Fede1024 Rust rdkafka Librdkafka Debug Logging
- Heuristic:Apache Kafka Container JMX RMI Port Tip
- Heuristic:Alibaba ROLL Dynamic Batching Token Limits
- Heuristic:Bigscience workshop Petals Batch Splitting Threshold
Environments
- Environment:Sgl project Sglang Python Dependencies
- Environment:Huggingface Open r1 CUDA Environment
- Environment:Scikit learn Scikit learn Python Runtime Environment
- Environment:Microsoft Autogen LLM Provider API Keys
- Environment:Openclaw Openclaw Mintlify Documentation Platform
- Environment:Nautechsystems Nautilus trader Databento API Credentials
- Environment:Intel Ipex llm Linux XPU Environment
- Environment:Huggingface Datatrove IO Dependencies
- Environment:Apache Airflow Database Backend Environment
- Environment:Testtimescaling Testtimescaling github io GitHub Actions Runner