Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:OpenHands OpenHands Organization Onboarding
- Workflow:PacktPublishing LLM Engineers Handbook Dataset Generation
- Workflow:Run llama Llama index ReAct Agent
- Workflow:Apache Kafka Topic Management
- Workflow:Apache Hudi Flink Streaming Write
- Workflow:OWASP Www project top 10 for large language model applications Vulnerability Translation
- Workflow:Haosulab ManiSkill Motion Planning Demo Generation
- Workflow:Roboflow Rf detr ONNX Export
- Workflow:Sgl project Sglang Structured Output Generation
Principles
- Principle:Apache Druid SQL Task Submission
- Principle:Dotnet Machinelearning SSA Model Fitting
- Principle:Vllm project Vllm Server Metrics Monitoring
- Principle:LMCache LMCache Disaggregated Proxy Routing
- Principle:Pyro ppl Pyro Neural Module Registration
- Principle:Evidentlyai Evidently Dataset Score Extraction
- Principle:Princeton nlp Tree of thought llm Task Instantiation
- Principle:Apache Hudi Read Environment Configuration
- Principle:Ollama Ollama OpenAI Route Registration
- Principle:Teamcapybara Capybara Path Assertion
Implementations
- Implementation:Infiniflow Ragflow DatasetActionCell Component
- Implementation:Microsoft DeepSpeedExamples Create DSVL Model
- Implementation:Lance format Lance JNI Traits
- Implementation:Vllm project Vllm SGL MoE FP8
- Implementation:NVIDIA TransformerEngine Common Header
- Implementation:Microsoft DeepSpeedExamples Alpaca Training Dataset
- Implementation:Online ml River Tree Nodes Branch
- Implementation:Mlflow Mlflow Register Prompt
- Implementation:Google deepmind Dm control Suite Walker
- Implementation:LMCache LMCache Remote Backend
Heuristics
- Heuristic:FlowiseAI Flowise Document Loader Bypass Optimization
- Heuristic:Recommenders team Recommenders SAR Cold Start Items
- Heuristic:ClickHouse ClickHouse Test Writing Conventions
- Heuristic:Apache Airflow Memory Management Tips
- Heuristic:Guardrails ai Guardrails RAIL Argument Parsing Security
- Heuristic:DataExpert io Data engineer handbook SparkSession Singleton Pattern
- Heuristic:Mlc ai Mlc llm FlashInfer KV Cache Fallback
- Heuristic:ArroyoSystems Arroyo Async UDF Concurrency
- Heuristic:Dotnet Machinelearning Sparsity Threshold Optimization
- Heuristic:Google deepmind Mujoco MJX Benchmarking Tips
Environments
- Environment:Vibrantlabsai Ragas Google Drive Backend Environment
- Environment:Nautechsystems Nautilus trader Arrow Parquet Serialization
- Environment:Dotnet Machinelearning TorchSharp Environment
- Environment:Apache Paimon Python Core Runtime
- Environment:Liu00222 Open Prompt Injection CUDA Environment
- Environment:Anthropics Anthropic sdk python Azure Foundry Environment
- Environment:Kubeflow Pipelines KFP Backend Deployment
- Environment:Infiniflow Ragflow Python Runtime
- Environment:Ray project Ray Python Runtime Environment
- Environment:Dagster io Dagster Container Resource Monitoring