Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki: best practices and expert-level knowledge for machine learning and data engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Speechbrain Speechbrain CTC ASR Training
- Workflow:Bentoml BentoML BentoCloud Deployment
- Workflow:Microsoft Semantic kernel Kernel Setup And Chat Completion
- Workflow:ChenghaoMou Text dedup Benchmark Evaluation
- Workflow:ClickHouse ClickHouse Contributing Pull Request
- Workflow:Dotnet Machinelearning AutoML Experiment
- Workflow:Norrrrrrr lyn WAInjectBench Text Prompt Injection Detection
- Workflow:Mlc ai Mlc llm Model Compilation
- Workflow:Scikit learn contrib Imbalanced learn Imbalanced Model Evaluation
- Workflow:Hpcaitech ColossalAI LLaMA Continual Pretraining
Principles
- Principle:Apache Spark Distribution Packaging
- Principle:Vespa engine Vespa Log Message Formatting
- Principle:ARISE Initiative Robomimic Dataset Loading
- Principle:LLMBook zh LLMBook zh github io Direct Preference Optimization
- Principle:Datajuicer Data juicer Data Selection
- Principle:TA Lib Ta lib python Abstract Data Input
- Principle:Google deepmind Mujoco Resource Provider
- Principle:Deepseek ai Janus CFG Input Preparation
- Principle:DataExpert io Data engineer handbook Database Seeding
- Principle:Facebookresearch Habitat lab Dataset and Scene Preparation
Implementations
- Implementation:Mage ai Mage ai GitHub Client
- Implementation:Treeverse LakeFS Java SDK Model Diff
- Implementation:Teamcapybara Capybara Node Actions Selection
- Implementation:Microsoft BIPIA RougeRecall
- Implementation:Open compass VLMEvalKit ChartMimic Color Evaluator Prefix
- Implementation:Elevenlabs Elevenlabs python OutboundSipTrunkConfigRequestModel
- Implementation:Openclaw Openclaw Chrome Extension Background
- Implementation:Apache Airflow MetricsRegistry
- Implementation:BerriAI Litellm NPM Lock
- Implementation:Microsoft Playwright Pixelmatch
Heuristics
- Heuristic:Apache Spark Serialization Optimization
- Heuristic:Openai Openai agents python Tool Choice Reset Prevents Loops
- Heuristic:Openai Openai agents python Default Max Turns Safety Limit
- Heuristic:Apache Kafka Container JMX RMI Port Tip
- Heuristic:Confident ai Deepeval Timeout and Retry Tuning
- Heuristic:Zai org CogVideo LoRA Configuration Tips
- Heuristic:Bentoml BentoML Worker Count Strategy
- Heuristic:Trailofbits Fickling Injection Mode Selection
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Allenai Open instruct Disable Dropout In RL
Environments
- Environment:Kubeflow Kubeflow Git GitHub Environment
- Environment:Guardrails ai Guardrails OpenTelemetry Tracing
- Environment:Huggingface Optimum Tensor Parallelization Environment
- Environment:Apache Airflow Development Contributor Environment
- Environment:Openai Openai node OpenAI API Credentials
- Environment:Hiyouga LLaMA Factory FP8 Training Environment
- Environment:Lm sys FastChat GPU CUDA Inference
- Environment:Kubeflow Kubeflow Kubectl Kustomize CLI Environment
- Environment:Spotify Luigi Tornado Web Server
- Environment:Sgl project Sglang Multi Platform Accelerators