Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Microsoft BIPIA White Box Defense Finetuning
- Workflow:Huggingface Trl Supervised Finetuning
- Workflow:Microsoft DeepSpeedExamples VisualChat Multimodal Training
- Workflow:Huggingface Alignment handbook SFT DPO Alignment Pipeline
- Workflow:DataExpert io Data engineer handbook PySpark Job Testing
- Workflow:Google deepmind Dm control Composer Environment Building
- Workflow:PrefectHQ Prefect Asset Based Data Pipeline
- Workflow:Ggml org Llama cpp Interactive Chat
- Workflow:Google deepmind Mujoco Model compilation and conversion
- Workflow:Datahub project Datahub CLI Metadata Ingestion
Principles
- Principle:Risingwavelabs Risingwave Environment Configuration
- Principle:Open compass VLMEvalKit Model Registration
- Principle:Eventual Inc Daft Row Wise UDF
- Principle:Google research Deduplicate text datasets TFDS Deduplication Application
- Principle:Langchain ai Langchain Document Indexing
- Principle:Ggml org Llama cpp Model Conversion
- Principle:Tencent Ncnn Inference Benchmarking
- Principle:Google research Deduplicate text datasets Substring Occurrence Querying
- Principle:Huggingface Datasets Dataset Shuffling
- Principle:ARISE Initiative Robosuite Composite Object Construction
Implementations
- Implementation:Ggml org Ggml Gguf add tensor
- Implementation:Apache Airflow DAG Distribution Config
- Implementation:CrewAIInc CrewAI Bedrock Browser Toolkit
- Implementation:Elevenlabs Elevenlabs python WorkflowOverrideAgentNodeModelInput
- Implementation:Google deepmind Dm control Suite Acrobot
- Implementation:Sktime Pytorch forecasting Tuner Lr Find
- Implementation:Microsoft Playwright Client Electron
- Implementation:Bentoml BentoML Gradio Mount
- Implementation:ChenghaoMou Text dedup Jaccard Similarity Func
- Implementation:LMCache LMCache Audit Connector
Heuristics
- Heuristic:Protectai Modelscan Graceful Scanner Degradation
- Heuristic:SeleniumHQ Selenium Bazel Hermetic Build Requirement
- Heuristic:Unslothai Unsloth LoRA Rank Selection
- Heuristic:Microsoft DeepSpeedExamples Gradient Checkpointing Tradeoff
- Heuristic:Ucbepic Docetl Validation Retry Strategy
- Heuristic:Huggingface Diffusers Dtype Precision Selection
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Zai org CogVideo Training Hyperparameter Defaults
- Heuristic:Huggingface Trl Distributed Device Map Override
- Heuristic:Facebookresearch Habitat lab DDPPO Straggler Preemption
Environments
- Environment:Cohere ai Cohere python Cohere API Credentials
- Environment:PacktPublishing LLM Engineers Handbook Docker MongoDB Qdrant Infrastructure
- Environment:Alibaba MNN HuggingFace Ecosystem Environment
- Environment:Dagster io Dagster Container Resource Monitoring
- Environment:Hpcaitech ColossalAI CUDA GPU Environment
- Environment:Scikit learn contrib Imbalanced learn Keras TensorFlow
- Environment:Nightwatchjs Nightwatch BrowserStack Cloud
- Environment:Recommenders team Recommenders Python Core Dependencies
- Environment:Arize ai Phoenix Phoenix Server Runtime
- Environment:FlowiseAI Flowise Node Runtime Environment