Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search across ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:Mistralai Client python Text Embeddings
- Workflow:Huggingface Transformers 3D Parallel Distributed Training
- Workflow:CarperAI Trlx RLHF Summarization Pipeline
- Workflow:Snorkel team Snorkel Slice Aware Training
- Workflow:Nautechsystems Nautilus trader Backtest with BacktestEngine
- Workflow:Ucbepic Docetl Playground Interactive Development
- Workflow:Apache Spark Application Submission
- Workflow:Haifengl Smile Nearest Neighbor Search
- Workflow:Speechbrain Speechbrain Whisper ASR Finetuning
Principles
- Principle:Apache Dolphinscheduler State Reconciliation
- Principle:Sktime Pytorch forecasting Hyperparameter Optimization
- Principle:Microsoft Agent framework Tool Approval Configuration
- Principle:MarketSquare Robotframework browser Keyword Method Implementation
- Principle:Neuml Txtai API Security
- Principle:Speechbrain Speechbrain Speaker Embedding Precomputation
- Principle:Scikit learn contrib Imbalanced learn Sampler Compatibility Checking
- Principle:Kserve Kserve InferenceService Specification
- Principle:Helicone Helicone Asynchronous Log Queuing
- Principle:Axolotl ai cloud Axolotl Continuous Integration Testing
Implementations
- Implementation:Hiyouga LLaMA Factory Visual Model Utils
- Implementation:Promptfoo Promptfoo Config Schema
- Implementation:Haifengl Smile PerfectHash
- Implementation:PrefectHQ Prefect Hello World Example
- Implementation:Lance format Lance Java BTreeIndexParams
- Implementation:AUTOMATIC1111 Stable diffusion webui Run postprocessing
- Implementation:Volcengine Verl Toolcall Shaping Reward
- Implementation:Datahub project Datahub SaveIntoDataSourceCommandVisitor
- Implementation:ArroyoSystems Arroyo Preview Connector
- Implementation:DataTalksClub Data engineering zoomcamp Redpanda PySpark Streaming
Heuristics
- Heuristic:Avdvg InjectGuard Embedding Normalization Cosine Equivalence
- Heuristic:PacktPublishing LLM Engineers Handbook LoRA Finetuning Parameters
- Heuristic:Datahub project Datahub Docker Memory Preflight
- Heuristic:InternLM Lmdeploy KV Quantization Tradeoffs
- Heuristic:Langfuse Langfuse ClickHouse FINAL Skip Optimization
- Heuristic:Marker Inc Korea AutoRAG Hybrid Retrieval Score Normalization
- Heuristic:ARISE Initiative Robomimic Checkpoint Selection Strategy
- Heuristic:Huggingface Datasets Warning Deprecated Pandas Builder
- Heuristic:Apache Paimon File Sizing and Split Planning
- Heuristic:Openai CLIP L2 Normalization For Cosine Similarity
Environments
- Environment:Heibaiying BigData Notes Flink 1 9 Environment
- Environment:ChenghaoMou Text dedup Python 3 12 Environment
- Environment:Gretelai Gretel synthetics TensorFlow GPU Environment
- Environment:Dotnet Machinelearning OneDal Acceleration
- Environment:Alibaba MNN CPU Build Environment
- Environment:Unstructured IO Unstructured Libmagic
- Environment:Neuml Txtai GPU Accelerator Environment
- Environment:ArroyoSystems Arroyo Kubernetes Deployment
- Environment:Apache Dolphinscheduler Netty Runtime
- Environment:Togethercomputer Together python Fine Tuning Data Requirements