Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Alibaba ROLL Agentic RL Training Pipeline
- Workflow:PrefectHQ Prefect Dbt Model Orchestration
- Workflow:Princeton nlp SimPO On Policy Data Generation
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:Heibaiying BigData Notes Kafka Producer Consumer Pipeline
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:Arize ai Phoenix Span Annotation Pipeline
- Workflow:Deepseek ai Janus Rectified Flow Image Generation
- Workflow:Lance format Lance Version Management
- Workflow:TA Lib Ta lib python Abstract API Usage
Principles
- Principle:Langfuse Langfuse Framework Detection and Type Mapping
- Principle:Open compass VLMEvalKit Dataset Base Class Hierarchy
- Principle:Evidentlyai Evidently Snapshot Storage
- Principle:Deepspeedai DeepSpeed Sequence Parallel Attention
- Principle:Fede1024 Rust rdkafka Mock Topic Management
- Principle:Duckdb Duckdb Source Amalgamation
- Principle:Tencent Ncnn Vulkan Inference Configuration
- Principle:FlagOpen FlagEmbedding Auto Reranker Loading
- Principle:Langgenius Dify Type Safety
- Principle:Cypress io Cypress Configuration Scaffolding
Implementations
- Implementation:Kubeflow Kubeflow Release Announcement Process
- Implementation:Mistralai Client python Azure ChatCompletionStreamRequest
- Implementation:Infiniflow Ragflow CommonHooks
- Implementation:Vespa engine Vespa Cluster
- Implementation:Lm sys FastChat Remote Logger
- Implementation:Kserve Kserve LocalModelNode Agent DaemonSet
- Implementation:Scikit learn Scikit learn SpectralClustering
- Implementation:Google deepmind Mujoco MJX Derivative
- Implementation:Open compass VLMEvalKit Build Judge
- Implementation:Apache Paimon DlfProvider
Heuristics
- Heuristic:Heibaiying BigData Notes HBase Connection Thread Safety Tip
- Heuristic:Langchain ai Langgraph Durability Mode Selection
- Heuristic:TA Lib Ta lib python NaN Propagation Behavior
- Heuristic:Microsoft DeepSpeedExamples ZeRO Inference Throughput Tuning
- Heuristic:Haifengl Smile Quarkus Async Context Handling
- Heuristic:Speechbrain Speechbrain Score Normalization Tips
- Heuristic:NVIDIA DALI Memory Pool Tuning
- Heuristic:PeterL1n BackgroundMattingV2 ONNX Patch Method Compatibility
- Heuristic:Dotnet Machinelearning FastTree Default Hyperparameters
- Heuristic:Allenai Open instruct BFloat16 Training
Environments
- Environment:Snorkel team Snorkel Dask Distributed
- Environment:Lance format Lance Rust Toolchain
- Environment:Mistralai Client python Realtime Transcription Environment
- Environment:Promptfoo Promptfoo Provider API Keys
- Environment:Avdvg InjectGuard CUDA GPU
- Environment:OpenBMB UltraFeedback Python GPU Environment
- Environment:Openai Evals Optional Provider APIs
- Environment:Scikit learn contrib Imbalanced learn Python Scikit learn
- Environment:LLMBook zh LLMBook zh github io Bitsandbytes Quantization Environment
- Environment:Apache Spark Kubernetes Runtime