Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Huggingface Open r1 SFT Distillation
- Workflow:Datahub project Datahub Python Metadata Emission
- Workflow:Microsoft BIPIA White Box Defense Finetuning
- Workflow:ARISE Initiative Robosuite Domain Randomization Training
- Workflow:Huggingface Datatrove Synthetic Data Generation
- Workflow:FMInference FlexLLMGen HELM Benchmark Evaluation
- Workflow:Open compass VLMEvalKit Adding Custom VLM
- Workflow:Dotnet Machinelearning Binary Classification Pipeline
- Workflow:Isaac sim IsaacGymEnvs Custom Task Development
- Workflow:Openai Openai python Embeddings Generation
Principles
- Principle:DataExpert io Data engineer handbook Spark Session Configuration
- Principle:Avhz RustQuant Gradient Descent Optimization
- Principle:Kserve Kserve Model Lifecycle Management
- Principle:Ollama Ollama GGUF Model Conversion Lfm2
- Principle:Astronomer Astronomer cosmos Telemetry And Observability
- Principle:Neuml Txtai Base Model Configuration
- Principle:Gretelai Gretel synthetics Synthetic Text Generation
- Principle:Isaac sim IsaacGymEnvs Randomization Parameter Definition
- Principle:Kubeflow Pipelines XGBoost Model Training
- Principle:DataTalksClub Data engineering zoomcamp Chunked Data Ingestion
Implementations
- Implementation:Langgenius Dify I18n Language
- Implementation:Mit han lab Llm awq Device warmup
- Implementation:Junyanz Pytorch CycleGAN and pix2pix Pix2PixModel Optimize Parameters
- Implementation:Apache Shardingsphere MetaDataContextsInitFactory Create
- Implementation:Ggml org Ggml Sycl mmq
- Implementation:ARISE Initiative Robosuite Wrapper Base
- Implementation:Open compass VLMEvalKit ImageCaptionDataset
- Implementation:TobikoData Sqlmesh PlanBuilder Set Start End
- Implementation:Neuml Txtai Embeddings Search
- Implementation:Interpretml Interpret Explain Global
Heuristics
- Heuristic:Roboflow Rf detr EMA Best Checkpoint Strategy
- Heuristic:Apache Dolphinscheduler Netty Thread Sizing
- Heuristic:Vibrantlabsai Ragas Concurrency And Retry Configuration
- Heuristic:Protectai Llm guard Token Limit Early Guard
- Heuristic:Treeverse LakeFS Action Cache Wait Tip
- Heuristic:Hpcaitech ColossalAI Warmup Steps Heuristic
- Heuristic:Trailofbits Fickling Severity Threshold Selection
- Heuristic:Iterative Dvc Path Performance Optimization
- Heuristic:Alibaba ROLL GPU Memory Offload Strategy
- Heuristic:Duckdb Duckdb Unity Build Strategy
Environments
- Environment:Fede1024 Rust rdkafka Kafka Broker Runtime
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:Openai Openai node Node 20 Runtime
- Environment:Kserve Kserve Knative Serving
- Environment:Diagram of thought Diagram of thought Python Graph Libraries
- Environment:Apache Spark JDK Build Environment
- Environment:Huggingface Datatrove S3 Storage Environment
- Environment:Huggingface Datatrove IO Dependencies
- Environment:Sgl project Sglang Grafana
- Environment:Wandb Weave LLM Integration Dependencies