Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Marker Inc Korea AutoRAG Evaluation Data Creation
- Workflow:Hpcaitech ColossalAI LLaMA Continual Pretraining
- Workflow:Puppeteer Puppeteer Request Interception
- Workflow:Apache Hudi Flink MOR Compaction
- Workflow:Roboflow Rf detr Roboflow Deployment
- Workflow:Huggingface Transformers 3D Parallel Distributed Training
- Workflow:BerriAI Litellm Observability Integration
- Workflow:Apache Shardingsphere Metadata DDL Refresh
- Workflow:Alibaba ROLL Supervised Finetuning Pipeline
- Workflow:Testtimescaling Testtimescaling github io Automated Citation Tracking
Principles
- Principle:EvolvingLMMs Lab Lmms eval Experiment Tracking
- Principle:Treeverse LakeFS Branch Creation
- Principle:DataTalksClub Data engineering zoomcamp Environment Setup
- Principle:Cleanlab Cleanlab Object Detection Issue Filtering
- Principle:Huggingface Trl PPO Model Saving and Evaluation
- Principle:Ggml org Llama cpp Diffusion Text Generation
- Principle:Langchain ai Langgraph Remote Client Connection
- Principle:OpenRLHF OpenRLHF Knowledge Distillation Training
- Principle:Mistralai Client python Chat Completion
- Principle:ThreeSR Awesome Inference Time Scaling Repository Forking
Implementations
- Implementation:Apache Flink Mapreduce HadoopInputFormatBase
- Implementation:Facebookresearch Habitat lab ResetArmSkill
- Implementation:Fastai Fastbook Tabular Learner
- Implementation:NVIDIA TransformerEngine JAX Activation
- Implementation:Microsoft Onnxruntime Sklearn Model Training
- Implementation:Rapidsai Cuml FIL Exceptions
- Implementation:Elevenlabs Elevenlabs python MusicCustomClient
- Implementation:Apache Flink WritableTypeInfo
- Implementation:Deepspeedai DeepSpeed DeepSpeedInferenceConfig Init
- Implementation:DistrictDataLabs Yellowbrick FrequencyVisualizer
Heuristics
- Heuristic:Google deepmind Dm control Rendering Backend Selection Tips
- Heuristic:Nightwatchjs Nightwatch Safari Parallel Limitation
- Heuristic:Bigscience workshop Petals Prompt Embeddings Float32 Precision
- Heuristic:CrewAIInc CrewAI Rate Limiting Strategy
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Identity Loss Color Preservation
- Heuristic:Apache Shardingsphere Version Cleanup After Switch
- Heuristic:Danijar Dreamerv3 Percentile Return Normalization
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Teamcapybara Capybara Async Waiting And Retry
- Heuristic:Explodinggradients Ragas Retry And Backoff Configuration
Environments
- Environment:Openai Openai agents python MCP Dependencies
- Environment:Google deepmind Mujoco MJX Warp CUDA Environment
- Environment:Ucbepic Docetl Python Runtime
- Environment:Eric mitchell Direct preference optimization PyTorch CUDA
- Environment:ClickHouse ClickHouse Systemd Runtime
- Environment:Marker Inc Korea AutoRAG Vector Database Backends
- Environment:Apache Hudi Flink Runtime Environment
- Environment:Google deepmind Mujoco MJX JAX Environment
- Environment:PacktPublishing LLM Engineers Handbook AWS SageMaker GPU Environment
- Environment:Openclaw Openclaw Docker Deployment Environment