Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Datahub project Datahub Spark Lineage Capture
- Workflow:PacktPublishing LLM Engineers Handbook RAG Inference
- Workflow:Openai Openai node Audio Processing
- Workflow:ARISE Initiative Robomimic Dataset Preparation Pipeline
- Workflow:Mage ai Mage ai Destination Data Loading
- Workflow:OpenRLHF OpenRLHF DPO Training
- Workflow:Sdv dev SDV Multi table synthesis
- Workflow:VainF Torch Pruning Vision Transformer Pruning
- Workflow:Princeton nlp Tree of thought llm Baseline comparison
- Workflow:Nautechsystems Nautilus trader Data loading and cataloging
Principles
- Principle:Apache Flink Split Based Record Reading
- Principle:Openai Whisper Word Level Subtitle Output
- Principle:Ray project Ray Request Routing And Handling
- Principle:Elevenlabs Elevenlabs python Batch Speech to Text
- Principle:Tensorflow Serving Canary Deployment
- Principle:BerriAI Litellm Error Handling
- Principle:Zai org CogVideo Image Conditioning Preparation
- Principle:Microsoft DeepSpeedExamples ZeRO3 CPU Offload Training
- Principle:Unslothai Unsloth Sentence Embedding Finetuning
- Principle:Bigscience workshop Petals Data Preparation
Implementations
- Implementation:Datahub project Datahub FieldPath Schematron
- Implementation:Huggingface Transformers Quantization Verification Pattern
- Implementation:Datahub project Datahub MetadataResponseFuture
- Implementation:Huggingface Datasets JaxFormatter
- Implementation:Lm sys FastChat Topic Clustering
- Implementation:Protectai Llm guard Output EmotionDetection
- Implementation:Lance format Lance DictEncoding
- Implementation:CrewAIInc CrewAI Couchbase Vector Search Tool
- Implementation:Apache Airflow TaskInstance Listener Spec
- Implementation:Open compass VLMEvalKit MEGABench Answer Str Parse
Heuristics
- Heuristic:HKUDS AI Trader Linear Retry Backoff
- Heuristic:SeleniumHQ Selenium FindElements For Absence Check
- Heuristic:Cleanlab Cleanlab Multiprocessing Platform Strategy
- Heuristic:Mlflow Mlflow Prompt Cache Tuning
- Heuristic:FMInference FlexLLMGen Sequence Length Alignment
- Heuristic:Microsoft LoRA LoRA Init Strategy
- Heuristic:Kserve Kserve VLLM GPU Memory Utilization
- Heuristic:Microsoft Onnxruntime Graph Optimization Level Selection
- Heuristic:Google deepmind Dm control Warning Deprecated Legacy Base Walker
- Heuristic:Hiyouga LLaMA Factory Mixed Precision Training Tips
Environments
- Environment:Vllm project Vllm Python Dependencies
- Environment:BerriAI Litellm Observability Stack
- Environment:Marker Inc Korea AutoRAG Korean NLP Dependencies
- Environment:Openai Openai node Node 20 Runtime
- Environment:Helicone Helicone Node 20 TypeScript Runtime
- Environment:Farama Foundation Gymnasium Box2D Physics Backend
- Environment:Haifengl Smile Quarkus Serve Environment
- Environment:Ggml org Llama cpp Metal GPU Environment
- Environment:Roboflow Rf detr Python GPU Environment
- Environment:Astronomer Astronomer cosmos Python Airflow Runtime