Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Tencent Ncnn Vulkan GPU Accelerated Inference
- Workflow:Vllm project Vllm Multi LoRA Serving
- Workflow:Neuml Txtai API Deployment
- Workflow:CrewAIInc CrewAI Knowledge RAG Pipeline
- Workflow:BerriAI Litellm Fine Tuning Job
- Workflow:ChenghaoMou Text dedup Suffix Array Deduplication
- Workflow:Datajuicer Data juicer LLM Powered Data Generation
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Recommenders team Recommenders ALS Spark Recommendation
- Workflow:Google deepmind Dm control MJCF Model Composition
Principles
- Principle:Facebookresearch Audiocraft Latent Decoding and Audio Output
- Principle:Axolotl ai cloud Axolotl LoRA Adapter Injection
- Principle:FlagOpen FlagEmbedding Finetuned Embedder Validation
- Principle:Apache Airflow Local Testing
- Principle:FlowiseAI Flowise Workspace Navigation
- Principle:Mlfoundations Open flamingo Distributed Checkpointing
- Principle:AUTOMATIC1111 Stable diffusion webui Resource Monitoring
- Principle:Langchain ai Langgraph Channel Types
- Principle:Datahub project Datahub Emitter Initialization
- Principle:Huggingface Trl SFT Argument Configuration
Implementations
- Implementation:Confident ai Deepeval Update LLM Span
- Implementation:Apache Shardingsphere DefaultDistributedLock Lock
- Implementation:Microsoft Semantic kernel Bing SiteFilter TestData
- Implementation:Obss Sahi Coco2fiftyone
- Implementation:Duckdb Duckdb Setup Ubuntu Build Env
- Implementation:CARLA simulator Carla RecurrentSharedFuture
- Implementation:Pytorch Serve Marsgen
- Implementation:Google deepmind Mujoco Simulate Header
- Implementation:Apache Dolphinscheduler RpcService Annotation
- Implementation:Facebookresearch Audiocraft SampleManager
Heuristics
- Heuristic:Huggingface Optimum Device Offload Constraints
- Heuristic:PeterL1n BackgroundMattingV2 Checkpoint Interval Tuning
- Heuristic:Astronomer Astronomer cosmos Watcher Queue Sizing
- Heuristic:Lance format Lance Encoding Compression Thresholds
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Openai Whisper Median Word Duration Clamping
- Heuristic:ARISE Initiative Robomimic Data Worker Tuning By Modality
- Heuristic:AUTOMATIC1111 Stable diffusion webui NaN Detection And Precision Fixes
- Heuristic:PeterL1n BackgroundMattingV2 Backbone Scale Selection
- Heuristic:Protectai Llm guard ONNX Runtime Optimization
Environments
- Environment:Deepset ai Haystack OpenAI API Environment
- Environment:Astronomer Astronomer cosmos Cosmos Airflow Configuration
- Environment:Scikit learn Scikit learn OpenMP Thread Configuration
- Environment:Datahub project Datahub Python 3 10 Ingestion Environment
- Environment:Triton inference server Server TRT LLM Deployment
- Environment:EvolvingLMMs Lab Lmms eval Server Mode Environment
- Environment:Haotian liu LLaVA Python CUDA Training Environment
- Environment:FMInference FlexLLMGen CUDA GPU
- Environment:Microsoft Onnxruntime Sklearn Conversion Environment
- Environment:Mlfoundations Open flamingo Evaluation Dependencies