Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Sail sg LongSpec GLIDE Draft Model Training
- Workflow:Apache Paimon Data Ingestion With Ray Sink
- Workflow:Ggml org Llama cpp LoRA Adapter Workflow
- Workflow:Microsoft Semantic kernel Kernel Setup And Chat Completion
- Workflow:Apache Druid SQL Based Data Ingestion
- Workflow:Datajuicer Data juicer LLM Powered Data Generation
- Workflow:Dotnet Machinelearning AutoML Experiment
- Workflow:Google research Deduplicate text datasets Single file deduplication
- Workflow:Microsoft Onnxruntime Train Convert Predict
- Workflow:Wandb Weave Prompt Management
Principles
- Principle:Sgl project Sglang Parallel Branching Logic
- Principle:Apache Airflow Configuration Resolution
- Principle:Apache Spark Distribution Packaging
- Principle:Mlc ai Web llm Model Selection
- Principle:Rapidsai Cuml Cluster Model Fitting
- Principle:Deepseek ai Janus Autoregressive Text Generation
- Principle:FMInference FlexLLMGen DeepSpeed Package Build
- Principle:DataExpert io Data engineer handbook DataFrame Write To Table
- Principle:Explodinggradients Ragas Optimization Loss Functions
- Principle:ARISE Initiative Robosuite Manipulation Task Design
Implementations
- Implementation:Microsoft BIPIA Smart Tokenizer And Embedding Resize
- Implementation:Microsoft Onnxruntime OrtSession JNI
- Implementation:FlowiseAI Flowise ArrayRenderer
- Implementation:Teamcapybara Capybara Server Middleware
- Implementation:Infiniflow Ragflow TestingResult Component
- Implementation:Microsoft Playwright WkWorkers
- Implementation:Iterative Dvc Dependency Dataset
- Implementation:Huggingface Datasets Get Dataset Config Info
- Implementation:Speechbrain Speechbrain Train TimersAndSuch Decoupled
- Implementation:FlowiseAI Flowise OpenAIAssistantLayout
Heuristics
- Heuristic:Recommenders team Recommenders Test Timing Budgets
- Heuristic:Deepspeedai DeepSpeed Sequence Parallel PyTorch Version
- Heuristic:Huggingface Diffusers LoRA Safe Fusing
- Heuristic:ThreeSR Awesome Inference Time Scaling Empty Venue Default Tip
- Heuristic:Bentoml BentoML Adaptive Batching Tuning
- Heuristic:Anthropics Anthropic sdk python Retry Backoff Strategy
- Heuristic:Kornia Kornia Avoid Inplace Ops Compile
- Heuristic:Google deepmind Mujoco Mesh Quality For Collision
- Heuristic:Confident ai Deepeval Dotenv Loading Order
- Heuristic:ContextualAI HALOs Batch Size Divisibility
Environments
- Environment:Alibaba ROLL SGLang Inference Environment
- Environment:InternLM Lmdeploy Python Dependencies
- Environment:Microsoft LoRA NLG Eval External Tools
- Environment:Cypress io Cypress Browser Requirements
- Environment:Togethercomputer Together python Python SDK Runtime
- Environment:Microsoft DeepSpeedExamples VisualChat Training Environment
- Environment:MarketSquare Robotframework browser CI GitHub Actions
- Environment:Sgl project Sglang Grafana
- Environment:Deepspeedai DeepSpeed CUDA GPU Environment
- Environment:Lucidrains X transformers Python Environment