Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for building AI/ML software autonomously
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Openai Openai python Chat Completion
- Workflow:Scikit learn Scikit learn Hyperparameter Tuning
- Workflow:ARISE Initiative Robomimic Dataset Preparation Pipeline
- Workflow:DataTalksClub Data engineering zoomcamp Kafka Stream Processing
- Workflow:Pola rs Polars Lazy Query Pipeline
- Workflow:Teamcapybara Capybara Selenium Driver Configuration
- Workflow:Kubeflow Kubeflow Release Management
- Workflow:Kserve Kserve Multi Model Serving
- Workflow:EvolvingLMMs Lab Lmms eval Custom Model Integration
- Workflow:CARLA simulator Carla Traffic Generation
Principles
- Principle:ClickHouse ClickHouse DNS Resolution
- Principle:Spcl Graph of thoughts Ground Truth Evaluation
- Principle:AnswerDotAI RAGatouille Model Training
- Principle:MaterializeInc Materialize Pipeline YAML Generation
- Principle:SeleniumHQ Selenium WebDriver Session For CDP
- Principle:Triton inference server Server Protocol Endpoint Testing
- Principle:Avhz RustQuant Interpolation
- Principle:Apache Druid Partitioning Configuration
- Principle:Datajuicer Data juicer Operator Testing
- Principle:Bitsandbytes foundation Bitsandbytes LLM Int8 Linear Layer
Implementations
- Implementation:Scikit learn Scikit learn ClassicalMDS
- Implementation:Googleapis Python genai Common Utilities
- Implementation:Apache Kafka CoordinatorRuntime ScheduleWriteOperation
- Implementation:Huggingface Alignment handbook SFTTrainer Usage
- Implementation:Google deepmind Mujoco MJX Sensor
- Implementation:Liu00222 Open Prompt Injection PromptLocate locate and recover
- Implementation:Googleapis Python genai AutomaticFunctionCallingConfig Setup
- Implementation:Apache Dolphinscheduler DataSourceProcessorManager SPI Loading
- Implementation:Openai Openai node ZodToJsonSchema Entry
- Implementation:Confident ai Deepeval Evaluate Function
Heuristics
- Heuristic:Isaac sim IsaacGymEnvs DR Setup Only Flag
- Heuristic:Apache Spark K8s Container Patterns
- Heuristic:Tensorflow Serving Batching Thread Tuning
- Heuristic:NVIDIA DALI Memory Pool Tuning
- Heuristic:Farama Foundation Gymnasium Seeding Determinism Best Practices
- Heuristic:Romsto Speculative Decoding Ngram Order Selection
- Heuristic:Microsoft Playwright Browser Specific Workarounds
- Heuristic:Gretelai Gretel synthetics GPU Memory Allow Growth
- Heuristic:Ucbepic Docetl Rate Limit Exponential Backoff
- Heuristic:Infiniflow Ragflow Agent Max Rounds Strategy
Environments
- Environment:OpenHands OpenHands Third Party Runtime Credentials
- Environment:Unslothai Unsloth CUDA VLLM
- Environment:Langchain ai Langchain Unit Test Network Isolation
- Environment:Mit han lab Llm awq Python Runtime Environment
- Environment:Arize ai Phoenix LLM Provider SDKs
- Environment:Mlc ai Mlc llm CUDA GPU Environment
- Environment:Microsoft Autogen Extension Optional Dependencies
- Environment:Deepset ai Haystack GPU Device Environment
- Environment:Spotify Luigi Python Runtime
- Environment:Huggingface Datatrove Processing Dependencies