Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search best practices and skills for ML/AI
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mlc ai Web llm Basic Chat Completion
- Workflow:Google deepmind Mujoco Interactive simulation
- Workflow:Liu00222 Open Prompt Injection DataSentinel Detection
- Workflow:Googleapis Python genai Text Content Generation
- Workflow:Facebookresearch Audiocraft MusicGen Text To Music Inference
- Workflow:Langgenius Dify Plugin Management
- Workflow:Fede1024 Rust rdkafka Produce Consume Roundtrip
- Workflow:Intel Ipex llm RAG With LangChain
- Workflow:Mlc ai Web llm Web Worker Deployment
- Workflow:Avhz RustQuant Stochastic Process Simulation
Principles
- Principle:InternLM Lmdeploy API Client Integration
- Principle:Arize ai Phoenix Experiment Execution
- Principle:Online ml River Time Series Evaluation
- Principle:Neuml Txtai Sparse Retrieval
- Principle:CrewAIInc CrewAI Crew Integration In Flow
- Principle:ARISE Initiative Robomimic Checkpointing and Model Saving
- Principle:Apache Flink Hybrid Source Processing
- Principle:Pytorch Serve Streaming Cloud Inference
- Principle:Microsoft Autogen Result Aggregation
- Principle:Nautechsystems Nautilus trader Backtest Engine Configuration
Implementations
- Implementation:PacktPublishing LLM Engineers Handbook HuggingFace Load Dataset
- Implementation:Online ml River Stream Iter Arff
- Implementation:BerriAI Litellm Managed Files
- Implementation:Google deepmind Dm control Manipulation Load
- Implementation:TobikoData Sqlmesh Context Invalidate Environment
- Implementation:Openai Openai node Node CJS Auto Lockfile
- Implementation:Pyro ppl Pyro PackedTensor
- Implementation:Fede1024 Rust rdkafka MockCluster Create Topic
- Implementation:Promptfoo Promptfoo Google Sheets Integration
- Implementation:Lucidrains X transformers EntropyBasedTokenizer
Heuristics
- Heuristic:Deepset ai Haystack Pipeline Max Runs Safety Limit
- Heuristic:TobikoData Sqlmesh Execution Time Caching
- Heuristic:Gretelai Gretel synthetics Binary Encoder Cutoff
- Heuristic:Langchain ai Langchain Pydantic V2 Configuration Tips
- Heuristic:ArroyoSystems Arroyo Async UDF Concurrency
- Heuristic:Scikit learn contrib Imbalanced learn Sampling Before Split Leakage
- Heuristic:Openai Openai python Structured Output Strict Schema
- Heuristic:Scikit learn contrib Imbalanced learn KNeighbors Selection Tips
- Heuristic:Pyro ppl Pyro Numerical Stability Patterns
- Heuristic:Tensorflow Tfjs Backend Selection Strategy
Environments
- Environment:Googleapis Python genai Vertex AI Service Account
- Environment:Triton inference server Server TRT LLM Deployment
- Environment:DataExpert io Data engineer handbook Flink Kafka Docker Environment
- Environment:Vllm project Vllm Python
- Environment:Alibaba ROLL Python Runtime Environment
- Environment:Lakeraai Pint benchmark Python 310 With Transformers
- Environment:Togethercomputer Together python Fine Tuning Data Requirements
- Environment:Langgenius Dify Python Backend Environment
- Environment:Nautechsystems Nautilus trader Python Cython Rust Runtime
- Environment:DataTalksClub Data engineering zoomcamp Kafka Confluent Environment