Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mistralai Client python Text Embeddings
- Workflow:Astronomer Astronomer cosmos Kubernetes dbt execution
- Workflow:Ggml org Llama cpp Text Generation
- Workflow:PacktPublishing LLM Engineers Handbook RAG Inference
- Workflow:Dagster io Dagster Modal Serverless Pipeline
- Workflow:Kserve Kserve Canary Rollout Deployment
- Workflow:Puppeteer Puppeteer Cross Browser Automation
- Workflow:Neuml Txtai Semantic Search Pipeline
- Workflow:Iterative Dvc Pipeline Reproduction
- Workflow:LLMBook zh LLMBook zh github io Data Preprocessing Pipeline
Principles
- Principle:MarketSquare Robotframework browser Page Navigation and Interaction
- Principle:Kubeflow Pipelines Incremental Model Training
- Principle:Duckdb Duckdb Quantile Estimation
- Principle:SeldonIO Seldon core Pipeline Version Progression
- Principle:Pyro ppl Pyro No U Turn Sampling
- Principle:Truera Trulens Method Instrumentation
- Principle:Ollama Ollama Architecture Detection
- Principle:NVIDIA NeMo Aligner Supervised Training Loop
- Principle:Vllm project Vllm Multimodal Generation
- Principle:Datahub project Datahub Java Client Initialization
Implementations
- Implementation:Mlflow Mlflow User Training Code
- Implementation:Openai Openai python Shared Function Definition
- Implementation:CrewAIInc CrewAI MongoDB Vector Search Tool
- Implementation:Datahub project Datahub Scheduled Ingestion Orchestration
- Implementation:Open compass VLMEvalKit Get Score Gen Table
- Implementation:Scikit learn Scikit learn ParameterGrid Init
- Implementation:Cohere ai Cohere python ChatMessage Model
- Implementation:Datajuicer Data juicer VideoDepthEstimationMapper
- Implementation:Langfuse Langfuse Dataset Items Repository
- Implementation:Puppeteer Puppeteer NgSchematics Packages Util
Heuristics
- Heuristic:Truera Trulens Rate Limiting And Retry Strategy
- Heuristic:Predibase Lorax LoRA Kernel Selection By Rank
- Heuristic:Infiniflow Ragflow Hybrid Search Fallback Strategy
- Heuristic:Isaac sim IsaacGymEnvs DR Setup Only Flag
- Heuristic:Dotnet Machinelearning Tokenizer Caching Strategy
- Heuristic:Recommenders team Recommenders Test Timing Budgets
- Heuristic:ChenghaoMou Text dedup Bloom Filter Single Process
- Heuristic:Run llama Llama index Worker Count Configuration
- Heuristic:LLMBook zh LLMBook zh github io Reward Model LM Regularization
- Heuristic:Bentoml BentoML Warning Deprecated Server Module
Environments
- Environment:Alibaba MNN GPU OpenCL Environment
- Environment:ARISE Initiative Robosuite MuJoCo Python
- Environment:Openai Openai agents python Voice Dependencies
- Environment:Haotian liu LLaVA OpenAI API Evaluation Environment
- Environment:Langchain ai Langchain OpenAI API Credentials
- Environment:Sgl project Sglang Grafana
- Environment:Google research Deduplicate text datasets Python HuggingFace Environment
- Environment:Microsoft Semantic kernel ONNX CUDA Environment
- Environment:Allenai Open instruct Python 3 12 Runtime
- Environment:Run llama Llama index Sentence Transformers Finetuning