Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Getgauge Taiko Headless Browser Testing
- Workflow:Datahub project Datahub Protobuf Schema Ingestion
- Workflow:Online ml River Drift Adaptive Classification
- Workflow:Unstructured IO Unstructured Chunking And Embedding
- Workflow:Pytorch Serve LLM Deployment vLLM
- Workflow:Kornia Kornia Edge Detection Pipeline
- Workflow:Junyanz Pytorch CycleGAN and pix2pix Pix2pix Training
- Workflow:Facebookresearch Habitat lab Custom Task Extension
- Workflow:Sdv dev SDV Sequential data synthesis
- Workflow:Lakeraai Pint benchmark Custom Dataset Benchmarking
Principles
- Principle:Ollama Ollama GGUF Model Conversion Mistral Causal
- Principle:Protectai Llm guard Input Scanner Factory Pattern
- Principle:Datajuicer Data juicer Column Wise Distribution Analysis
- Principle:Fastai Fastbook Activation Functions
- Principle:Huggingface Open r1 Synthetic Data Generation
- Principle:PacktPublishing LLM Engineers Handbook Document Persistence
- Principle:AUTOMATIC1111 Stable diffusion webui Embedding creation
- Principle:Alibaba MNN Input Preprocessing
- Principle:TobikoData Sqlmesh Environment Listing And Inspection
- Principle:Huggingface Datasets Streaming Skip
Implementations
- Implementation:Infiniflow Ragflow Document Util
- Implementation:Puppeteer Puppeteer Cdp Frame
- Implementation:Microsoft Autogen SchemaManager
- Implementation:TobikoData Sqlmesh LoadingIcon
- Implementation:Vllm project Vllm RequestOutput LoRA Access
- Implementation:Apache Druid ExpressionEditorDialog
- Implementation:Mlflow Mlflow Prompt Version Entity
- Implementation:Farama Foundation Gymnasium GAE Computation
- Implementation:SeleniumHQ Selenium Civetweb API
- Implementation:Apache Kafka KafkaAdminClient CreateTopics
Heuristics
- Heuristic:Google deepmind Mujoco Thread Pool Configuration
- Heuristic:Sgl project Sglang Schedule Conservativeness Tuning
- Heuristic:Confident ai Deepeval Secret Management Best Practices
- Heuristic:Elevenlabs Elevenlabs python Text Chunking Splitter Characters
- Heuristic:ThreeSR Awesome Inference Time Scaling Empty Venue Default Tip
- Heuristic:Apache Beam Executor Shutdown Ordering
- Heuristic:Mbzuai oryx Awesome LLM Post training API Rate Limit Retry Strategy
- Heuristic:Open compass VLMEvalKit Video Frame Sampling Configuration
- Heuristic:LLMBook zh LLMBook zh github io Greedy Decoding Temperature Zero
- Heuristic:Tensorflow Tfjs GPU Pipeline Data Residency
Environments
- Environment:Fede1024 Rust rdkafka Kafka Broker Runtime
- Environment:Vespa engine Vespa POSIX Mmap Log Control
- Environment:Duckdb Duckdb Extension Distribution Env
- Environment:Langchain ai Langchain OpenAI API Credentials
- Environment:Huggingface Datasets Search Dependencies
- Environment:Microsoft BIPIA DeepSpeed Finetuning Environment
- Environment:Dagster io Dagster Container Resource Monitoring
- Environment:Dagster io Dagster PostgreSQL Storage
- Environment:Spotify Luigi AWS S3 Storage
- Environment:Haosulab ManiSkill Motion Planning Deps