Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Lance format Lance Vector Search Pipeline
- Workflow:Fede1024 Rust rdkafka Produce Consume Roundtrip
- Workflow:Langgenius Dify Knowledge Base Creation
- Workflow:PeterL1n BackgroundMattingV2 Realtime webcam matting
- Workflow:Bitsandbytes foundation Bitsandbytes FSDP QLoRA Distributed Training
- Workflow:Risingwavelabs Risingwave Docker Deployment
- Workflow:Predibase Lorax Structured JSON Output
- Workflow:Apache Paimon Table Read Write
- Workflow:MaterializeInc Materialize Upgrade Testing
- Workflow:Hpcaitech ColossalAI Model Evaluation
Principles
- Principle:Cleanlab Cleanlab Issue Retrieval
- Principle:Openclaw Openclaw Routing Verification
- Principle:Langfuse Langfuse OTel Ingestion Post Processing
- Principle:Confident ai Deepeval OpenAI Agents Instrumentation
- Principle:Recommenders team Recommenders ALS Recommendation Generation
- Principle:Neuml Txtai Prompt Engineering
- Principle:Open compass VLMEvalKit Environment Setup
- Principle:Shiyu coder Kronos Tokenizer Encoding
- Principle:Spotify Luigi MapReduce Processing
- Principle:AnswerDotAI RAGatouille Index Loading
Implementations
- Implementation:SeleniumHQ Selenium GeckoDriverService
- Implementation:Duckdb Duckdb Mbedtls PK
- Implementation:Onnx Onnx Save Model
- Implementation:Google research Deduplicate text datasets Finish Dedup Wiki40b
- Implementation:Apache Beam JobServicePipelineResult
- Implementation:Pyro ppl Pyro BlockMessenger
- Implementation:Apache Druid Compaction Dynamic Config Completions
- Implementation:Ucbepic Docetl MOAR SearchUtils
- Implementation:Guardrails ai Guardrails AsyncValidatorService
- Implementation:Microsoft LoRA Mark Only LoRA Trainable
Heuristics
- Heuristic:Google deepmind Mujoco MJX Benchmarking Tips
- Heuristic:Arize ai Phoenix Warning Deprecated HallucinationEvaluator
- Heuristic:Nightwatchjs Nightwatch Timeout And Retry Tuning
- Heuristic:Deepseek ai Janus CFG Weight Tuning
- Heuristic:Iterative Dvc Shell Execution Pitfalls
- Heuristic:Getgauge Taiko Implicit Wait Tuning
- Heuristic:Heibaiying BigData Notes HBase Connection Thread Safety Tip
- Heuristic:Pola rs Polars Collect All For Diverging Queries
- Heuristic:OpenGVLab InternVL Dynamic Resolution Tiling
- Heuristic:Huggingface Transformers Label Smoothing Multi Label Warning
Environments
- Environment:Nautechsystems Nautilus trader Asyncio Uvloop Event Loop
- Environment:Ggml org Ggml Vulkan GPU Environment
- Environment:Mlc ai Mlc llm TVM Runtime Environment
- Environment:Openai Openai node OpenAI API Credentials
- Environment:Haifengl Smile Native BLAS LAPACK ARPACK
- Environment:Apache Shardingsphere ZooKeeper Cluster Coordination
- Environment:Openai CLIP Python Dependencies
- Environment:Ggml org Ggml CUDA GPU Environment
- Environment:Huggingface Datasets SQL Dependencies
- Environment:DataTalksClub Data engineering zoomcamp Docker PostgreSQL Python Environment