Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Cleanlab Cleanlab Object Detection Label Quality
- Workflow:Neuml Txtai API Deployment
- Workflow:Bentoml BentoML BentoCloud Deployment
- Workflow:Open compass VLMEvalKit API Model Evaluation
- Workflow:Trailofbits Fickling Safe ML Model Loading
- Workflow:Apache Airflow DAG Authoring and Deployment
- Workflow:Risingwavelabs Risingwave Iceberg Lakehouse Ingestion
- Workflow:Recommenders team Recommenders ALS Spark Recommendation
- Workflow:DataTalksClub Data engineering zoomcamp dlt Data Ingestion
- Workflow:Neuml Txtai Semantic Search
Principles
- Principle:BerriAI Litellm Logging Payload Construction
- Principle:Mbzuai oryx Awesome LLM Post training Trend Visualization
- Principle:Openai Openai node Training Data Upload
- Principle:Heibaiying BigData Notes HBase Connection Configuration
- Principle:Togethercomputer Together python Batch Job Monitoring
- Principle:Apache Kafka Maven Artifact Publishing
- Principle:FMInference FlexLLMGen Checkpoint Format Abstraction
- Principle:Hiyouga LLaMA Factory Sequence Packing Theory
- Principle:Lucidrains X transformers Iterative Masked Generation
- Principle:Huggingface Datasets Abstract Dataset IO
Implementations
- Implementation:Ucbepic Docetl Count Tokens
- Implementation:Scikit learn Scikit learn Make Pipeline
- Implementation:Apache Beam DirectPipelineResult
- Implementation:Ggml org Ggml Cann backend
- Implementation:Mlc ai Mlc llm Router Translate request
- Implementation:Deepset ai Haystack Logging Configuration
- Implementation:OpenGVLab InternVL InternLM2Tokenizer
- Implementation:Facebookresearch Habitat lab Core Utils
- Implementation:Deepspeedai DeepSpeed HybridEngine Generate
- Implementation:Pyro ppl Pyro SV DKL
Heuristics
- Heuristic:Spcl Graph of thoughts GoT Decompose Sort Merge Strategy
- Heuristic:Infiniflow Ragflow Hybrid Search Fallback Strategy
- Heuristic:Openclaw Openclaw Warning Suppression For Known Deprecations
- Heuristic:Sail sg LongSpec NCCL Distributed Settings
- Heuristic:Intel Ipex llm LoRA Target All Linear Layers
- Heuristic:Bentoml BentoML Worker Count Strategy
- Heuristic:NVIDIA NeMo Curator Deduplication Blocksize Tuning
- Heuristic:OWASP Www project top 10 for large language model applications Vulnerability Entry Structure Guide
- Heuristic:TobikoData Sqlmesh Model Change Categorization
- Heuristic:Mit han lab Llm awq Skip QK Projection Clipping
Environments
- Environment:Datahub project Datahub Python 3 10 Ingestion Environment
- Environment:Datajuicer Data juicer LLM API Credentials Environment
- Environment:Bitsandbytes foundation Bitsandbytes ROCm AMD Environment
- Environment:Sgl project Sglang GitHub Actions
- Environment:CrewAIInc CrewAI Python Runtime Environment
- Environment:Sgl project Sglang CUDA Runtime
- Environment:Allenai Open instruct Python 3 12 Runtime
- Environment:PacktPublishing LLM Engineers Handbook Docker MongoDB Qdrant Infrastructure
- Environment:Openai Whisper Numba
- Environment:Vespa engine Vespa Java 17 Build Runtime