Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Mlc ai Mlc llm Python Engine Inference
- Workflow:Zai org CogVideo Video Editing DDIM Inversion
- Workflow:Risingwavelabs Risingwave Iceberg Lakehouse Ingestion
- Workflow:Haifengl Smile Data Loading Pipeline
- Workflow:Explodinggradients Ragas Test Data Generation
- Workflow:Iterative Dvc Experiment Tracking
- Workflow:SeldonIO Seldon core AB Testing Experiment
- Workflow:Isaac sim IsaacGymEnvs Factory Assembly Training
- Workflow:Haifengl Smile SQL Analytics Pipeline
- Workflow:ARISE Initiative Robosuite Teleoperation
Principles
- Principle:FMInference FlexLLMGen Model Replacement Policy
- Principle:CARLA simulator Carla A Star Route Planning
- Principle:Volcengine Verl Data Preparation For RL
- Principle:OWASP Www project top 10 for large language model applications Translation Publication
- Principle:Apache Shardingsphere YAML Deserialization
- Principle:Fede1024 Rust rdkafka Manual Offset Management
- Principle:LaurentMazare Tch rs FFI Collection Traits
- Principle:Treeverse LakeFS Java SDK API Operations
- Principle:Huggingface Datasets DatasetDict Hub Upload
- Principle:Duckdb Duckdb Benchmark Discovery
Implementations
- Implementation:Infiniflow Ragflow FilesTable Component
- Implementation:Huggingface Alignment handbook Get Model Quantized
- Implementation:Online ml River NaiveBayes GaussianNB
- Implementation:Predibase Lorax Galactica
- Implementation:Facebookresearch Habitat lab Datasets download rearrangement
- Implementation:Online ml River Sketch HeavyHitters
- Implementation:Kserve Kserve Revision Status Propagation
- Implementation:Ggml org Llama cpp Android AI Chat JNI
- Implementation:Lance format Lance Java SQBuildParams
- Implementation:Avhz RustQuant HoLee
Heuristics
- Heuristic:DevExpress Testcafe Video Encoding Defaults
- Heuristic:Datajuicer Data juicer Operator Fusion Rules
- Heuristic:Romsto Speculative Decoding Shared Tokenizer Requirement
- Heuristic:Obss Sahi Match Threshold Tuning
- Heuristic:Scikit learn Scikit learn Feature Scaling Numerical Stability
- Heuristic:LMCache LMCache Health Monitor Thresholds
- Heuristic:Huggingface Alignment handbook DPO Beta Selection
- Heuristic:Mlc ai Mlc llm BLAS Dispatch Decision
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Run llama Llama index Chunk Size Optimization
Environments
- Environment:Bentoml BentoML NVIDIA GPU Resource
- Environment:Datahub project Datahub Java 17 Backend Environment
- Environment:Diagram of thought Diagram of thought Python Graph Libraries
- Environment:Alibaba MNN HuggingFace Ecosystem Environment
- Environment:PacktPublishing LLM Engineers Handbook Selenium Chrome Crawler Environment
- Environment:FlagOpen FlagEmbedding Finetuning Environment
- Environment:Alibaba MNN GPU CUDA Environment
- Environment:Google research Deduplicate text datasets Rust Cargo Build Environment
- Environment:BerriAI Litellm Observability Stack
- Environment:Openai Whisper Triton