Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Bitsandbytes foundation Bitsandbytes 8bit Optimizer Training
- Workflow:Vespa engine Vespa Logging framework initialization
- Workflow:Datajuicer Data juicer Dataset Quality Analysis
- Workflow:Huggingface Datatrove FineWeb Dataset Creation
- Workflow:Facebookresearch Habitat lab HITL Interactive Evaluation
- Workflow:Huggingface Trl Supervised Finetuning
- Workflow:Run llama Llama index Data Ingestion Pipeline
- Workflow:Infiniflow Ragflow Search Application Setup
- Workflow:Rapidsai Cuml Dimensionality Reduction
- Workflow:Cleanlab Cleanlab Object Detection Label Quality
Principles
- Principle:OpenBMB UltraFeedback Score Validation and Correction
- Principle:Mistralai Client python File Upload
- Principle:Tensorflow Serving Resource Arithmetic
- Principle:Facebookresearch Audiocraft Audio Generation Evaluation
- Principle:Langchain ai Langchain Rate Limiting
- Principle:Puppeteer Puppeteer CDP Session Management
- Principle:Groq Groq python Chat Response Parsing
- Principle:NVIDIA NeMo Aligner SFT Data Preparation
- Principle:Allenai Open instruct Preference Data Processing
- Principle:Triton inference server Server Tracing Testing
Implementations
- Implementation:Mlfoundations Open flamingo Evaluate captioning
- Implementation:NVIDIA NeMo Curator RayActorPoolAdapter
- Implementation:Tensorflow Serving GKE Cluster Setup
- Implementation:Langgenius Dify PluginCredentialHooks
- Implementation:Hiyouga LLaMA Factory MCA Workflow
- Implementation:TA Lib Ta lib python CDL Signal Interpretation
- Implementation:Microsoft Playwright TraceV6
- Implementation:Predibase Lorax GPTQ Utils Exllamav2
- Implementation:Tencent Ncnn RVM Example
- Implementation:FMInference FlexLLMGen Policy
Heuristics
- Heuristic:Bitsandbytes foundation Bitsandbytes Blocksize Platform Defaults
- Heuristic:Avdvg InjectGuard Module Level Initialization
- Heuristic:Princeton nlp Tree of thought llm Duplicate Candidate Zeroing
- Heuristic:Volcengine Verl Sequence Length Balancing
- Heuristic:Snorkel team Snorkel Precision Init Prior
- Heuristic:Datajuicer Data juicer Checkpoint Resumption Strategy
- Heuristic:Apache Flink False Positive Availability Optimization
- Heuristic:Romsto Speculative Decoding Shared Tokenizer Requirement
- Heuristic:Tencent Ncnn Letterbox Vs Direct Resize
- Heuristic:Liu00222 Open Prompt Injection PPL Threshold Tuning
Environments
- Environment:ClickHouse ClickHouse Systemd Runtime
- Environment:Roboflow Rf detr ONNX Export Environment
- Environment:NVIDIA NeMo Aligner TensorRT LLM Acceleration Environment
- Environment:Lucidrains X transformers PyTorch CUDA
- Environment:Isaac sim IsaacGymEnvs Python CUDA Runtime
- Environment:Huggingface Datatrove S3 Storage Environment
- Environment:Run llama Llama index Fsspec Remote Storage
- Environment:Openai Whisper Triton
- Environment:ClickHouse ClickHouse CI Docker Environment
- Environment:NVIDIA DALI TensorFlow Environment