Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Heibaiying BigData Notes Hive Data Warehouse Operations
- Workflow:Openclaw Openclaw Initial Setup And Onboarding
- Workflow:OpenRLHF OpenRLHF Reward Model Training
- Workflow:DataExpert io Data engineer handbook PySpark Iceberg Job Execution
- Workflow:Langchain ai Langgraph Building a Stateful Graph
- Workflow:Langfuse Langfuse Evaluation pipeline
- Workflow:Apache Spark Release Process
- Workflow:PacktPublishing LLM Engineers Handbook RAG Inference
- Workflow:Microsoft LoRA NLU GLUE Finetuning
- Workflow:Treeverse LakeFS External Data Import
Principles
- Principle:Lm sys FastChat Causal LM Loading
- Principle:Apache Kafka Connect Distributed Invocation
- Principle:Unstructured IO Unstructured Basic Chunking
- Principle:Bitsandbytes foundation Bitsandbytes FP8 Linear Layer
- Principle:Ray project Ray CI Pipeline Configuration
- Principle:Avdvg InjectGuard Vector Store Construction
- Principle:Webdriverio Webdriverio Session Management
- Principle:Neuml Txtai API Security
- Principle:Mlfoundations Open flamingo Vision Conditioned Text Generation
- Principle:Huggingface Datasets Deprecation Management
Implementations
- Implementation:Ggml org Llama cpp Convert Llama2c To GGML
- Implementation:FlagOpen FlagEmbedding RetroMAE Modeling
- Implementation:Open compass VLMEvalKit Aria
- Implementation:Microsoft Onnxruntime CPU FakeQuant
- Implementation:CrewAIInc CrewAI ContextualAI Create Agent Tool
- Implementation:Tencent Ncnn YOLO11 Seg Example
- Implementation:Explodinggradients Ragas BleuScore Metric
- Implementation:NVIDIA TransformerEngine CPU Offload
- Implementation:Apache Flink Pool
- Implementation:Microsoft Onnxruntime CrossEntropy Declarations
Heuristics
- Heuristic:Openai Whisper Log Probability Threshold
- Heuristic:Huggingface Open r1 Test Batch Early Termination
- Heuristic:NVIDIA TransformerEngine Sequence Length Alignment
- Heuristic:NVIDIA DALI NVJPEG Memory Preallocation
- Heuristic:Liu00222 Open Prompt Injection PPL Threshold Tuning
- Heuristic:PrefectHQ Prefect Concurrency Limit Scoping
- Heuristic:Shiyu coder Kronos Two Stage Finetuning Strategy
- Heuristic:Promptfoo Promptfoo WAL Mode Network Filesystem
- Heuristic:Haotian liu LLaVA Gradient Checkpointing Memory Optimization
- Heuristic:Onnx Onnx External Data Path Security
Environments
- Environment:NVIDIA DALI TensorFlow Environment
- Environment:Datahub project Datahub Python 3 10 Ingestion Environment
- Environment:Eventual Inc Daft Ray Distributed Runner
- Environment:Hpcaitech ColossalAI ColossalChat Training Environment
- Environment:Kserve Kserve Istio Service Mesh
- Environment:Cohere ai Cohere python Python SDK Runtime
- Environment:Mlflow Mlflow Python Runtime Environment
- Environment:TobikoData Sqlmesh GitHub CICD Runner
- Environment:PrefectHQ Prefect Python Runtime Environment
- Environment:Guardrails ai Guardrails Python 3 10 Runtime