Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Roboflow Rf detr ONNX Export
- Workflow:Allenai Open instruct DPO Preference Tuning
- Workflow:CrewAIInc CrewAI Sequential Crew Execution
- Workflow:Dotnet Machinelearning GenAI Causal LM Inference
- Workflow:Protectai Llm guard Scanner Benchmarking
- Workflow:Lakeraai Pint benchmark Custom System Evaluation
- Workflow:Speechbrain Speechbrain Speech Enhancement Training
- Workflow:Apache Airflow Provider Distribution Development
- Workflow:Heibaiying BigData Notes Spark SQL Data Analysis
- Workflow:Apache Kafka Release Candidate Staging
Principles
- Principle:Infiniflow Ragflow Infrastructure Orchestration
- Principle:Spotify Luigi Database Connection Configuration
- Principle:Alibaba MNN Diffusion ONNX Export
- Principle:Ray project Ray Deployment Class Definition
- Principle:Huggingface Trl SFT Argument Configuration
- Principle:Axolotl ai cloud Axolotl Preference Dataset Preparation
- Principle:Vespa engine Vespa Indexing Error Handling
- Principle:Haifengl Smile Model Serialization
- Principle:Microsoft Semantic kernel Plugin Registration
- Principle:Dotnet Machinelearning Feature Engineering
Implementations
- Implementation:Zai org CogVideo MagViT2 Tokenizer
- Implementation:Testtimescaling Testtimescaling github io Git Commit Push Workflow
- Implementation:DataTalksClub Data engineering zoomcamp Pandas Chunked CSV Loading
- Implementation:Apache Paimon BitmapDeletionVector
- Implementation:Zai org CogVideo CogVideoXPipeline Call
- Implementation:Mage ai Mage ai Google Ads Streams
- Implementation:Treeverse LakeFS Java SDK Model PrepareGCUncommittedResponse
- Implementation:Sdv dev SDV GaussianCopulaSynthesizer Init
- Implementation:Treeverse LakeFS Java SDK Model PresignMultipartUpload
- Implementation:Haosulab ManiSkill RoboCasaCounter
Heuristics
- Heuristic:Romsto Speculative Decoding Shared Tokenizer Requirement
- Heuristic:Tencent Ncnn Letterbox Vs Direct Resize
- Heuristic:Vibrantlabsai Ragas Analytics Silent Failure Pattern
- Heuristic:Scikit learn Scikit learn Working Memory Tuning
- Heuristic:Openclaw Openclaw Warning Suppression For Known Deprecations
- Heuristic:Deepseek ai Janus Image Generation Prompt Tips
- Heuristic:Ucbepic Docetl Token Counting And Truncation
- Heuristic:OWASP Www project top 10 for large language model applications Warning Deprecated Markdown To PDF Convert
- Heuristic:Onnx Onnx Big Endian Byte Order Handling
- Heuristic:Ggml org Llama cpp GPU Layer Offloading Verification
Environments
- Environment:Dagster io Dagster Container Resource Monitoring
- Environment:OpenHands OpenHands Third Party Runtime Credentials
- Environment:CrewAIInc CrewAI Optional Provider Dependencies
- Environment:Nautechsystems Nautilus trader Python Cython Rust Runtime
- Environment:Datahub project Datahub Frontend Build
- Environment:Explodinggradients Ragas Google Drive Backend Environment
- Environment:Mlc ai Mlc llm Python Serving Environment
- Environment:Mlc ai Web llm Node Build Toolchain
- Environment:Elevenlabs Elevenlabs python PyAudio
- Environment:FMInference FlexLLMGen HuggingFace Access