Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DataTalksClub Data engineering zoomcamp Kafka Stream Processing
- Workflow:Puppeteer Puppeteer PDF Generation
- Workflow:Risingwavelabs Risingwave Sink Connector Pipeline
- Workflow:Huggingface Datasets Dataset Preprocessing
- Workflow:Microsoft Onnxruntime Train Convert Predict
- Workflow:LLMBook zh LLMBook zh github io Supervised Finetuning
- Workflow:Lance format Lance Table Optimization
- Workflow:Bitsandbytes foundation Bitsandbytes FSDP QLoRA Distributed Training
- Workflow:Microsoft Onnxruntime On Device Training
- Workflow:Infiniflow Ragflow Agent Workflow Building
Principles
- Principle:Openai Openai agents python RunState Serialization
- Principle:EvolvingLMMs Lab Lmms eval Dependency Management
- Principle:Ggml org Llama cpp JinjaTemplateEngine
- Principle:Vibrantlabsai Ragas LLM Configuration
- Principle:Neuml Txtai Late Interaction Retrieval
- Principle:MaterializeInc Materialize Release Prerequisite Verification
- Principle:Infiniflow Ragflow Search Result Display
- Principle:Ollama Ollama ImageGeneration
- Principle:Arize ai Phoenix Prompt Versioning
- Principle:Huggingface Datasets Disk Persistence
Implementations
- Implementation:Langfuse Langfuse Seeder Orchestrator
- Implementation:Hpcaitech ColossalAI Zero Bubble GRPOConsumer
- Implementation:NVIDIA TransformerEngine PyTorch Quantizer Cpp
- Implementation:Arize ai Phoenix Prompts Create
- Implementation:SeleniumHQ Selenium Colors
- Implementation:DistrictDataLabs Yellowbrick CVScores Visualizer
- Implementation:FlowiseAI Flowise CreateDocumentStore
- Implementation:Guardrails ai Guardrails Schema Validator
- Implementation:CARLA simulator Carla LaneSection
- Implementation:Sktime Pytorch forecasting QuantileLoss
Heuristics
- Heuristic:Trailofbits Fickling Race Condition Prevention
- Heuristic:Mbzuai oryx Awesome LLM Post training Paper Deduplication Via Dict
- Heuristic:Shiyu coder Kronos Learning Rate And Optimizer Tuning
- Heuristic:Openai Openai python Fine Tuning Data Preparation Tips
- Heuristic:Predibase Lorax Quantization Backend Selection
- Heuristic:LaurentMazare Tch rs Safetensors Format Preference
- Heuristic:Groq Groq python Streaming Usage Stats
- Heuristic:VainF Torch Pruning AutoGrad Dependency Graph
- Heuristic:Openclaw Openclaw Cache TTL Asymmetric Strategy
- Heuristic:MarketSquare Robotframework browser GRPC Response Chunking
Environments
- Environment:Togethercomputer Together python API Credentials
- Environment:Lance format Lance Python Environment
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:Intel Ipex llm CPU Finetuning Environment
- Environment:Volcengine Verl CUDA GPU Environment
- Environment:PacktPublishing LLM Engineers Handbook Python 3 11 Poetry Environment
- Environment:Dotnet Machinelearning ONNX Runtime Environment
- Environment:Testtimescaling Testtimescaling github io Semantic Scholar API
- Environment:DataExpert io Data engineer handbook Python Development Environment
- Environment:Sgl project Sglang CUDA SM100