Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki: best practices and expert-level knowledge for machine learning and data engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ggml org Llama cpp HF to GGUF Model Conversion
- Workflow:Lucidrains X transformers DPO Preference Alignment
- Workflow:Openai Openai python Responses API Text Generation
- Workflow:Lm sys FastChat LoRA QLoRA Finetuning
- Workflow:Huggingface Alignment handbook Multi Stage Post Training
- Workflow:Vespa engine Vespa Config subscription lifecycle
- Workflow:SeldonIO Seldon core Model Explainability
- Workflow:Cleanlab Cleanlab Object Detection Label Quality
- Workflow:PacktPublishing LLM Engineers Handbook Digital Data ETL
- Workflow:Openai Openai node Streaming To Client
Principles
- Principle:Ggml org Llama cpp GGUFFormat
- Principle:EvolvingLMMs Lab Lmms eval Results Retrieval
- Principle:ARISE Initiative Robosuite IK Solver Utilities
- Principle:Online ml River Online Optimizers
- Principle:Onnx Onnx Output Dimension Expansion
- Principle:Microsoft Semantic kernel RAG Chat Augmentation
- Principle:MarketSquare Robotframework browser Test Cleanup and Failure Handling
- Principle:Microsoft Agent framework Chat Client Configuration
- Principle:Hpcaitech ColossalAI Document Loading
- Principle:DistrictDataLabs Yellowbrick Parallel Coordinates Visualization
Implementations
- Implementation:Datahub project Datahub Proto2DataHub
- Implementation:Speechbrain Speechbrain Hparams Switchboard Seq2Seq
- Implementation:Microsoft Autogen Studio MCP Create Modal
- Implementation:SeleniumHQ Selenium DevTools AddListener
- Implementation:Openai Openai python Image Create Variation Params
- Implementation:Open compass VLMEvalKit Spotting Metric
- Implementation:BerriAI Litellm Get Model Cost Map
- Implementation:Junyanz Pytorch CycleGAN and pix2pix Visualizer
- Implementation:Openai Openai python Response Output Text Annotation Added
- Implementation:Microsoft Onnxruntime Npm Install Onnxruntime
Heuristics
- Heuristic:Onnx Onnx External Data Path Security
- Heuristic:LLMBook zh LLMBook zh github io BF16 Mixed Precision Default
- Heuristic:Googleapis Python genai LRO Polling Backoff
- Heuristic:SeleniumHQ Selenium Warning Deprecated Proxy FTP Methods
- Heuristic:LaurentMazare Tch rs Hidden Dimension Alignment
- Heuristic:LLMBook zh LLMBook zh github io Deduplication Ngram Threshold
- Heuristic:Apache Kafka Unknown Record Type Upgrade Safety
- Heuristic:Apache Shardingsphere DDL Refresher Superclass Fallback
- Heuristic:Unslothai Unsloth Padding Free Packing
- Heuristic:Apache Druid Cluster Health Diagnostic Thresholds
Environments
- Environment:Vespa engine Vespa Java 17 Build Runtime
- Environment:CARLA simulator Carla Simulation Runtime
- Environment:Speechbrain Speechbrain Speech Enhancement Dependencies
- Environment:Datahub project Datahub Frontend Build
- Environment:ArroyoSystems Arroyo PostgreSQL Database
- Environment:Cleanlab Cleanlab Datalab Dependencies
- Environment:FlagOpen FlagEmbedding Python PyTorch Environment
- Environment:Gretelai Gretel synthetics TensorFlow GPU Environment
- Environment:Huggingface Datatrove Python Runtime
- Environment:Google research Deduplicate text datasets Python HuggingFace Environment