Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:SeldonIO Seldon core Inference Pipeline
- Workflow:Huggingface Datatrove FineWeb Dataset Creation
- Workflow:Neuml Txtai Semantic Search Pipeline
- Workflow:Microsoft Agent framework Graph Based Workflow Execution
- Workflow:Sail sg LongSpec Long Context Evaluation
- Workflow:Ggml org Llama cpp Embedding Extraction
- Workflow:Protectai Llm guard LLM Input Output Scanning
- Workflow:TA Lib Ta lib python Basic Indicator Computation
- Workflow:Gretelai Gretel synthetics DGAN Timeseries Generation
- Workflow:PacktPublishing LLM Engineers Handbook Model Evaluation
Principles
- Principle:PrefectHQ Prefect HTML Parsing
- Principle:Astronomer Astronomer cosmos Dbt Invocation
- Principle:DataExpert io Data engineer handbook Event Tracking
- Principle:Nautechsystems Nautilus trader Order Creation
- Principle:Apache Paimon Multi Batch Data Writing
- Principle:CARLA simulator Carla Simulator Packaging
- Principle:ArroyoSystems Arroyo Graceful Shutdown
- Principle:Triton inference server Server Stress Testing
- Principle:ARISE Initiative Robosuite Object Grouping
- Principle:Duckdb Duckdb Test Framework
Implementations
- Implementation:Mit han lab Llm awq Wikitext eval loop
- Implementation:Interpretml Interpret TensorTotalsBuild
- Implementation:Ggml org Llama cpp Common Utils
- Implementation:Nightwatchjs Nightwatch Extension Invocation Pattern
- Implementation:CrewAIInc CrewAI Bedrock Code Interpreter Toolkit
- Implementation:Predibase Lorax Rotary Embedding
- Implementation:Run llama Llama index CohereRerankerFinetuneEngine
- Implementation:EvolvingLMMs Lab Lmms eval RefCOCO Plus Utils Rec
- Implementation:DevExpress Testcafe CreateTestCafe Factory
- Implementation:Risingwavelabs Risingwave JDBCSinkFactory
Heuristics
- Heuristic:ChenghaoMou Text dedup Mersenne Prime Backward Compatibility
- Heuristic:Langchain ai Langgraph Retry Policy Configuration
- Heuristic:Neuml Txtai MacOS Stability Workarounds
- Heuristic:Bigscience workshop Petals Randomized Rebalancing Intervals
- Heuristic:Axolotl ai cloud Axolotl Sample Packing Best Practices
- Heuristic:Datahub project Datahub Validation Across All APIs
- Heuristic:Snorkel team Snorkel Precision Init Prior
- Heuristic:Unstructured IO Unstructured Warning Deprecated Staging Base
- Heuristic:Princeton nlp SimPO Left Truncation Strategy
- Heuristic:Fastai Fastbook Dropout Regularization
Environments
- Environment:LLMBook zh LLMBook zh github io HuggingFace Transformers Stack
- Environment:Pola rs Polars Cloud Storage Environment
- Environment:NVIDIA NeMo Aligner TensorRT LLM Acceleration Environment
- Environment:Cleanlab Cleanlab Datalab Dependencies
- Environment:Sktime Pytorch forecasting Matplotlib Plotting Dependencies
- Environment:BerriAI Litellm Redis Cache Backend
- Environment:Unslothai Unsloth CUDA VLLM
- Environment:Tensorflow Tfjs Browser Runtime
- Environment:LLMBook zh LLMBook zh github io VLLM Inference Environment
- Environment:Explodinggradients Ragas Python Runtime Environment