Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Junyanz Pytorch CycleGAN and pix2pix Pix2pix Training
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:Apache Shardingsphere Shadow Rule Configuration
- Workflow:Open compass VLMEvalKit Image Benchmark Evaluation
- Workflow:Duckdb Duckdb Source Amalgamation And Packaging
- Workflow:Openai Openai node Function Calling
- Workflow:Datahub project Datahub Docker Quickstart Deployment
- Workflow:Ggml org Llama cpp Model Quantization
- Workflow:Heibaiying BigData Notes HBase Java CRUD Operations
- Workflow:Cohere ai Cohere python Streaming Chat
Principles
- Principle:ArroyoSystems Arroyo Local Cluster Initialization
- Principle:AUTOMATIC1111 Stable diffusion webui Training dataset preparation
- Principle:Anthropics Anthropic sdk python Parse Request Execution
- Principle:Microsoft Playwright Select Browser and Configure Context
- Principle:AnswerDotAI RAGatouille Training Data Preparation
- Principle:Eventual Inc Daft Data Ingestion CSV
- Principle:Microsoft Autogen Agent Specialization
- Principle:Scikit learn Scikit learn Metric Evaluation
- Principle:Pytorch Serve Streaming Inference
- Principle:Pola rs Polars Time Range Filtering
Implementations
- Implementation:Hiyouga LLaMA Factory Sequence Packing
- Implementation:Langchain ai Langchain ToolsIntegrationTests
- Implementation:Onnx Onnx Shape Inference Interfaces
- Implementation:Lucidrains X transformers TextSamplerDataset Pattern
- Implementation:Elevenlabs Elevenlabs python TtsConversationalConfigInput
- Implementation:Interpretml Interpret DecisionListClassifier
- Implementation:Microsoft Onnxruntime CUDA PadAndUnflatten
- Implementation:Elevenlabs Elevenlabs python SpeechToTextWordResponseModel
- Implementation:Intel Ipex llm LangChain RAG Chain
- Implementation:EvolvingLMMs Lab Lmms eval WavCaps Utils
Heuristics
- Heuristic:Vllm project Vllm KV Cache Block Size Selection
- Heuristic:AnswerDotAI RAGatouille FAISS Vs PyTorch KMeans Indexing
- Heuristic:Spcl Graph of thoughts GoT Decompose Sort Merge Strategy
- Heuristic:Microsoft Autogen Graph Validation Rules
- Heuristic:LLMBook zh LLMBook zh github io DPO Beta Hyperparameter
- Heuristic:Unslothai Unsloth Padding Free Packing
- Heuristic:Datahub project Datahub Gradle Task Only
- Heuristic:Mbzuai oryx Awesome LLM Post training Reference Citation Cap 200
- Heuristic:Speechbrain Speechbrain Gradient Clipping Strategy
- Heuristic:SeleniumHQ Selenium FindElements For Absence Check
Environments
- Environment:Allenai Open instruct vLLM Inference
- Environment:Microsoft DeepSpeedExamples RLHF Training Environment
- Environment:Datajuicer Data juicer Ray Cluster Environment
- Environment:Nautechsystems Nautilus trader Databento API Credentials
- Environment:Microsoft BIPIA Python CUDA GPU Environment
- Environment:Pyro ppl Pyro Distributed Training
- Environment:Apache Shardingsphere Calcite Federation Engine
- Environment:Farama Foundation Gymnasium Video Recording Dependencies
- Environment:EvolvingLMMs Lab Lmms eval Python Runtime Environment
- Environment:MarketSquare Robotframework browser Node Runtime