Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:MarketSquare Robotframework browser Library Development and Release
- Workflow:Volcengine Verl PPO Training With Reward Model
- Workflow:Microsoft DeepSpeedExamples RLHF Training Pipeline
- Workflow:Tensorflow Serving Batched Inference Pipeline
- Workflow:Recommenders team Recommenders Algorithm Benchmarking
- Workflow:Hiyouga LLaMA Factory DPO Preference Alignment
- Workflow:Interpretml Interpret Model Explanation And Visualization
- Workflow:Haotian liu LLaVA Two Stage Pretraining and Finetuning
- Workflow:Princeton nlp Tree of thought llm Baseline comparison
- Workflow:Openai CLIP Linear probe evaluation
Principles
- Principle:ChenghaoMou Text dedup Suffix Array Construction
- Principle:Mlflow Mlflow Code Quality Linting
- Principle:Microsoft Onnxruntime Trained Model Export
- Principle:DevExpress Testcafe Element Selection
- Principle:Confident ai Deepeval Component Instrumentation
- Principle:Deepset ai Haystack Document Writing
- Principle:Anthropics Anthropic sdk python Response Processing
- Principle:Heibaiying BigData Notes Hive Partitioning and Bucketing
- Principle:Neuml Txtai ONNX Export
- Principle:Huggingface Datasets Streaming Take
Implementations
- Implementation:Haosulab ManiSkill BC Diffusion Training
- Implementation:Nautechsystems Nautilus trader ParquetDataCatalog Query
- Implementation:Apache Hudi HoodieSplitReaderFunction Read
- Implementation:Cohere ai Cohere python Tool Model
- Implementation:Datahub project Datahub HdfsPlatform
- Implementation:Gretelai Gretel synthetics Tokenizer Training Pipeline
- Implementation:Open compass VLMEvalKit mPLUG Owl2
- Implementation:Microsoft DeepSpeedExamples Net Tutorial
- Implementation:CARLA simulator Carla RssCheck Interface
- Implementation:Confident ai Deepeval Synthesizer Generate Goldens From Contexts
Heuristics
- Heuristic:Openai Openai python Warning Deprecated Eval Stored Completions
- Heuristic:VainF Torch Pruning GQA Head Pruning Constraints
- Heuristic:Lm sys FastChat Flash Attention GPU Requirements
- Heuristic:Axolotl ai cloud Axolotl Sample Packing Best Practices
- Heuristic:Vibrantlabsai Ragas Warning Deprecated Legacy LLM Wrappers
- Heuristic:BerriAI Litellm Streaming Loop Detection
- Heuristic:NVIDIA NeMo Curator GPU Memory Resource Allocation
- Heuristic:TobikoData Sqlmesh Snapshot TTL Defaults
- Heuristic:LaurentMazare Tch rs Safetensors Format Preference
- Heuristic:Togethercomputer Together python Retry Backoff Strategy
Environments
- Environment:Spotify Luigi AWS S3 Storage
- Environment:Guardrails ai Guardrails Python 3 10 Runtime
- Environment:Togethercomputer Together python API Credentials
- Environment:Risingwavelabs Risingwave Java Connector Environment
- Environment:Apache Spark Release Build Environment
- Environment:Astronomer Astronomer cosmos Cloud Provider Dependencies
- Environment:Huggingface Diffusers Training Environment
- Environment:Openai Openai python Azure OpenAI
- Environment:Haifengl Smile Java 25 Runtime
- Environment:Intel Ipex llm Linux XPU Environment