Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:LLMBook zh LLMBook zh github io LLM Pretraining
- Workflow:Testtimescaling Testtimescaling github io Automated Citation Tracking
- Workflow:HKUDS AI Trader Agent Decision Loop
- Workflow:Kserve Kserve InferenceGraph Pipeline
- Workflow:Tensorflow Serving REST API Inference
- Workflow:Promptfoo Promptfoo Project Initialization
- Workflow:Microsoft Autogen Graph Based Agent Orchestration
- Workflow:DataTalksClub Data engineering zoomcamp dbt Analytics Transformation
- Workflow:PacktPublishing LLM Engineers Handbook Model Evaluation
- Workflow:Diagram of thought Diagram of thought DoT Prompt Customization
Principles
- Principle:Datajuicer Data juicer Partition Size Optimization
- Principle:NVIDIA DALI Pipeline Definition
- Principle:ContextualAI HALOs Metrics Summarization
- Principle:FlowiseAI Flowise Vector Store Upsert
- Principle:Lucidrains X transformers Masked Prediction Data Preparation
- Principle:DistrictDataLabs Yellowbrick Drawing Primitives
- Principle:Ollama Ollama Inference Dispatch
- Principle:Ggml org Llama cpp Quantization
- Principle:Togethercomputer Together python Image Response Processing
- Principle:CARLA simulator Carla Obstacle Detection
Implementations
- Implementation:Deepspeedai DeepSpeed OpBuilder
- Implementation:Guardrails ai Guardrails Faiss
- Implementation:Apache Paimon KeyValueDataWriter
- Implementation:DataExpert io Data engineer handbook Pytest Spark Fixture
- Implementation:Ggml org Llama cpp Memory Hybrid ISWA
- Implementation:DistrictDataLabs Yellowbrick Color Palettes
- Implementation:Speechbrain Speechbrain Hparams KsponSpeech Conformer
- Implementation:Tensorflow Tfjs Training Utils
- Implementation:Scikit learn Scikit learn SetOutput
- Implementation:Risingwavelabs Risingwave CdcSourceChannel
Heuristics
- Heuristic:Trailofbits Fickling Injection Mode Selection
- Heuristic:Allenai Open instruct Logprob Clamping
- Heuristic:Openai CLIP Linear Probe Regularization C
- Heuristic:Openai Openai node RunTools Loop Limit
- Heuristic:Snorkel team Snorkel Binary Only Slicing
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix CuDNN Benchmark Scale Width
- Heuristic:NVIDIA TransformerEngine Attention Backend Selection
- Heuristic:Mlc ai Web llm Penalty Parameter Defaults
- Heuristic:Confident ai Deepeval Timeout and Retry Tuning
- Heuristic:Lucidrains X transformers Numerical Stability Techniques
Environments
- Environment:Run llama Llama index Fsspec Remote Storage
- Environment:Mlflow Mlflow OpenAI LLM Integration Environment
- Environment:Helicone Helicone Wrangler CLI
- Environment:NVIDIA TransformerEngine Python PyTorch Requirements
- Environment:Marker Inc Korea AutoRAG GPU PyTorch Environment
- Environment:Mbzuai oryx Awesome LLM Post training Python Pandas
- Environment:Apache Flink Python PyFlink Environment
- Environment:Treeverse LakeFS Spark GC Environment
- Environment:Open compass VLMEvalKit API Keys And Credentials
- Environment:Huggingface Transformers PyTorch 24 CUDA