Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Ucbepic Docetl YAML Pipeline Execution
- Workflow:ArroyoSystems Arroyo Connection Setup
- Workflow:Nightwatchjs Nightwatch Custom Commands And Assertions
- Workflow:Trailofbits Fickling Pickle Decompilation and Tracing
- Workflow:Online ml River Streaming Anomaly Detection
- Workflow:Deepseek ai Janus Multimodal Understanding
- Workflow:Anthropics Anthropic sdk python Streaming Message Interaction
- Workflow:Ggml org Llama cpp Model Quantization
- Workflow:Risingwavelabs Risingwave Iceberg Lakehouse Ingestion
- Workflow:Apache Hudi Flink Table Clustering
Principles
- Principle:Farama Foundation Gymnasium Reproducible Seeding
- Principle:Bentoml BentoML Model Loading From Store
- Principle:Langgenius Dify StreamingDataProcessing
- Principle:Ollama Ollama GGUF Model Conversion Llama Adapter
- Principle:Scikit learn contrib Imbalanced learn Borderline Oversampling
- Principle:AUTOMATIC1111 Stable diffusion webui Sampling Architecture
- Principle:Sail sg LongSpec Token Acceptance
- Principle:Alibaba MNN Input Preprocessing
- Principle:Tencent Ncnn Face Alignment And Recognition
- Principle:Lucidrains X transformers Hybrid Discrete Continuous Tokens
Implementations
- Implementation:Open compass VLMEvalKit ConcatVideoDataset
- Implementation:Treeverse LakeFS ImportStatus
- Implementation:NVIDIA TransformerEngine PyTorch Ext GEMM
- Implementation:Predibase Lorax GPTQ Custom Autotune
- Implementation:Pyro ppl Pyro RSA SemanticParsing
- Implementation:Huggingface Open r1 Generate Completion
- Implementation:InternLM Lmdeploy Core Sync
- Implementation:Ucbepic Docetl OptimizationDialog
- Implementation:Deepset ai Haystack DocumentMRREvaluator
- Implementation:AnswerDotAI RAGatouille RAGPretrainedModel Rerank
Heuristics
- Heuristic:Iamhankai Forest of Thought Self Correction Confidence Threshold
- Heuristic:Bitsandbytes foundation Bitsandbytes Outlier Threshold Detection
- Heuristic:FlagOpen FlagEmbedding Temperature Scaling Tip
- Heuristic:Eric mitchell Direct preference optimization RMSprop Over Adam
- Heuristic:Cleanlab Cleanlab Confident Threshold Heuristic
- Heuristic:Interpretml Interpret Categorical Float Conversion Gotcha
- Heuristic:Pytorch Serve Ampere Tensor Core Optimization
- Heuristic:Fastai Fastbook Batch Size Selection
- Heuristic:Huggingface Optimum GPTQ Quantization Defaults
- Heuristic:Lance format Lance Encoding Compression Thresholds
Environments
- Environment:Deepspeedai DeepSpeed CUDA GPU Environment
- Environment:Google deepmind Mujoco Python Bindings Environment
- Environment:TobikoData Sqlmesh GitHub CICD Runner
- Environment:Apache Spark Python Environment
- Environment:Iamhankai Forest of Thought Python CUDA Runtime
- Environment:PacktPublishing LLM Engineers Handbook Unsloth Finetuning Environment
- Environment:Vibrantlabsai Ragas Python 3 9 Core Environment
- Environment:Vibrantlabsai Ragas LLM Provider Credentials
- Environment:Datajuicer Data juicer LLM API Credentials Environment
- Environment:Huggingface Alignment handbook Evaluation Tools