Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:CARLA simulator Carla Traffic Generation
- Workflow:Alibaba ROLL Agentic RL Training Pipeline
- Workflow:ContextualAI HALOs Model Evaluation
- Workflow:Langgenius Dify Plugin Installation and Configuration
- Workflow:Huggingface Datatrove FineWeb Dataset Creation
- Workflow:Groq Groq python Audio Transcription
- Workflow:Deepspeedai DeepSpeed Sequence Parallel Long Context Training
- Workflow:Fastai Fastbook Tabular Modeling
- Workflow:Speechbrain Speechbrain Speech Separation Training
- Workflow:Ollama Ollama Custom Model Creation
Principles
- Principle:Snorkel team Snorkel Labeling Function Definition
- Principle:Apache Kafka Maven Artifact Publishing
- Principle:Avdvg InjectGuard Vector Store Construction
- Principle:FlowiseAI Flowise Server Monitoring
- Principle:Eventual Inc Daft Data Preprocessing Image Decoding
- Principle:OpenHands OpenHands Sandbox Creation
- Principle:Guardrails ai Guardrails AsyncExecution
- Principle:Huggingface Datasets Pandas Conversion
- Principle:Trailofbits Fickling Polyglot File Creation
- Principle:Zai org CogVideo Learned Perceptual Similarity
Implementations
- Implementation:CrewAIInc CrewAI Scrape Element Tool
- Implementation:EvolvingLMMs Lab Lmms eval VideoMathQA Evaluation Utils
- Implementation:Huggingface Datasets Dataset To Tf Dataset
- Implementation:Run llama Llama index SQLTableNodeMapping
- Implementation:SeleniumHQ Selenium GeckoDriverService
- Implementation:DataTalksClub Data engineering zoomcamp Kestra PostgreSQL CopyIn
- Implementation:Treeverse LakeFS Java SDK Model IcebergLocalTable
- Implementation:Haifengl Smile Index Structure Selection
- Implementation:Bentoml BentoML Podman Backend
- Implementation:Mlc ai Mlc llm Batch Prefill Base
Heuristics
- Heuristic:Avhz RustQuant MC Parallel Path Threshold
- Heuristic:Princeton nlp SimPO Concatenated Forward Pass
- Heuristic:Microsoft LoRA LoRA Rank Selection
- Heuristic:ThreeSR Awesome Inference Time Scaling Date Parsing Fallback Tip
- Heuristic:Vibrantlabsai Ragas Temperature Sampling Strategy
- Heuristic:Onnx Onnx Protobuf 2GB Limit Workaround
- Heuristic:Hiyouga LLaMA Factory Mixed Precision Training Tips
- Heuristic:Eric mitchell Direct preference optimization FSDP Mixed Precision BFloat16
- Heuristic:Dotnet Machinelearning FastTree Default Hyperparameters
- Heuristic:Langgenius Dify Celery Queue Separation
Environments
- Environment:Helicone Helicone Cloudflare Workers Runtime
- Environment:Mlc ai Mlc llm TVM Runtime Environment
- Environment:Google deepmind Dm control OSMesa Software Rendering
- Environment:OpenRLHF OpenRLHF DeepSpeed Environment
- Environment:Google deepmind Mujoco MJX Warp CUDA Environment
- Environment:Eric mitchell Direct preference optimization Python Dependencies
- Environment:Dagster io Dagster Container Resource Monitoring
- Environment:Pyro ppl Pyro Funsor Backend
- Environment:Langgenius Dify Vector Database Environment
- Environment:Protectai Llm guard Python Runtime Dependencies