Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Gretelai Gretel synthetics LSTM Text Generation
- Workflow:Risingwavelabs Risingwave Docker Deployment
- Workflow:Duckdb Duckdb Code Generation Pipeline
- Workflow:Groq Groq python Batch Processing
- Workflow:CrewAIInc CrewAI Custom Tool Integration
- Workflow:Neuml Txtai Agent Execution
- Workflow:Cohere ai Cohere python AWS Bedrock Deployment
- Workflow:Testtimescaling Testtimescaling github io GitHub Pages Course Progression
- Workflow:OpenRLHF OpenRLHF SFT Training
- Workflow:Mlflow Mlflow Experiment Tracking
Principles
- Principle:Google deepmind Dm control Locomotion Visualization
- Principle:Ggml org Llama cpp Model Distribution
- Principle:Cleanlab Cleanlab Multilabel Quality Scoring
- Principle:Apache Flink Locality Aware Split Assignment
- Principle:Huggingface Transformers Data Loading
- Principle:Microsoft Semantic kernel Native Plugin Definition
- Principle:Apache Flink Source Chain Composition
- Principle:Apache Airflow Hook Connection Implementation
- Principle:Apache Spark Cluster Installation
- Principle:Langgenius Dify UIPresentation
Implementations
- Implementation:Scikit learn contrib Imbalanced learn geometric mean score
- Implementation:Ollama Ollama MLXRunner Slice
- Implementation:InternLM Lmdeploy QuantizationKernels
- Implementation:Ucbepic Docetl Dataset Debate Gleaning
- Implementation:Lance format Lance Java DeleteOp
- Implementation:Risingwavelabs Risingwave Streaming API Functions
- Implementation:DataTalksClub Data engineering zoomcamp Kafka Docker Compose Setup
- Implementation:Hpcaitech ColossalAI MTBenchDataset
- Implementation:Isaac sim IsaacGymEnvs DR YAML Configuration
- Implementation:ThreeSR Awesome Inference Time Scaling Search Papers Function
Heuristics
- Heuristic:Unstructured IO Unstructured Strategy Fallback Chain
- Heuristic:Deepset ai Haystack BM25 Score Scaling
- Heuristic:Iterative Dvc YAML Dual Parser Strategy
- Heuristic:ArroyoSystems Arroyo Stateful Operator TTL
- Heuristic:Princeton nlp SimPO Multi Seed Diversity
- Heuristic:Openai Openai python Streaming Resource Management
- Heuristic:Sail sg LongSpec NCCL Distributed Settings
- Heuristic:Junyanz Pytorch CycleGAN and pix2pix Identity Loss Color Preservation
- Heuristic:Sail sg LongSpec Tree Shape Configuration
- Heuristic:Marker Inc Korea AutoRAG Hybrid Retrieval Score Normalization
Environments
- Environment:FlowiseAI Flowise Docker Environment
- Environment:Pola rs Polars GPU Execution Environment
- Environment:MarketSquare Robotframework browser Docker Container
- Environment:Unstructured IO Unstructured OpenAI API
- Environment:DataTalksClub Data engineering zoomcamp Kafka Confluent Environment
- Environment:Nightwatchjs Nightwatch Android Mobile Testing
- Environment:Mlflow Mlflow OpenAI LLM Integration Environment
- Environment:Risingwavelabs Risingwave Python Tooling Environment
- Environment:Huggingface Diffusers Quantization Environment
- Environment:Zai org CogVideo Diffusers Inference Environment