Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Treeverse LakeFS Garbage Collection
- Workflow:Fastai Fastbook Tabular Modeling
- Workflow:Ggml org Llama cpp Speculative Decoding
- Workflow:Predibase Lorax Multi Adapter Merging
- Workflow:Apache Beam Twister2 Batch Execution
- Workflow:Iterative Dvc Plot Visualization
- Workflow:TA Lib Ta lib python Candlestick Pattern Recognition
- Workflow:Openai Evals Creating a model graded eval
- Workflow:Zai org CogVideo Diffusers LoRA Finetuning
- Workflow:Mlflow Mlflow Model Serving
Principles
- Principle:Apache Kafka Source Artifact Building
- Principle:PacktPublishing LLM Engineers Handbook Prompt Engineering For Dataset Generation
- Principle:Langgenius Dify Plugin Installation
- Principle:DataTalksClub Data engineering zoomcamp Dbt Project Configuration
- Principle:Huggingface Transformers LoRA Configuration
- Principle:Onnx Onnx External Data Loading
- Principle:OWASP Www project top 10 for large language model applications Style Guide Conformance
- Principle:Mlflow Mlflow Model Registration
- Principle:Apache Flink Asynchronous File Compaction
- Principle:DevExpress Testcafe Reporter Output Configuration
Implementations
- Implementation:Ggml org Llama cpp Get Evaluation Dataset
- Implementation:FlagOpen FlagEmbedding Search Demo Preprocess
- Implementation:Ucbepic Docetl ParetoFrontier Analysis
- Implementation:Elevenlabs Elevenlabs python UnitTestToolCallEvaluationModelInput
- Implementation:Apache Dolphinscheduler PhysicalTaskExecutor Lifecycle
- Implementation:DataTalksClub Data engineering zoomcamp Toml Credentials Loader
- Implementation:Open compass VLMEvalKit VGRPBench Aquarium
- Implementation:Heibaiying BigData Notes Hive View and Management Operations
- Implementation:Lance format Lance Java Index
- Implementation:Infiniflow Ragflow DataflowResultHooks
Heuristics
- Heuristic:Deepseek ai Janus Image Generation Prompt Tips
- Heuristic:LLMBook zh LLMBook zh github io Reward Model LM Regularization
- Heuristic:Arize ai Phoenix Adaptive Rate Limiting
- Heuristic:Treeverse LakeFS Retry Backoff Configuration
- Heuristic:Promptfoo Promptfoo Adaptive Concurrency Tuning
- Heuristic:Datahub project Datahub Secret Handling And Deprecation Patterns
- Heuristic:Open compass VLMEvalKit Video Frame Sampling Configuration
- Heuristic:Marker Inc Korea AutoRAG Passage Filter Safety Minimum
- Heuristic:SeldonIO Seldon core Tracing Latency Tip
- Heuristic:Fede1024 Rust rdkafka Regular Polling Required
Environments
- Environment:Duckdb Duckdb CMake Build Toolchain
- Environment:Predibase Lorax Docker Container Runtime
- Environment:Neuml Txtai GPU Accelerator Detection
- Environment:Rapidsai Cuml Dask Distributed
- Environment:Lm sys FastChat GPU CUDA Inference
- Environment:Eric mitchell Direct preference optimization Python Dependencies
- Environment:Google deepmind Dm control GLFW Desktop Rendering
- Environment:Apache Kafka Release Toolchain Environment
- Environment:ThreeSR Awesome Inference Time Scaling Semantic Scholar API Environment
- Environment:Openai Openai agents python LiteLLM Dependencies