Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:DistrictDataLabs Yellowbrick Feature Analysis and Selection
- Workflow:Heibaiying BigData Notes Flink Kafka Streaming Pipeline
- Workflow:Langchain ai Langgraph CLI Deployment
- Workflow:Haotian liu LLaVA Web Demo Deployment
- Workflow:Fede1024 Rust rdkafka Mock Cluster Testing
- Workflow:Google deepmind Mujoco Offscreen video recording
- Workflow:Duckdb Duckdb Source Amalgamation And Packaging
- Workflow:Mistralai Client python Chat Completion
- Workflow:Intel Ipex llm Pipeline Parallel Inference
- Workflow:Roboflow Rf detr ONNX Export
Principles
- Principle:Cohere ai Cohere python Tool Schema Definition
- Principle:CARLA simulator Carla Autopilot Mode
- Principle:NVIDIA NeMo Curator Semantic Deduplication for Video
- Principle:Getgauge Taiko Page Navigation
- Principle:Heibaiying BigData Notes Kafka Message Sending
- Principle:FlagOpen FlagEmbedding Matryoshka Reranking
- Principle:Huggingface Peft AdaLoRA Adaptive Rank
- Principle:Axolotl ai cloud Axolotl LoRA Adapter Injection
- Principle:SeldonIO Seldon core Tabular Data Query Filtering
- Principle:Tensorflow Tfjs Layer Wrapping
Implementations
- Implementation:Langchain ai Langchain AzureChatOpenAI
- Implementation:DistrictDataLabs Yellowbrick ClassificationReport Visualizer
- Implementation:Guardrails ai Guardrails Validator Validate
- Implementation:Google deepmind Dm control Header Parsing
- Implementation:NVIDIA NeMo Curator CaptionGenerationStage
- Implementation:Farama Foundation Gymnasium JaxToTorch
- Implementation:Run llama Llama index LLM Utils
- Implementation:Treeverse LakeFS Java SDK Model CustomViewer
- Implementation:Pyro ppl Pyro SoftLaplace
- Implementation:Unstructured IO Unstructured Golden File Fixtures Local
Heuristics
- Heuristic:Apache Airflow DAG Top Level Code Avoidance
- Heuristic:Avhz RustQuant Interpolation Method Selection
- Heuristic:Microsoft Onnxruntime Flash Attention Optimization
- Heuristic:Langfuse Langfuse BullMQ Retry Strategy Patterns
- Heuristic:Cleanlab Cleanlab KNN Distance Metric Selection
- Heuristic:OpenGVLab InternVL Loss Reduction Strategy
- Heuristic:Princeton nlp Tree of thought llm Duplicate Candidate Zeroing
- Heuristic:Huggingface Trl Gradient Checkpointing Use Reentrant
- Heuristic:Volcengine Verl Inplace Operations OOM Prevention
- Heuristic:Bentoml BentoML Platform Serving Caveats
Environments
- Environment:Sdv dev SDV Python Runtime
- Environment:Allenai Open instruct vLLM Inference
- Environment:DataTalksClub Data engineering zoomcamp Dlt BigQuery Environment
- Environment:DataExpert io Data engineer handbook Spark Iceberg Docker Environment
- Environment:Googleapis Python genai Gemini API Key Authentication
- Environment:Huggingface Trl Quantization Environment
- Environment:Cypress io Cypress Linux Display Server
- Environment:Haifengl Smile Java 25 Runtime
- Environment:Datahub project Datahub Python Ingestion
- Environment:Astronomer Astronomer cosmos Cloud Provider Dependencies