Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search over best-practices and skills of ML/AI
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Pola rs Polars Data IO and Format Conversion
- Workflow:Haifengl Smile Data Loading Pipeline
- Workflow:Openai Openai node Function Calling
- Workflow:Predibase Lorax OpenAI Chat Completion
- Workflow:Facebookresearch Habitat lab Custom Task Extension
- Workflow:Ucbepic Docetl Playground Interactive Development
- Workflow:Promptfoo Promptfoo LLM Evaluation
- Workflow:Evidentlyai Evidently Text Data Quality Evaluation
- Workflow:Apache Airflow Scheduler Operation and Task Execution
- Workflow:Microsoft BIPIA White Box Defense Finetuning
Principles
- Principle:Apache Dolphinscheduler Workflow DAG Definition
- Principle:Intel Ipex llm Pipeline Parallel Generation
- Principle:Lance format Lance Index Optimization
- Principle:Ollama Ollama Manifest Resolution
- Principle:Facebookresearch Habitat lab Hierarchical Policy Assembly
- Principle:Interpretml Interpret EBM JSON Serialization
- Principle:Iterative Dvc Pipeline Visualization
- Principle:CrewAIInc CrewAI Built In Tool Selection
- Principle:FMInference FlexLLMGen DeepSpeed Training Engine
- Principle:Dotnet Machinelearning Gradient Boosted Tree Histogram
Implementations
- Implementation:Datajuicer Data juicer TokenNumFilter
- Implementation:Bentoml BentoML GRPC Testing Utils
- Implementation:Cypress io Cypress DetectFramework
- Implementation:Nautechsystems Nautilus trader BacktestEngine Run
- Implementation:OpenHands OpenHands DeviceCodeStore
- Implementation:ARISE Initiative Robosuite HingedBox
- Implementation:ARISE Initiative Robosuite TransportGroup
- Implementation:Mage ai Mage ai Docs Site Configuration
- Implementation:Langgenius Dify Dify Env Sync
- Implementation:Lm sys FastChat Gen Judgment
Heuristics
- Heuristic:Spcl Graph of thoughts GoT Decompose Sort Merge Strategy
- Heuristic:SqueezeAILab ETS Embedding Model GPU Collocation
- Heuristic:Bitsandbytes foundation Bitsandbytes Outlier Threshold Detection
- Heuristic:Fede1024 Rust rdkafka Librdkafka Debug Logging
- Heuristic:Speechbrain Speechbrain Data Augmentation Defaults
- Heuristic:OpenGVLab InternVL Pixel Shuffle Downsampling
- Heuristic:Mlc ai Web llm Tokenizer JSON Preference
- Heuristic:Apache Kafka Coordinator Loading Commit Interval
- Heuristic:Apache Spark Memory Tuning Tips
- Heuristic:Microsoft DeepSpeedExamples LoRA Learning Rate Scaling
Environments
- Environment:Openai Openai python Python 3 9 Plus
- Environment:Mistralai Client python Realtime Transcription Environment
- Environment:LLMBook zh LLMBook zh github io HuggingFace Transformers Stack
- Environment:Mlc ai Web llm Node Build Toolchain
- Environment:Onnx Onnx Cpp Build Environment
- Environment:Apache Flink Hadoop Compatibility Environment
- Environment:SeleniumHQ Selenium Contributor Development Environment
- Environment:Dotnet Machinelearning Native Build Toolchain
- Environment:NVIDIA NeMo Aligner TensorRT LLM Acceleration Environment
- Environment:Intel Ipex llm RAG LangChain Environment