Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin — converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP — search ML/AI best practices and skills
- Kapso — experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Pola rs Polars Lazy Query Pipeline
- Workflow:Mit han lab Llm awq TinyChat LLM Deployment
- Workflow:Huggingface Peft LoRA Causal LM Finetuning
- Workflow:Microsoft LoRA GPT2 NLG Finetuning
- Workflow:DataExpert io Data engineer handbook Flink Kafka Streaming Pipeline
- Workflow:Protectai Modelscan Custom Scanner Plugin
- Workflow:Princeton nlp SimPO On Policy Data Generation
- Workflow:Gretelai Gretel synthetics ACTGAN Tabular Synthesis
- Workflow:Heibaiying BigData Notes Hive Data Warehouse Operations
- Workflow:Online ml River Streaming Anomaly Detection
Principles
- Principle:Datahub project Datahub Metadata Sink Verification
- Principle:Scikit learn Scikit learn Feature Transformation
- Principle:Datahub project Datahub Docker Prerequisites Validation
- Principle:Langgenius Dify Variable Wiring
- Principle:Datajuicer Data juicer Partition Size Optimization
- Principle:Cypress io Cypress Monorepo Build Orchestration
- Principle:Cohere ai Cohere python SDK Authentication
- Principle:Cohere ai Cohere python Chat Response Processing
- Principle:Scikit learn Scikit learn Grid Search
- Principle:Togethercomputer Together python Model Download
Implementations
- Implementation:Iterative Dvc Repo Commit
- Implementation:Farama Foundation Gymnasium MountainCarEnv
- Implementation:Cypress io Cypress System Test Runner
- Implementation:OpenRLHF OpenRLHF UnpairedPreferenceDataset init
- Implementation:Astronomer Astronomer cosmos Get Dataset Alias Name
- Implementation:Open compass VLMEvalKit mPLUG Owl2
- Implementation:Kornia Kornia Geometry Conversions
- Implementation:Apache Paimon Ray Init
- Implementation:Alibaba MNN CMake Build Diffusion
- Implementation:Elevenlabs Elevenlabs python Model And Format Selection
Heuristics
- Heuristic:Tensorflow Tfjs WASM Cross Origin Isolation
- Heuristic:Apache Kafka Log4j Migration Compatibility
- Heuristic:Hpcaitech ColossalAI Gradient Checkpointing Memory Tip
- Heuristic:Haifengl Smile Quarkus Async Context Handling
- Heuristic:Deepset ai Haystack Pipeline Deep Copy Safety
- Heuristic:CARLA simulator Carla Synchronous Mode Fixed Delta
- Heuristic:Mit han lab Llm awq Kernel Selection Thresholds
- Heuristic:CarperAI Trlx Delta Rewards
- Heuristic:Datajuicer Data juicer Operator Fusion Rules
- Heuristic:Protectai Modelscan Stricter Zip Detection
Environments
- Environment:Vllm project Vllm GitHub
- Environment:Spotify Luigi Hadoop HDFS Cluster
- Environment:ArroyoSystems Arroyo PostgreSQL Database
- Environment:ClickHouse ClickHouse OpenSSL Runtime
- Environment:Allenai Open instruct Docker Container
- Environment:DistrictDataLabs Yellowbrick Python Scikit Learn Environment
- Environment:Openai Openai agents python Python 3 9 Runtime
- Environment:Togethercomputer Together python Fine Tuning Data Requirements
- Environment:OWASP Www project top 10 for large language model applications PR Description Generator Runtime
- Environment:Wandb Weave Python SDK Runtime