Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: turns your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: an experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:CrewAIInc CrewAI Custom Tool Integration
- Workflow:Spotify Luigi Local Batch Pipeline
- Workflow:Fede1024 Rust rdkafka Mock Cluster Testing
- Workflow:Google research Deduplicate text datasets Single file deduplication
- Workflow:Heibaiying BigData Notes Flink Kafka Streaming Pipeline
- Workflow:Zai org CogVideo Diffusers Image to Video Inference
- Workflow:Ollama Ollama Custom Model Creation
- Workflow:InternLM Lmdeploy LLM Offline Batch Inference
- Workflow:Rapidsai Cuml Sklearn Zero Code Acceleration
- Workflow:FlowiseAI Flowise Evaluation Pipeline
Principles
- Principle:Datajuicer Data juicer LLM Backend Configuration
- Principle:LaurentMazare Tch rs PyTorch Binding Code Generation
- Principle:Elevenlabs Elevenlabs python Voice Cloning
- Principle:Eventual Inc Daft Iceberg Catalog Creation
- Principle:Run llama Llama index Index Persistence
- Principle:Infiniflow Ragflow Graph State Management
- Principle:FMInference FlexLLMGen Group Quantization Configuration
- Principle:Langfuse Langfuse Dataset Item Processing
- Principle:TobikoData Sqlmesh Plan Creation And Change Detection
- Principle:Kubeflow Pipelines Exit Handling
Implementations
- Implementation:CARLA simulator Carla EpisodeSettings
- Implementation:Predibase Lorax Response Format Type
- Implementation:ChenghaoMou Text dedup SimHash Union Find Cluster
- Implementation:SqueezeAILab ETS YAML Config Loading
- Implementation:ArroyoSystems Arroyo Kinesis Source
- Implementation:Trailofbits Fickling Activate Safe ML Environment
- Implementation:Huggingface Transformers Create Circleci Config
- Implementation:FlagOpen FlagEmbedding BGE Coder TripletGenerator
- Implementation:ArroyoSystems Arroyo Validate Query
- Implementation:Ucbepic Docetl ExperimentEvalUtils
Heuristics
- Heuristic:Lucidrains X transformers Rotary Position Embedding Selection
- Heuristic:CrewAIInc CrewAI LLM Provider Message Workarounds
- Heuristic:Promptfoo Promptfoo Transient Error Classification
- Heuristic:Facebookresearch Audiocraft FSDP Distributed Training Tips
- Heuristic:Apache Airflow Task Dependency Isolation
- Heuristic:Risingwavelabs Risingwave Source Backoff Strategy
- Heuristic:Microsoft Autogen Model Context Limiting
- Heuristic:Langchain ai Langchain Warning Deprecated Langchain Classic
- Heuristic:AUTOMATIC1111 Stable diffusion webui VRAM Management Strategies
- Heuristic:Microsoft Semantic kernel Prompt Injection Safety
Environments
- Environment:Kubeflow Kubeflow Istio Certmanager Dex Environment
- Environment:Langgenius Dify Credentials And Env Vars
- Environment:Mlflow Mlflow GPU System Metrics Environment
- Environment:Langgenius Dify Python Backend Environment
- Environment:Deepspeedai DeepSpeed Python Runtime Environment
- Environment:Avdvg InjectGuard CUDA GPU
- Environment:EvolvingLMMs Lab Lmms eval Python Runtime Environment
- Environment:BerriAI Litellm Python Runtime
- Environment:Mage ai Mage ai Singer SDK And Joblib Runtime
- Environment:Openai CLIP Python Dependencies