Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent, and let it build robust AI/ML systems autonomously:
- SuperML plugin: converts your AI coding agent into an expert ML engineer with agentic memory
- Leeroopedia MCP: search ML/AI best practices and skills
- Kapso: experimentation platform for autonomous AI/ML software building
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Apache Shardingsphere Shadow Database Routing
- Workflow:Tensorflow Serving Model Version Management
- Workflow:Obss Sahi COCO Evaluation
- Workflow:Testtimescaling Testtimescaling github io GitHub Pages Course Progression
- Workflow:Ray project Ray Actor Lifecycle Management
- Workflow:Mistralai Client python Finetuning Job Management
- Workflow:Teamcapybara Capybara Custom Selector Definition
- Workflow:Groq Groq python Text Embedding
- Workflow:FMInference FlexLLMGen Data Wrangling Batch Inference
- Workflow:ThreeSR Awesome Inference Time Scaling Manual Paper Contribution
Principles
- Principle:Farama Foundation Gymnasium Composite Space Types
- Principle:Huggingface Diffusers Pipeline Level Quantization
- Principle:Unstructured IO Unstructured Time Profiling
- Principle:Heibaiying BigData Notes HBase Data Reading
- Principle:Ggml org Llama cpp Chat Template Application
- Principle:Datajuicer Data juicer Data Export
- Principle:Huggingface Datatrove WARC Archive Reading
- Principle:Neuml Txtai Task Wrapping
- Principle:Fede1024 Rust rdkafka Client Statistics Monitoring
- Principle:Ggml org Ggml File Type Detection
Implementations
- Implementation:FlagOpen FlagEmbedding LLM Embedder ICL Utils
- Implementation:Ollama Ollama ML Backend
- Implementation:OWASP Www project top 10 for large language model applications VulnerabilityEntry Extract Common Examples
- Implementation:Triton inference server Server Convert Checkpoint
- Implementation:Open compass VLMEvalKit Register Dataset
- Implementation:Duckdb Duckdb Benchmark Class
- Implementation:Explodinggradients Ragas Text2SQL Data Utils
- Implementation:NVIDIA NeMo Curator ClipFrameExtractionStage
- Implementation:Avhz RustQuant RootFinder Trait
- Implementation:Apache Kafka Build Docker Image Runner
Heuristics
- Heuristic:Protectai Llm guard ONNX Runtime Optimization
- Heuristic:Apache Kafka JVM GC Tuning Defaults
- Heuristic:Apache Airflow DAG Top Level Code Avoidance
- Heuristic:Huggingface Transformers Dataloader Pin Memory NonBlocking
- Heuristic:Microsoft Playwright Timeout Configuration Tips
- Heuristic:ArroyoSystems Arroyo Batch Size And Backpressure
- Heuristic:Huggingface Diffusers Guidance Scale Defaults
- Heuristic:Pyro ppl Pyro Guide Initialization Strategy
- Heuristic:Microsoft Agent framework Async Context Manager Cleanup
- Heuristic:Allenai Open instruct Disable Dropout In RL
Environments
- Environment:Zai org CogVideo SAT Framework Environment
- Environment:Apache Airflow Development Contributor Environment
- Environment:OpenRLHF OpenRLHF vLLM Environment
- Environment:Avhz RustQuant Rust Stable
- Environment:DataExpert io Data engineer handbook Statsig API Environment
- Environment:Mlc ai Mlc llm TVM Runtime Environment
- Environment:Sail sg LongSpec Inference Environment
- Environment:BerriAI Litellm Provider API Credentials
- Environment:Huggingface Open r1 CUDA Environment
- Environment:Heibaiying BigData Notes Spark 2 4 Environment