Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Principle:Facebookresearch Audiocraft D Adaptation Optimization

From Leeroopedia
Revision as of 17:32, 16 February 2026 by Admin (talk | contribs) (Auto-imported from principles/Facebookresearch_Audiocraft_D_Adaptation_Optimization.md)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Knowledge Sources
Domains Optimization, Training
Last Updated 2026-02-14 01:00 GMT

Overview

An automatic learning rate adaptation technique that estimates the distance to the optimal solution online, eliminating the need for manual learning rate tuning.

Description

D-Adaptation methods modify standard optimizers (like Adam) to automatically determine the appropriate step size. By maintaining an online estimate of D (the initial distance to the solution in dual space), the optimizer can provably converge with a single hyperparameter-free learning rate setting.

Usage

Use this principle when training models where learning rate search is expensive or impractical. It provides theoretical convergence guarantees without manual tuning.

Theoretical Basis

The method maintains an estimate of D that is updated each step:

dk=max(dk1,|gkT(xkx0)|i=0k||gi||)

The effective learning rate is then scaled by d_k, ensuring the step size adapts to the problem geometry.

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment