Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Principle:Facebookresearch Audiocraft Loudness Ratio Loss

From Leeroopedia
Knowledge Sources
Domains Audio_Processing, Loss_Functions
Last Updated 2026-02-14 01:00 GMT

Overview

A perceptual loss function family that measures the noise-to-signal loudness ratio across time, frequency, or time-frequency cells with softmax-weighted aggregation.

Description

Loudness Ratio Losses compute the perceptual SNR between output and reference audio by measuring loudness in time frames, frequency bands, or both. A softmax weighting scheme ensures that the noisiest regions contribute most to the loss, focusing optimization on the most perceptually problematic areas.

Usage

Use these losses in audio compression or watermarking training where perceptual quality matters more than simple MSE reconstruction.

Theoretical Basis

The loss computes noise loudness L_n and reference loudness L_r in each cell, then aggregates with softmax weighting:

=isoftmax(Ln,i)(Ln,iLr,i)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment