In a significant stride towards enhancing the fidelity of digital audio emulations, researchers Ryota Sato and Julius O. Smith III have developed a novel approach to reduce aliasing artifacts in neural amplifier models. Their work, titled “Aliasing Reduction in Neural Amp Modeling by Smoothing Activations,” addresses a persistent challenge in the realm of digital emulations of analog audio hardware, such as vintage tube guitar amplifiers.
The increasing demand for high-quality digital emulations has spurred numerous studies into neural network-based black-box modeling. Deep learning architectures like WaveNet have shown promising results, but a key limitation has been the aliasing artifacts that arise from the nonlinear activation functions in neural networks. These artifacts can degrade the audio quality, making the emulations less convincing. Sato and Smith’s research focuses on mitigating these aliasing effects by investigating novel and modified activation functions.
To quantify the level of aliasing, the researchers introduced a novel metric called the Aliasing-to-Signal Ratio (ASR). This metric provides a high-accuracy assessment of aliasing levels, complementing the conventional Error-to-Signal Ratio (ESR), which measures the overall modeling accuracy. By conducting studies on a range of preexisting and modern activation functions with varying stretch factors, the researchers found that activation functions with smoother curves tend to achieve lower ASR values. This indicates a noticeable reduction in aliasing artifacts.
One of the most significant findings of this research is that the improvement in aliasing reduction can be achieved without a substantial increase in ESR. This means that the models can maintain high accuracy in emulating the original analog hardware while significantly reducing unwanted aliasing artifacts. The practical implications for music and audio production are substantial. Musicians and producers who rely on digital emulations of vintage amplifiers can now expect higher fidelity and more authentic sound reproduction. This advancement could revolutionize the way digital audio emulations are used in recording studios, live performances, and music production software.
Moreover, the introduction of the ASR metric provides a valuable tool for future research in this field. By having a standardized way to measure aliasing, researchers can more effectively develop and refine activation functions and other components of neural network models to further improve audio quality. This breakthrough not only enhances the current capabilities of digital audio emulations but also paves the way for future innovations in the field of audio technology. Read the original research paper here.



