ASMR Is All You Need

VĂ­ctor Adell
Jordi Aguilar
Pau Autrand
Miquel Escobar

June 2020

Abstract

In the past few years the popularity of the autonomous sensory meridian response (ASMR) concept has risen exponentially, being the video format the most common source for its stimuli. These are videos made by content creators who generate trigger sounds by using multiple materials and techniques. Therefore, in this study we propose variations of the WaveRNN and WaveNet models designed to generate these trigger sounds from scratch. We observe that the baseline architecture of these models outperform the alternative conditioned models in terms of the quality of the generated audios.

Find below a sample of the different model outputs that have been outlined on the paper ASMR Is All You Need.

Baseline models

Tapping on glass material

  • Database sound
  • Baseline WaveRNN
  • Baseline WaveNet
  • Brushing sounds

  • Database sound
  • Baseline WaveRNN
  • Baseline WaveNet
  • Conditioned models

    Tapping on metal material

  • Database sound
  • Conditioned WaveRNN
  • Conditioned WaveNet