Fix README.md

Files changed (2) hide show

README.md CHANGED Viewed

@@ -13,8 +13,11 @@ Audiobox-Aesthetics is introduced in [Meta Audiobox Aesthetics: Unified Automati
 **Model Developer**: FAIR @ Meta AI
-**Model Architecture**: Audiobox-Aesthetics
 # How to install
 We are providing 2 ways to run the model:

 **Model Developer**: FAIR @ Meta AI
+**Model Architecture**:
+<img src="assets/aes_model.png" alt="Model" height="400px">
+Audiobox-Aesthetics is based on simple Transformer-based architecture. Specifically, the audio encoder based on WavLM-based structure, consisted of several CNN and 12 Transformers (Vaswani et al., 2017) layers with 768 hidden dimensions. To predict the output, we project the audio embedding through multiple multi-layer perceptron (MLP) blocks where each MLP block consisted of 5 non-linear layers with respect to each axes (PQ, PC, CE, CU). The model is trained with standard regression loss (Mean-Absolute & Mean-Squared Error).
 # How to install
 We are providing 2 ways to run the model:

aes_model.png → assets/aes_model.png RENAMED Viewed

File without changes