Sampling rate = 48 kHz, Bitrate = 6 kbps
Example 1 (p360, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 2 (p361, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 3 (p362, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 4 (p363, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 5 (p364, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 6 (p374, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 7 (p376, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 8 (s5, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Sampling rate = 48 kHz, Bitrate = 12 kbps
Example 1 (p360, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 2 (p361, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 3 (p362, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 4 (p363, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 5 (p364, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 6 (p374, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 7 (p376, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Example 8 (s5, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | AudioDec | DAC |
Sampling rate = 24 kHz, Bitrate = 3 kbps
Example 1 (p376, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Example 2 (p362, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Sampling rate = 24 kHz, Bitrate = 6 kbps
Example 1 (p376, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Example 2 (p362, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Sampling rate = 16 kHz, Bitrate = 2 kbps
Example 1 (p364, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Example 2 (s5, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Sampling rate = 16 kHz, Bitrate = 4 kbps
Example 1 (p364, male speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Example 2 (s5, female speaker) |
||||
---|---|---|---|---|
Raw Audio | ||||
APCodec | APCodec-S | Encodec | SoundStream | HiFi-Codec |
Ablation on APCodec (Sampling rate = 48 kHz, Bitrate = 6 kbps)
Example 1 (p360, male speaker) |
||||
---|---|---|---|---|
APCodec | ||||
APCodec w/o CNV | APCodec w/o MelMSE | APCodec w/o QLoss | APCodec w/o MRD | APCodec w/o Hinge |
Example 2 (p361, female speaker) |
||||
---|---|---|---|---|
APCodec | ||||
APCodec w/o CNV | APCodec w/o MelMSE | APCodec w/o QLoss | APCodec w/o MRD | APCodec w/o Hinge |
Ablation on APCodec-S (Sampling rate = 48 kHz, Bitrate = 6 kbps)
Example 1 (p360, male speaker) |
||||
---|---|---|---|---|
APCodec-S | APCodec-S w/o KD | |||
Example 2 (p361, female speaker) |
||||
---|---|---|---|---|
APCodec-S | APCodec-S w/o KD | |||
Results on Common Voice dataset
Example 1 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |
Example 2 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |
Results on opencpop dataset
Example 1 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |
Example 2 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |
Results on FSD50K dataset
Example 1 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |
Example 2 |
|||
---|---|---|---|
Raw Audio | |||
APCodec | DAC | APCodec-S | AudioDec |