Sound demos for "APCodec: A Neural Audio Codec with Parallel Encoding and Decoding for Amplitude and Phase Spectra"

Authors: Yang Ai, Xiao-Hang Jiang, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling

 

Primary Experimental Results

Sampling rate = 48 kHz, Bitrate = 6 kbps

Example 1 (p360, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 2 (p361, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 3 (p362, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 4 (p363, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 5 (p364, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 6 (p374, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 7 (p376, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 8 (s5, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC

 

Sampling rate = 48 kHz, Bitrate = 12 kbps

Example 1 (p360, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 2 (p361, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 3 (p362, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 4 (p363, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 5 (p364, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 6 (p374, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 7 (p376, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC
Example 8 (s5, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec AudioDec DAC

 

Sampling rate = 24 kHz, Bitrate = 3 kbps

Example 1 (p376, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec
Example 2 (p362, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec

 

Sampling rate = 24 kHz, Bitrate = 6 kbps

Example 1 (p376, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec
Example 2 (p362, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec

 

Sampling rate = 16 kHz, Bitrate = 2 kbps

Example 1 (p364, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec
Example 2 (s5, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec

Sampling rate = 16 kHz, Bitrate = 4 kbps

Example 1 (p364, male speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec
Example 2 (s5, female speaker)
Raw Audio        
     
APCodec APCodec-S Encodec SoundStream HiFi-Codec

 

Ablation Studies

Ablation on APCodec (Sampling rate = 48 kHz, Bitrate = 6 kbps)

Example 1 (p360, male speaker)
APCodec        
     
APCodec w/o CNV APCodec w/o MelMSE APCodec w/o QLoss APCodec w/o MRD APCodec w/o Hinge
Example 2 (p361, female speaker)
APCodec        
     
APCodec w/o CNV APCodec w/o MelMSE APCodec w/o QLoss APCodec w/o MRD APCodec w/o Hinge

Ablation on APCodec-S (Sampling rate = 48 kHz, Bitrate = 6 kbps)

Example 1 (p360, male speaker)
APCodec-S APCodec-S w/o KD      
     
Example 2 (p361, female speaker)
APCodec-S APCodec-S w/o KD      
     

 

Validation on Diverse Datasets

Results on Common Voice dataset

Example 1
Raw Audio      
   
APCodec DAC APCodec-S AudioDec
Example 2
Raw Audio      
   
APCodec DAC APCodec-S AudioDec

Results on opencpop dataset

Example 1
Raw Audio      
   
APCodec DAC APCodec-S AudioDec
Example 2
Raw Audio      
   
APCodec DAC APCodec-S AudioDec

Results on FSD50K dataset

Example 1
Raw Audio      
   
APCodec DAC APCodec-S AudioDec
Example 2
Raw Audio      
   
APCodec DAC APCodec-S AudioDec