Sound demos for "Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks"

Authors: Yang Ai, Zhen-Hua Ling

 

Comparison among Phase Prediction Methods (Section IV-C):

(1) Analysis-Synthesis Task

Example 1
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     
Example 2
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     
Example 3
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     
Example 4
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     
Example 5
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     
Example 6
Natural          
       
NSPP GL22 RAAR13      
     
GL100 RAAR100
DNN+GL100
     
     

(2) Bandwidth Extension (BWE) Task

Example 1
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 2
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 3
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 4
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 5
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 6
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   

(3) Speech Synthesis (SS) Task

Example 1
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 2
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 3
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 4
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 5
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   
Example 6
Natural          
       
NSPP GL100 RAAR100 DNN+GL100    
   

 

Comparison with Waveform Reconstruction Method (Section IV-D):

(1) Analysis-Synthesis Task

Example 1
Natural          
       
NSPP HiFi-GAN        
       
Example 2
Natural          
       
NSPP HiFi-GAN        
       
Example 3
Natural          
       
NSPP HiFi-GAN        
       
Example 4
Natural          
       
NSPP HiFi-GAN        
       
Example 5
Natural          
       
NSPP HiFi-GAN        
       
Example 6
Natural          
       
NSPP HiFi-GAN        
       

(2) BWE Task

Example 1
Natural          
       
NSPP HiFi-GAN        
       
Example 2
Natural          
       
NSPP HiFi-GAN        
       
Example 3
Natural          
       
NSPP HiFi-GAN        
       
Example 4
Natural          
       
NSPP HiFi-GAN        
       
Example 5
Natural          
       
NSPP HiFi-GAN        
       
Example 6
Natural          
       
NSPP HiFi-GAN        
       

(3) SS Task

Example 1
Natural          
       
NSPP HiFi-GAN        
       
Example 2
Natural          
       
NSPP HiFi-GAN        
       
Example 3
Natural          
       
NSPP HiFi-GAN        
       
Example 4
Natural          
       
NSPP HiFi-GAN        
       
Example 5
Natural          
       
NSPP HiFi-GAN        
       
Example 6
Natural          
       
NSPP HiFi-GAN        
       

 

Evaluation on Low-Latency Streamable Phase Prediction (Section IV-E):

(1) Analysis-Synthesis Task

Example 1
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 2
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 3
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 4
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 5
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 6
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     

(2) BWE Task

Example 1
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 2
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 3
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 4
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 5
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 6
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     

(3) SS Task

Example 1
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 2
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 3
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 4
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 5
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     
Example 6
Natural          
       
NSPP NSPP_causal NSPP_causal_KD      
     

 

Discussions (Section IV-F):

(1) Effects of Different Anti-Wrapping Functions (Section IV-F1):

Example 1
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 
Example 2
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 
Example 3
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 
Example 4
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 
Example 5
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 
Example 6
Natural          
       
NSPP NSPP-log NSPP-cube NSPP-para NSPP-cos  
 

(2) Ablation Studies (Section IV-F2):

(1) NSPP vs. NSPP wo PEA:
  NSPP NSPP wo PEA
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
(2) NSPP vs. NSPP wo AWF:
  NSPP NSPP wo AWF
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
(3) NSPP vs. NSPP wo IP:
  NSPP NSPP wo IP
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
(4) NSPP vs. NSPP wo GD:
  NSPP NSPP wo GD
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6
(5) NSPP vs. NSPP wo IAF:
  NSPP NSPP wo IAF
Example 1
Example 2
Example 3
Example 4
Example 5
Example 6