Compared with phase spectrum,magnitude spectrum can represent most speech information,hence many speech processing tasks pay much attention on manipulating mag-nitude spectrum and use the imperfect vocoder parameters or mismatched phase spectrum to synthesize the waveform,which leads to an obvious distortion of speech quality.To address this problem,a modified version of WaveNet model fused with phase information is proposed to syn-thesize the speech with higher quality.In the WaveNet model,the original or processed phase spectrum of speech and the enhanced magnitude spectrum are concatenated as the condition input,and then the predicted speech waveform is generated directly from this input,which is a kind of fusion feature.The proposed method can realize the effective utilization of the phase information and is verified in two tasks including voice conversion (VC) and bone-conducted speech enhancement (BSE).Two kinds of phase spectrum,the modified group delay (MGD)spectrum and the instantaneous frequency deviation spectrum,are compared comprehensively in the simulation experiments,and the influence of the fusion feature on the bandwidth exten-sion WaveNet model and the teacher-student WaveNet model is also explored.In VC experi-ments,the A/B test shows the generated speech using the teacher-student WaveNet model is much better than using the STRAIGHT vocoder.In BSE experiments,the results show that,using the bandwidth extension WaveNet model via the feature fused with MGD spectrum,the mean opinion score (MOS) of the enhanced speech increases by 54.3% compared with the orig-inal bone-conducted speech.All the results demonstrate that the phase-fused condition input can supplement single magnitude spectrum efficiently and help the WaveNet vocoder achieve promising improvement on the quality of the synthesized speech.