Overview
Diff-SVC is an open-source singing voice conversion tool utilizing diffusion models. It allows users to convert one singing voice into another. The architecture leverages diffusion probabilistic models to generate audio, offering capabilities to modify vocal characteristics such as pitch, timbre, and intonation. Key updates include support for 44.1kHz audio, optimizations for training speed and model size using the 'no_fs2' option, and improved inference. It supports various input and output audio formats. The project aims for academic exchange and is not intended for production environments, with no responsibility for copyright issues arising from generated content. Preprocessing, training, and inference instructions are provided, along with documentation for detailed parameter settings.
