Is DeepSpeed supported?

Yes, DeepSpeed is supported to accelerate inference. However, it's disabled on Apple Silicon due to compatibility issues.

How can I improve the inference speed?

Use a powerful GPU, enable DeepSpeed where it is supported, and try a faster preset such as 'ultra_fast', which trades some output quality for speed.
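As a rough illustration of these tips, the sketch below picks a preset and a DeepSpeed flag from a description of the hardware. The helper names `choose_settings` and `synthesize` are hypothetical, and the rule of using 'fast' on a CUDA GPU and 'ultra_fast' otherwise is only an illustrative heuristic. The `TextToSpeech(use_deepspeed=...)` and `tts_with_preset(..., preset=...)` calls follow the project README as best understood here; check them against the version you have installed.

```python
def choose_settings(has_cuda: bool, machine: str, system: str) -> dict:
    """Pick speed-oriented settings from a description of the hardware."""
    apple_silicon = system == "Darwin" and machine == "arm64"
    return {
        # DeepSpeed is only useful with an NVIDIA GPU and is disabled on Apple Silicon.
        "use_deepspeed": has_cuda and not apple_silicon,
        # Illustrative heuristic: faster presets trade some quality for speed.
        "preset": "fast" if has_cuda else "ultra_fast",
    }


def synthesize(text: str, settings: dict):
    """Run Tortoise TTS with the chosen settings (needs tortoise-tts installed)."""
    # Imported lazily so choose_settings() works without the package.
    from tortoise.api import TextToSpeech

    # Assumption: the installed version accepts the `use_deepspeed` keyword.
    tts = TextToSpeech(use_deepspeed=settings["use_deepspeed"])
    return tts.tts_with_preset(text, preset=settings["preset"])
```

In practice, `has_cuda` could come from `torch.cuda.is_available()`, and `machine` and `system` from `platform.machine()` and `platform.system()`.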

Tortoise TTS

Overview

Tortoise TTS is an open-source, multi-voice text-to-speech system that combines an autoregressive decoder with a diffusion decoder for high-quality speech synthesis. The architecture prioritizes realistic prosody and intonation, producing natural-sounding speech. Local installation is aimed at machines with an NVIDIA GPU, and the released code is intended for inference rather than training. The system can be installed via pip and offers Docker support for simplified deployment. It can be used through command-line scripts or a Python API. Although the project was initially known for slow inference, recent optimizations have improved performance significantly. The project emphasizes voice customization and provides tools for reading large amounts of text, making it suitable for applications requiring personalized and expressive speech synthesis.

Common tasks

Text-to-speech conversion
Voice cloning
Speech synthesis customization

FAQ

What hardware is required to run Tortoise TTS?

An NVIDIA GPU is highly recommended for acceptable performance. A K80 can generate a medium-sized sentence in about 2 minutes, with faster GPUs providing better results.

How do I install Tortoise TTS?

The recommended installation method is using Conda. Follow the steps in the README, which include creating a Conda environment, installing PyTorch, and cloning the Tortoise TTS repository.

Can I use Tortoise TTS on a CPU?

While it's possible, performance will be significantly slower. A GPU is strongly recommended for practical use.

How can I customize the voice used by Tortoise TTS?

Tortoise TTS supports voice customization through voice cloning. You provide a few reference audio clips of the target speaker, and the model conditions its output on them at inference time; no retraining is needed.
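A minimal sketch of cloning a voice from reference clips might look like the following. The helpers `find_reference_clips` and `speak_in_voice` and the folder layout are hypothetical. The `load_audio(path, 22050)` and `tts_with_preset(..., voice_samples=...)` calls mirror the example in the project README as best understood here, so verify them against your installed version.

```python
from pathlib import Path


def find_reference_clips(voice_dir: str) -> list:
    """Return the sorted .wav files in a folder of reference recordings."""
    return sorted(str(p) for p in Path(voice_dir).glob("*.wav"))


def speak_in_voice(text: str, voice_dir: str, preset: str = "fast"):
    """Synthesize `text` conditioned on the clips found in `voice_dir`."""
    # Lazy imports: these need the tortoise-tts package installed.
    from tortoise.api import TextToSpeech
    from tortoise.utils.audio import load_audio

    paths = find_reference_clips(voice_dir)
    if not paths:
        raise ValueError(f"no .wav reference clips found in {voice_dir}")

    # Assumption: clips are loaded at 22,050 Hz, as in the README example.
    clips = [load_audio(p, 22050) for p in paths]
    tts = TextToSpeech()
    return tts.tts_with_preset(text, voice_samples=clips, preset=preset)
```

Keeping the file discovery separate from the synthesis call makes it easy to check which clips will be used before spending GPU time on generation.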
