-
-
Notifications
You must be signed in to change notification settings - Fork 137
Install ‐ Manual Installation Guide
This guide provides detailed manual installation instructions for AllTalk TTS version 2 on Windows, Linux, and Mac (theoretical) platforms.
⚡ It is preferable to use Atsetup.bat
or Atsetup.sh
as detailed in the Quick Setup Guides ⚡
⚡ as this will perform all the below steps automatically & build a correct Python environment. ⚡
⚡ I do not test AllTalk in custom built Python environments and due to the complexity of different ⚡
⚡ Python environments I cannot support them to any great extent, as there are far too many moving parts. ⚡
🚨 AKA, I will not support custom Python environments. The information below this point is an "if you know 🚨
🚨 what you are doing, then you can, technically speaking, install AllTalk into a custom Python environment" 🚨
Mac Support Disclaimer
AllTalk TTS support for Mac systems is currently untested & therefore theoretical. Please be aware of the following limitations:
- The installation process and functionality on Mac systems have not been verified due to lack of testing hardware.
- Compatibility of various TTS engines with Mac is uncertain and may vary.
- Features requiring GPU acceleration (such as DeepSpeed and CUDA) will not function on Mac systems.
- Some TTS engines may have limited or no support for Mac due to their development focus on other platforms.
These limitations are inherent to the TTS engines themselves and are beyond the control of the AllTalk TTS project. Improvements in Mac support would require updates from the individual TTS engine developers.
Users attempting to install and run AllTalk TTS on Mac systems should proceed with caution and be prepared for potential compatibility issues or reduced functionality.
This is currently targeted at Python 3.11.x and PyTorch 2.2.1 & may well error or fail on anything other than these versions.
Windows Prerequisites
Before installing AllTalk, ensure you have the following:
- Git for cloning GitHub repositories. Installation instructions
- Microsoft C++ Build Tools and Windows SDK for proper Python functionality. Installation instructions
- Espeak-ng for multiple TTS engines to function. Installation instructions
-
curl for downloading files. Download from curl's official website
Linux Prerequisites
Depending on your Linux distribution, you'll need to install some packages for AllTalk and its TTS engines to function correctly:
-
Debian-based systems (e.g., Ubuntu):
sudo apt install libaio-dev espeak-ng ffmpeg gcc g++ git curl
-
RPM-based systems (e.g., CentOS, Fedora):
sudo yum install libaio-devel espeak-ng ffmpeg gcc g++ git curl
Mac Prerequisites (Theoretical)
For Mac systems, you'll need to install some packages using Homebrew. If you don't have Homebrew installed, you can install it by following the instructions on the [official Homebrew website](https://brew.sh/).
Once Homebrew is installed, run the following commands:
brew install espeak ffmpeg gcc git curl
You'll also need to install Xcode Command Line Tools if you haven't already:
xcode-select --install
Important: Avoid using dashes or spaces in your folder path (e.g., avoid my folder-is-this/alltalk_tts-main
) as this causes issues with Python.
Windows Manual Installation
-
Open Command Prompt and navigate to your preferred directory:
cd /d C:\path\to\your\preferred\directory
-
Clone the AllTalk repository:
git clone -b alltalkbeta https://github.com/erew123/alltalk_tts
-
Navigate to the AllTalk directory:
cd alltalk_tts
-
Create and navigate to the alltalk_environment directory:
mkdir alltalk_environment cd alltalk_environment
-
Download and install Miniconda:
curl -Lk "https://repo.anaconda.com/miniconda/Miniconda3-py311_24.4.0-0-Windows-x86_64.exe" > miniconda_installer.exe start /wait "" miniconda_installer.exe /InstallationType=JustMe /NoShortcuts=1 /AddToPath=0 /RegisterPython=0 /NoRegistry=1 /S /D=%cd%\conda
-
Navigate to the conda folder and create a new environment:
cd conda .\condabin\conda create --no-shortcuts -y -k --prefix ..\env python=3.11.9
-
Activate the new environment:
call .\condabin\conda.bat activate ..\env
-
Install PyTorch 2.2.1:
.\Scripts\conda install -y pytorch==2.2.1 torchvision==0.17.1 torchaudio==2.2.1 pytorch-cuda=12.1 -c pytorch -c nvidia
-
Install Faiss and FFmpeg:
.\Scripts\conda install -y pytorch::faiss-cpu .\Scripts\conda install -y conda-forge::ffmpeg=*=*gpl* .\Scripts\conda install -y -c conda-forge "ffmpeg=*=h*_*" --no-deps
-
Navigate back to the alltalk_tts folder and install requirements:
cd .. cd .. pip install -r system\requirements\requirements_standalone.txt
-
Update Gradio:
pip install --upgrade gradio==4.32.2
-
Download and install DeepSpeed:
curl -LO https://github.com/erew123/alltalk_tts/releases/download/DeepSpeed-14.0/deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl pip install deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl del deepspeed-0.14.0+ce78a63-cp311-cp311-win_amd64.whl
-
Install Parler:
pip install -r system\requirements\requirements_parler.txt
-
Clean the conda environment:
.\alltalk_environment\conda\Scripts\conda clean --all --force-pkgs-dirs -y
-
(Optional) Downgrade transformers for XTTS streaming support:
pip install transformers==4.42.4
Linux Manual Installation
-
Open a terminal and navigate to your preferred directory:
cd /path/to/your/preferred/directory
-
Clone the AllTalk repository:
git clone -b alltalkbeta https://github.com/erew123/alltalk_tts
-
Navigate to the AllTalk directory:
cd alltalk_tts
-
Create and navigate to the alltalk_environment directory:
mkdir alltalk_environment cd alltalk_environment
-
Download and install Miniconda:
curl -LO https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh bash Miniconda3-latest-Linux-x86_64.sh -b -p $PWD/conda
-
Create a new conda environment:
./conda/bin/conda create --no-shortcuts -y -k --prefix ../env python=3.11.9
-
Activate the new environment:
source ./conda/bin/activate ../env
-
Install PyTorch 2.2.1:
conda install -y pytorch==2.2.1 torchvision==0.17.1 torchaudio==2.2.1 pytorch-cuda=12.1 -c pytorch -c nvidia
-
Install Faiss and FFmpeg:
conda install -y pytorch::faiss-cpu conda install -y -c conda-forge "ffmpeg=*=*gpl*" conda install -y -c conda-forge "ffmpeg=*=h*_*" --no-deps
-
Navigate back to the alltalk_tts folder and install requirements:
cd .. pip install -r system/requirements/requirements_standalone.txt
-
Update Gradio:
pip install --upgrade gradio==4.32.2
-
Install DeepSpeed:
pip install deepspeed
-
Install Parler:
pip install -r system/requirements/requirements_parler.txt
-
Clean the conda environment:
./alltalk_environment/conda/bin/conda clean --all --force-pkgs-dirs -y
-
(Optional) Downgrade transformers for XTTS streaming support:
pip install transformers==4.42.4
Mac Manual Installation (Theoretical)
Note: This Mac installation process is theoretical and has not been tested. Use with caution.
-
Open Terminal and navigate to your preferred directory:
cd /path/to/your/preferred/directory
-
Clone the AllTalk repository:
git clone -b alltalkbeta https://github.com/erew123/alltalk_tts
-
Navigate to the AllTalk directory:
cd alltalk_tts
-
Create and navigate to the alltalk_environment directory:
mkdir alltalk_environment cd alltalk_environment
-
Download and install Miniconda:
curl -LO https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh bash Miniconda3-latest-MacOSX-x86_64.sh -b -p $PWD/conda
-
Create a new conda environment:
./conda/bin/conda create --no-shortcuts -y -k --prefix ../env python=3.11.9
-
Activate the new environment:
source ./conda/bin/activate ../env
-
Install PyTorch 2.2.1 (CPU version, as CUDA is not available on Mac):
conda install -y pytorch==2.2.1 torchvision==0.17.1 torchaudio==2.2.1 -c pytorch
-
Install Faiss and FFmpeg:
conda install -y pytorch::faiss-cpu conda install -y -c conda-forge "ffmpeg=*=*gpl*"
-
Navigate back to the alltalk_tts folder and install requirements:
cd .. pip install -r system/requirements/requirements_standalone.txt
-
Update Gradio:
pip install --upgrade gradio==4.32.2
-
Install Parler:
pip install -r system/requirements/requirements_parler.txt
-
Clean the conda environment:
./alltalk_environment/conda/bin/conda clean --all --force-pkgs-dirs -y
-
(Optional) Downgrade transformers for XTTS streaming support:
pip install transformers==4.42.4
Note: DeepSpeed is not installed on Mac as it's primarily for NVIDIA GPUs. Some features requiring GPU acceleration may not work on Mac systems.
The following will assume you have used Miniconda and not your own Python environment. If you have used your own Python environment, you will need to adapt the instructions specifically for that Python environment.
To start a miniconda environment you would run the conda.bat
file that's located in the condabin
folder, specifying the python environment name you created in the earlier steps (typically alltalk_environment
)
The following example will assume you used the C
drive on Windows and installed AllTalk into a folder called myfolder
, hence the complete path that you would have installed AllTalk to would be C:\myfolder\alltalk_tts\
@echo off
cd /D "%~dp0"
set CONDA_ROOT_PREFIX=C:\myfolder\alltalk_tts\alltalk_environment\conda
set INSTALL_ENV_DIR=C:\myfolder\alltalk_tts\alltalk_environment\env
call "%CONDA_ROOT_PREFIX%\condabin\conda.bat" activate "%INSTALL_ENV_DIR%"
call python script.py
Either way, once you start your Python environment, you would run:
-
python script.py
to start AllTalk
-
python finetuning.py
to start finetuning
- DeepSpeed is automatically installed but will only work on Nvidia GPUs (Windows and Linux only).
- Always activate the appropriate Python environment before making adjustments or using features like Fine-tuning.
- For more detailed instructions on Fine-tuning and DeepSpeed, refer to the setup utility and the full documentation.
- If you're new to Python environments, consider reviewing the Understanding Python Environments Simplified section in the Help documentation.
- Mac users may experience limited functionality due to lack of GPU acceleration for some TTS engines.