๐ Quick Start
โ Unity Requirements
- com.whisper.unity must be installed. You can get it from the official GitHub repository.
- Add two sprites to your project (these are used for the "microphone enabled" and "microphone disabled" states).
- UnityNeuroSpeech is developed for Unity 6 (6000.0.47f1). Other versions might work, but Unity 6 is highly recommended.
๐ Other Requirements
- Ollama installed (along with any LLM you want to use).
- Python 3.11 installed.
- A downloaded Whisper model.
- At least one
.wav
file for voice cloning.
๐ก What are these tools?
- Ollama is a platform for running large language models (LLMs) locally. You can use models like DeepSeek, Gemma, Qwen, etc. Avoid small modelsโthey might affect accuracy and context understanding.
- Python 3.11 โ use this specific version because it's stable and tested with this framework.
- Whisper is a speech-to-text model. You can use lightweight versions like
ggml-tiny.bin
for speed, or heavier ones likeggml-medium.bin
for better accuracy.
โ๏ธ Base Setup
- Go to the UnityNeuroSpeech GitHub repository and download the following three
.rar
files from the latest Release: UnityNeuroSpeech.X.X.X.rar
โ main framework filesdefault.venv.rar
โ Python environment for the TTS server-
TTSModel.rar
โ pretrained XTTS model -
Extract all archives to the same directory. After extraction:
-
Inside
UnityNeuroSpeech/
youโll find:- a
.unitypackage
- a
Server/
folder - a
run_server.bat
file
โ ๏ธ Do not import theServer
folder or.bat
file into Unity. Keep them outside the project directory.
- a
-
Place your
.wav
voice files intoServer/Voices
.
Each file must follow the naming pattern:en_voice.wav
,ru_voice.wav
, etc. -
Move the extracted
.venv
folder (fromdefault.venv.rar
) into theServer/
folder. -
Move the extracted
TTSModel/
folder (fromTTSModel.rar
) into theServer/
folder as well. -
Import the
.unitypackage
into your Unity project. -
In the
UnityNeuroSpeech
folder, youโll see an emptyWhisper/
folder. Drop your Whisper.bin
model file into it.
Some folders (like
Whisper/
) may contain.txt
placeholder files.
These are only used to ensure Unity exports the folder. You can safely delete them after setup.
You can also manually install your own Python environment and download the XTTS model separately.
But if you want everything to "just work" without fighting with pip, PATH, or broken dependencies โ use the provided.venv
andTTSModel
.
โ Done! Youโre ready to build your first talking AI agent.