Skip to content

๐Ÿš€ Quick Start


โœ… Unity Requirements

  1. com.whisper.unity must be installed. You can get it from the official GitHub repository.
  2. Add two sprites to your project (these are used for the "microphone enabled" and "microphone disabled" states).
  3. UnityNeuroSpeech is developed for Unity 6 (6000.0.47f1). Other versions might work, but Unity 6 is highly recommended.

๐Ÿ›  Other Requirements

  1. Ollama installed (along with any LLM you want to use).
  2. Python 3.11 installed.
  3. A downloaded Whisper model.
  4. At least one .wav file for voice cloning.

๐Ÿ’ก What are these tools?

  • Ollama is a platform for running large language models (LLMs) locally. You can use models like DeepSeek, Gemma, Qwen, etc. Avoid small modelsโ€”they might affect accuracy and context understanding.
  • Python 3.11 โ€” use this specific version because it's stable and tested with this framework.
  • Whisper is a speech-to-text model. You can use lightweight versions like ggml-tiny.bin for speed, or heavier ones like ggml-medium.bin for better accuracy.

โš™๏ธ Base Setup

  1. Go to the UnityNeuroSpeech GitHub repository and download the following three .rar files from the latest Release:
  2. UnityNeuroSpeech.X.X.X.rar โ€“ main framework files
  3. default.venv.rar โ€“ Python environment for the TTS server
  4. TTSModel.rar โ€“ pretrained XTTS model

  5. Extract all archives to the same directory. After extraction:

  6. Inside UnityNeuroSpeech/ youโ€™ll find:

    • a .unitypackage
    • a Server/ folder
    • a run_server.bat file
      โš ๏ธ Do not import the Server folder or .bat file into Unity. Keep them outside the project directory.
  7. Place your .wav voice files into Server/Voices.
    Each file must follow the naming pattern: en_voice.wav, ru_voice.wav, etc.

  8. Move the extracted .venv folder (from default.venv.rar) into the Server/ folder.

  9. Move the extracted TTSModel/ folder (from TTSModel.rar) into the Server/ folder as well.

  10. Import the .unitypackage into your Unity project.

  11. In the UnityNeuroSpeech folder, youโ€™ll see an empty Whisper/ folder. Drop your Whisper .bin model file into it.

Some folders (like Whisper/) may contain .txt placeholder files.
These are only used to ensure Unity exports the folder. You can safely delete them after setup.


You can also manually install your own Python environment and download the XTTS model separately.
But if you want everything to "just work" without fighting with pip, PATH, or broken dependencies โ€” use the provided .venv and TTSModel.


โœ… Done! Youโ€™re ready to build your first talking AI agent.