Streaming Inference Example (No GPU)

You can stream audio without a GPU by using orpheus-cpp, a llama.cpp-based backend for the Orpheus TTS model that runs entirely on the CPU.

  1. Install orpheus-cpp

    pip install orpheus-cpp
  2. Install llama-cpp-python

    Linux/Windows

    pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu

    macOS with Apple Silicon

    pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/metal
  3. Run the example below:

    from scipy.io.wavfile import write
    from orpheus_cpp import OrpheusCpp
    import numpy as np
    
    orpheus = OrpheusCpp(verbose=False, lang="en")
    
    text = "I really hope the project deadline doesn't get moved up again."
    buffer = []
    for i, (sr, chunk) in enumerate(orpheus.stream_tts_sync(text, options={"voice_id": "tara"})):
        buffer.append(chunk)
        print(f"Generated chunk {i}")
    
    # Each chunk is a 2-D array; join them along the sample axis,
    # then drop the channel axis before writing the 24 kHz WAV file.
    audio = np.concatenate(buffer, axis=1)
    write("output.wav", 24_000, audio.squeeze())
  4. WebRTC Streaming Example:

    python -m orpheus_cpp

    2025-03-26_10-37-56.mp4