DramaBox is an open text-to-speech model built around directable speech. Instead of only feeding it words to read, you describe the speaker and the scene, put the spoken dialogue inside quotes, and leave performance notes outside the quotes.
That makes the workflow feel closer to writing a short script than setting a basic emotion control. For example, you can ask for a calm narrator who starts warmly, pauses, laughs under their breath, and then drops into a tense whisper. The model tries to turn those instructions into delivery, not spoken text.
The release is also useful because it is not only a polished hosted demo. The model weights are public, the Space is public, and the GitHub repo includes Python, CLI, and Gradio paths for local testing. That matters for readers who want to move from a browser experiment to their own speech workflow.