Notifications

Notification are automatically managed by the application. It will return playback once the notification has played. Either play a mp3 or a spoken message.

Play a mp3

To play a short music file as a notification send the following payload to command topic. You can send multiple notifications after each other, and they will be played after each other.

Topic: sonos/uuid_of_speaker/control

Payload:

{
  "command": "notify",
  "input": {
    "trackUri": "https://cdn.smartersoft-group.com/various/pull-bell-short.mp3",
    "onlyWhenPlaying": false,
    "timeout": 10,
    "volume": 8,
    "delayMs": 700
  }
}

Experimental notification queue

There is a second implementation of the notification queue, that you can use with the following command. You should not mix them, but you we are curious to what you think

Topic: sonos/uuid_of_speaker/control

Payload:

{
  "command": "notifytwo",
  "input": {
    "trackUri": "https://cdn.smartersoft-group.com/various/pull-bell-short.mp3",
    "onlyWhenPlaying": false,
    "timeout": 10,
    "volume": 8,
    "delayMs": 700
  }
}

Play notification on all speakers

You can also have a notification play on all groups (all devices that coordinate themself), just send the following message to sonos/cmd/notify.

{
  "trackUri": "https://cdn.smartersoft-group.com/various/pull-bell-short.mp3",
  "onlyWhenPlaying": false,
  "timeout": 10,
  "volume": 8,
  "delayMs": 700
}

Notification explained

Internally the notifications work as following.

Get the current state
Check if it is currently playing, and cancel if not playing and you specified onlyWhenPlaying: true.
Play the sound
Wait for playback to stop (or the timeout to expire)
Restore all previously fetched status (Track, queue, volume,…)

More information about notifications

sequenceDiagram;
    participant user as User
    participant s2m as Sonos2MQTT
    participant s as Sonos Speaker
    user-->>s2m: Play notification url
    s2m->>s: What is the current position?
    s->>s2m: Here you go
    s2m-->>s2m: Save playback snapshot
    Note right of s2m: Current Track, Volume, Position, Mute & Metadata
    s2m->>s: Set track uri, volume etc.
    s->>s: Play notification track
    s2m-->>s2m: Wait for event or timeout
    s->>s2m: Playback stopped (event)
    s2m-->>s: Play next notification
    s-->>s2m: Playback stopped (event)
    s2m->>s: Restore full state

Text to speech

You can have your sonos speaker prononce some notification text, which is a pretty cool feature. But you’ll need some extra work. You’ll need a text-to-speech endpoint as described here. You have two options either host your own server or become a sponsor and get access to my personal hosted TTS server.

Either way you will have a TTS endpoint at hand. You can set it in the configuration or supply it with every request.

Option	Value	Environment variable	Config
Default language	`en-US`	`SONOS_TTS_LANG`	`--ttslang`
Default endpoint	`http://some-server.domain.com/api/generate`	`SONOS_TTS_ENDPOINT`	`--ttsendpoint`

Have a speaker speak by sending the following to sonos/uuid_of_speaker/control. Endpoint is optional (if set in environment), lang is options if set in config, gender, name, volume & onlyWhenPlaying are always optional.

{
  "command":"speak",
  "input": {
    "text": "Someone at the front-door",
    "endpoint": "https://your.tts.endpoint/api/generate",
    "lang": "en-US",
    "gender": "male",
    "name": "Salli",
    "volume": 50,
    "onlyWhenPlaying": false,
    "delayMs": 700
  }
}

Experimental tts queue

There is a second implementation of the notification queue (used by TTS), just change speak to speaktwo in the above command. You should not mix them, but you we are curious to what you think

Text to speech on all speakers

Send this payload to sonos/cmd/speak to play it on all groups. Same parameters as above.

{
  "text": "Someone at the front-door",
  "endpoint": "https://your.tts.endpoint/api/generate",
  "lang": "en-US",
  "gender": "male",
  "name": "Salli",
  "volume": 50,
  "onlyWhenPlaying": false,
  "delayMs": 700
}

Text to speech explained

The text-to-speech method executes the following:

Ask the TTS endpoint what the url of the supplied text is.
If the server doesn’t have this file, it will generate the mp3 file on the fly.
The TTS endpoint returns the url of the mp3.
We call the .PlayNotification({}) command above, with the tts url.

sequenceDiagram;
    participant user as User
    participant s2m as Sonos2MQTT
    participant tts as Text-to-speech server
    participant api as Remote service
    user->>s2m: Play text-to-speech
    s2m->>+tts: HTTP post /api/generate
    tts-->>tts: Have mp3 for text?
    tts->>+api: Generate mp3 for text
    api->>-tts: Here you go
    tts->>tts: Save file locally
    tts->>-s2m: Url for mp3 file
    s2m->>s2m: Play notification url (see above)

This way you don’t have to worry about encoding the text so sonos understands it. Sonos will just get a regular url to the mp3 file with the spoken text.

The server I’ve build is based on Amazon Polly, but I invite eveybody to build their own if you want to support an other tts service. You can replace it with any other TTS service as long as it expects a post request with the data below and responds with a json message with either uri or cdnUri. The sonos speaker needs the .mp3 at the end to be able to play the file smoothly.

TTS API

Request POST to https://your-tts-api.com/api/generate

{
  "text": "Hello from polly",
  "lang": "en-US",
  "gender": "male"
}

Response

{
  "cdnUri": "https://cacheUri/en-US/4b6eddb411d4cec3933528bfca05341828ca7593.mp3",
  "uri": "http://your_ip:5601/cache/en-US/4b6eddb411d4cec3933528bfca05341828ca7593.mp3"
}