@fiore fwiw, while looking into matrix hosting, i found that a webRTC connection over relays (see TURN) was the preferred method of setting up VOIP.
while it would sidestep stock mumble quite substantially, if the architecture is designed to be extensible enough, i don't see why a mumble server couldn't just negotiate a relay link between participants to support even a simple video feed!
