Convert and separate audio using models and TTS
Separate audio into stems using various models
Launch a web interface for model interaction