YEAH!! YEAH YEAH!!!

A hideous fruit, disgracing itself.
Allo-Aro
How do you imagine this working for music or spoken word, or for more complex audio setups like the recent crop of "interactive audio posts" with 3 or 4 tracks meant to be played in tandem?
I feel like at a certain point you're just creating a feature whose job would be better done in the body text of a post. At some point, wouldn't this just be a transcription (e.g. for 3 minutes of spoken word content)?