I wondered the same thing. If the encoding was completely dense such that all possible outputs were valid, would the network express more complex compositional behavior by avoiding the need for binary feedback about syntax validity?
I think the main benefits of the abc midi syntax are that it is already a very dense encoding and has an established library of music to train on.
I think the main benefits of the abc midi syntax are that it is already a very dense encoding and has an established library of music to train on.