@UrsBollhalder It was pretty fast, I used about 10 mins of samples, which I left running overnight so about 12-14 hours total for a relatively accurate resynthesis.
I think the 16k samplerate is for the actual training, once you start going to 44.1 or something higher like 96k the training would take a lot longer...
Probably the same thing for mono vs stereo (half the channels = half the time)