Sounds like the text file needs vocals and pitches along with time stamps. AI is getting there to allow automating it's creation.
For myself: Adding a link I just found for reading further.
https://www.reddit.com/r/karaoke/comments/x61kzy/modern_equi...