| This is really neat, from both a technical and a language learning perspective. I quite like the idea of eliminating English from the UI for immersion. You could also go with French + icons, so there's a little learning there too. > Is there a cheaper, more scalable way to score sentences than what I’ve described here? I estimate English sentence difficulty for my program, and that consists of some grammar heuristics + seeing which CEFR level each word belongs to and (~roughly, it's a little more complex) averaging their difficulties. I wonder if it'd be worth trying a similar technique to estimate the difficulty/cognate-score. Maybe some process like: - get the words - get their English translation (easier said than done...) - estimate how close they are to a cognate: perhaps a little classical ML here? Alternatively, you could exploit etymology. If the English translation has either French or Latin etymological roots, they're probably cognates. Or (as I like to do with my project), do all of it and average them together! You might also want to try Claude 3.5 for scoring - it's about as expensive as 4o, and might be more effective. Amazon will also give you free credit via Bedrock and AWS Activate. > Users should be able to understand how the app works without reading an entire blog post about it. I have precisely this problem. I think I've distilled my tool (https://nuenki.app) down to something simple, and the HN audience seems to think so, but then I show it to non-technical people and it all goes wrong. Maybe look into how Lingua Latina per se Illustrata is presented? Maybe try streaks for gamification? They seem to work well in Anki and Duolingo. |