Hacker News new | ask | show | jobs
by tmikaeld 902 days ago
> Try coming up with a dataset that doesn’t have any copyrighted material in them.

Isn't this what Mistral AI did?

1 comments

Did they? That'd be interesting to take a look at. Do they publish contents of their dataset?
The RAW Weights here: https://docs.mistral.ai/models/