Well, alternative data in general is anonymized and absolutely does not contain any personal info (even because PII is useless for hedge funds, they need to see trends not sell something to people).
Unless it’s proprietary data (or data acquired from third parties and elaborated), the other source is mainly web scraping and this is regulated. You need to have the rights to scrape this data, which it means that it’s public data