|
|
|
Ask HN: What dataset do you think is useful to you but not readily available
|
|
3 points
by abhikandoi2000
2201 days ago
|
|
We are a bootstrapped team trying to build tools for data extraction. We are currently focusing on tools for data that is semi-structured and thus can be extracted using non deep learning based software. So if you think there is some data that you need (and you are willing to pay for) but it is not readily available, we might be able to help you. We are looking for different types of datasets that are actually useful to people, so that we can work towards a tool that can be generally used for some sort of data extraction. If you think you have such a dataset in mind, do let us know. Also, if you could share a website where we could find the semi-structured version of this dataset that you need, it'd be really helpful. |
|
A huge amount of internet modelling and sampling would improve at scale if we knew this. I've discussed this with researchers in the field. Akamai and Google and Facebook have private information which is their secret sauce.