by jmiseikis 1524 days ago
Thanks for your answer!

Tailoring to these areas mainly comes from my personal focus, but at the same time, in the last 2 years sharing from research (both universities and companies/corporates) has leaped ahead dramatically in terms of providing code, models and sometimes datasets.

GitHub is so far the main source, but models and data are usually linked from many different places due to large file sizes. And running/testing the approaches and models is often still quite cumbersome due to different versions and so on. Shared Google Colab notebooks are probably the least "painful" way to test out the methods at the moment.

That's the motivation to ask around and see if there may be other formats/platforms that would make such sharing even easier, reducing the "engineering" overhead in testing and helping researchers and developers focus more on the research part, advancing the field even further :)

1 comment

Thanks for elaborating!

> And running/testing the approaches and models is often still quite cumbersome due to different versions and so on.

I definitely agree with this. I don’t know that I have any particular insight on infrastructure, but I think one of the things that makes Hugging Face and SMP [0] successful is a well-defined common model and data API.

It can be hard to clone a random research repo and train its models on custom datasets because the model API is slightly different, or it expects the data in a different format. I think this is a problem in graph neural networks right now, where different libraries use different data formats, and some groups roll their own.
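
To make the "common model and data API" point a bit more concrete, here is a rough sketch in plain Python. Everything in it is made up for illustration (`Graph`, `from_adjacency`, `from_coo`, `GraphModel`, `OutDegreeModel`, `evaluate`); it is not how Hugging Face, SMP or any graph library actually does it. The idea is just that each repo's own data format gets a small adapter into one shared format, and models only have to agree on one method.

```python
from dataclasses import dataclass
from typing import Dict, List, Protocol, Tuple


@dataclass(frozen=True)
class Graph:
    """Hypothetical shared format: node count plus a tuple of directed edges."""
    num_nodes: int
    edges: Tuple[Tuple[int, int], ...]


def from_adjacency(adj: Dict[int, List[int]]) -> Graph:
    """Adapter for code that stores graphs as {node: [neighbours]}."""
    nodes = set(adj) | {v for vs in adj.values() for v in vs}
    edges = tuple((u, v) for u in sorted(adj) for v in adj[u])
    return Graph(num_nodes=max(nodes) + 1 if nodes else 0, edges=edges)


def from_coo(src: List[int], dst: List[int], num_nodes: int) -> Graph:
    """Adapter for code that stores graphs as two parallel index lists."""
    if len(src) != len(dst):
        raise ValueError("src and dst must have the same length")
    return Graph(num_nodes=num_nodes, edges=tuple(zip(src, dst)))


class GraphModel(Protocol):
    """Any object with this method can be used by downstream code."""
    def predict(self, graph: Graph) -> List[float]: ...


class OutDegreeModel:
    """Toy stand-in for a real model: scores each node by its out-degree."""
    def predict(self, graph: Graph) -> List[float]:
        scores = [0.0] * graph.num_nodes
        for u, _ in graph.edges:
            scores[u] += 1.0
        return scores


def evaluate(model: GraphModel, graph: Graph) -> List[float]:
    # Downstream code depends only on the shared interface,
    # not on where the graph or the model came from.
    return model.predict(graph)
```

With something like this, `from_adjacency({0: [1, 2], 1: [2]})` and `from_coo([0, 0, 1], [1, 2, 2], num_nodes=3)` produce the same `Graph`, so the same model and evaluation code run on data from either source without per-repo glue.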

0: https://segmentation-models-pytorch.readthedocs.io/en/latest...