Hacker News new | ask | show | jobs
by tuxlinuxien 3325 days ago
Are you using 100 threads or coroutines? because 100 threads on a single machine will no give you nice performance at all.

I have started working a crawler in Go and each website to crawl can be configured via YAML files. The quantity of code written is a bit bigger than the python one made before, but a single micro instance on amazon is way enough to "read" few hundred websites every hour. Personally, I think the goroutine system is way easier to deal with compared to the Python one.

So if I was you, I would definitely give the IO processing to golang, and let python deal with the data processing.