Hacker News new | ask | show | jobs
by kaitai 2674 days ago
I've been playing with topological methods for data analysis recently and I think there are some fruitful things happening. Seems like there are some ideas emerging from theory to practice which might be useful (Betti curves, multiparameter persistence, etc) but they're not quite there yet.

Another idea that's been intriguing me lately is applied sheaf theory. Robinson, Ghrist, and Curry are the only people I see working on this but I don't know what I'm not seeing. The "big idea" is taking local data and seeing if it patches together to a global coherent whole or not. Sometimes it doesn't (old Russian example: arbitrage in currency exchange networks). Or sometimes it's about using interpolation to fill in missing data, if you know that it's something for which there is a global function (temperatures across ocean surfaces), or providing a probability distribution for the missing parts. Category theory has something to offer here as well.

Anyone know more about any of these things?

2 comments

Look into OpenCog - there's some sheaf-theoretic NLP stuff going on there. There's a recent high-level overview by Linas Vepstas you can find on the ArXiv somewhere. There's a project called SheafSystem also, which is a sheaf-based database for scientific computing (I've never used it). I have some ideas I'm working on in this area also (not affiliated with these parties in any way, and the ideas are not ready to share yet, unfortunately.

What's your background, out of curiosity?

"The big idea is taking local data and seeing if it patches together."

I wonder if those new techniques are different from the usually used like: weighted averages of random forest, cross entropy of density estimation, minimization of variance between local estimations and the like.