Hacker News new | ask | show | jobs
by sajeevaravind 3721 days ago
User has to connect his services to Vaultedge. Services can be cloud storage like dropbox and google drive, emails like gmail and outlook and offline storage like laptops and usb disks. Vaultedge will periodically check for any new files (docs only now) in these services and classifies them using Machine learning and also builds a search index for these files. Other than giving oAuth read access to these services, user need not do anything.

Thanks for asking. Happy to answer any follow up question you may have.

1 comments

Sorry for not being clear, I am curious regarding how my phone bill becomes searchable.
Short answer is Vaultedge will create search index based on the content of the bill and also has additional knowledge that it is a cell phone bill from its classification algorithm. Put together this will allow user to search for "last month's cell phone bill".

Long answer: Let's say your bill is in pdf or html format. For Vaultedge initially it is nothing but a document, it has no idea that it is a bill. Vaultedge will extract the contents of this document and test it against our training data set using Machine Learning(ML). In this process, the document will be identified as a "Bill". Then it will apply it on a different training set which will identify it as a "cell phone" or "cable bill". Then we do further analysis and extract info like this bill is "for the month of march 2016". Now all of these info is used to construct the search index. So you can search now for "march bill" or "last months bill" to get to that bill. Please don't hesitate to clarify if anything is not clear.