Intelligent Document Search Using Semantic Indexing

This is a demo of an intelligent resume search that uses Latent Semantic Indexing (LSI) to build a semantically ordered index of 100 resumes. The index can be queried either by keyword or by supplying a reference resume to match against. The 5 resumes with the highest similarity scores are returned.
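
To make the idea concrete, here is a minimal, dependency-free sketch of LSI-style search. It is not the demo's actual implementation. The five sample resumes, the function names (`build_index`, `search`, and the helpers), and the choice of k=3 latent dimensions are all assumptions made for illustration. The truncated SVD is approximated with plain power iteration so the example runs on the standard library alone; a real system would more likely use a linear algebra or topic modeling library.

```python
import math
import random
from collections import Counter

def tokenize(text):
    return text.lower().split()

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def term_document_matrix(docs):
    """Return (rows, vocab_index): one raw term-count row per document."""
    vocab = sorted({w for d in docs for w in tokenize(d)})
    vocab_index = {w: i for i, w in enumerate(vocab)}
    rows = []
    for d in docs:
        row = [0.0] * len(vocab)
        for w, c in Counter(tokenize(d)).items():
            row[vocab_index[w]] = float(c)
        rows.append(row)
    return rows, vocab_index

def top_right_singular_vectors(A, k, iters=200, seed=0):
    """Approximate the top-k right singular vectors of A by power
    iteration on A^T A, removing already-found directions each step.
    Assumes k does not exceed the rank of A."""
    rng = random.Random(seed)
    n_docs, n_terms = len(A), len(A[0])
    basis = []
    for _ in range(k):
        v = [rng.random() for _ in range(n_terms)]
        for _ in range(iters):
            Av = [dot(row, v) for row in A]                      # A v
            v = [sum(A[d][t] * Av[d] for d in range(n_docs))     # A^T (A v)
                 for t in range(n_terms)]
            for b in basis:                                      # deflation
                p = dot(v, b)
                v = [x - p * y for x, y in zip(v, b)]
            norm = math.sqrt(dot(v, v))
            v = [x / norm for x in v]
        basis.append(v)
    return basis

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0

def build_index(docs, k):
    """Project every document into the k-dimensional latent space."""
    A, vocab_index = term_document_matrix(docs)
    basis = top_right_singular_vectors(A, k)
    doc_coords = [[dot(row, v) for v in basis] for row in A]
    return vocab_index, basis, doc_coords

def search(index, query_text, top_n=5):
    """Rank documents by cosine similarity to the query in latent space.
    The query can be a few keywords or the full text of a reference resume."""
    vocab_index, basis, doc_coords = index
    q = [0.0] * len(vocab_index)
    for w in tokenize(query_text):
        if w in vocab_index:          # words unseen at index time are ignored
            q[vocab_index[w]] += 1.0
    q_coords = [dot(q, v) for v in basis]
    scores = [(i, cosine(q_coords, d)) for i, d in enumerate(doc_coords)]
    return sorted(scores, key=lambda s: s[1], reverse=True)[:top_n]

# Hypothetical stand-ins for the indexed resumes.
resumes = [
    "python developer machine learning python",
    "java developer backend services",
    "machine learning engineer statistics",
    "accountant finance audit tax",
    "finance analyst tax reporting",
]

index = build_index(resumes, k=3)
results = search(index, "machine learning", top_n=2)
for doc_id, score in results:
    print(doc_id, round(score, 3), resumes[doc_id])
```

Passing the full text of a reference resume as `query_text` works the same way as a keyword query, since both are projected into the same latent space before scoring.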

Automatic Text Classification

We trained a Multinomial Naive Bayes classifier to classify any text into one of 20 categories. The classifier was trained on a dataset of 8000 emails and assigns each email to one of these 20 categories based on its subject and body text. It is hosted in a web portal at http://52.45.171.205:3500 for those who are interested in trying it out.
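
As a rough illustration of how such a classifier works, the sketch below implements Multinomial Naive Bayes with Laplace smoothing using only the standard library. It is not the hosted model. The class name `TinyMultinomialNB`, the six training emails, and the three category labels are hypothetical stand-ins for the real 8000-email, 20-category dataset, and the whitespace tokenizer is an assumption.

```python
import math
from collections import Counter, defaultdict

def email_text(subject, body):
    # Subject and body are combined into one bag of words.
    return f"{subject} {body}".lower().split()

class TinyMultinomialNB:
    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing strength

    def fit(self, emails, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for (subject, body), label in zip(emails, labels):
            self.word_counts[label].update(email_text(subject, body))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, subject, body):
        words = [w for w in email_text(subject, body) if w in self.vocab]
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            # log prior + sum of smoothed log likelihoods
            score = math.log(n_docs / total_docs)
            for w in words:
                score += math.log((counts[w] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

# Hypothetical training data: (subject, body) pairs with made-up categories.
emails = [
    ("Invoice overdue", "please send payment for the invoice"),
    ("Payment receipt", "your payment was received"),
    ("Password reset", "cannot log in to my account"),
    ("Login error", "the app crashes when I log in"),
    ("Team meeting", "agenda for the weekly meeting"),
    ("Reschedule meeting", "can we move the call to Friday"),
]
labels = ["billing", "billing", "support", "support", "scheduling", "scheduling"]

model = TinyMultinomialNB().fit(emails, labels)
billing_guess = model.predict("Payment question", "when is the invoice due")
support_guess = model.predict("Help", "cannot log in")
print(billing_guess, support_guess)
```

Working in log space avoids numeric underflow on long emails, and the smoothing term keeps a word that never appeared in a category from driving that category's probability to zero.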