This project is based on NLP techniques for checking the similarity between a pair of questions.
We used four text representation techniques and two boosting algorithms:
- Bag Of Words (BOW)
- TF-IDF (Term frequency and inverse document frequency)
- Word2Vec
- GloVe
- XGBoost
- AdaBoost
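
As an illustration of the simplest technique in the list, a Bag of Words similarity score for a question pair might look roughly like the sketch below. This is not the project's actual code: the function names `tokenize` and `bow_cosine` are hypothetical, and whitespace tokenization with basic punctuation stripping is an assumption made to keep the example dependency-free.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation (assumed scheme)."""
    tokens = (word.strip(".,?!;:\"'()") for word in text.lower().split())
    return [t for t in tokens if t]


def bow_cosine(q1, q2):
    """Cosine similarity between the word-count vectors of two questions."""
    c1, c2 = Counter(tokenize(q1)), Counter(tokenize(q2))
    # Only words present in both questions contribute to the dot product.
    dot = sum(c1[w] * c2[w] for w in c1.keys() & c2.keys())
    norm1 = math.sqrt(sum(v * v for v in c1.values()))
    norm2 = math.sqrt(sum(v * v for v in c2.values()))
    if norm1 == 0 or norm2 == 0:
        return 0.0  # an empty question has no defined direction
    return dot / (norm1 * norm2)
```

For example, `bow_cosine("How do I learn Python?", "How do I learn Java?")` gives approximately 0.8, because the two questions share four of their five words. A score like this could then be passed as a feature to a classifier such as XGBoost or AdaBoost.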
GloVe and Word2Vec gave the best results. Because we did not have a powerful system, we were not able to take GloVe to its full potential.
The dataset we used was taken from Kaggle and is based on the Quora website.
We are currently working on deep learning techniques to improve the model.