Skip to content

Finetuned word2vec embedding on Indian new corpus | Solves #14

Solves #14.

Contributions:

  1. Created a new news Indian news corpus by scraping news from Indian news handles.
  2. Created train, test and validation dataset from this corpus for sentiment analysis task.
  3. Created a model to compare results of finetuned google Word2vec embeddings on Indian news corpus and non-fintuned embeddings.

Results:

  1. Testing Acc for finetuned model: 0.8396990746259689
  2. Testing Acc for not finetuned model: 0.5267650485038757
Edited by Himanshu Maheshwari

Merge request reports