← Back to cohort

Ali Nauman Qureshi

FAST · 2020 · i16 - 0138
Email
Phone
LinkedIn
GitHub

Academic

Program
CGPA
Year
2020
Education
Address
DOB

Career

Current role
Target role
Skills
Python, TensorFlow, Gensim, Google Colab, Google Cloud, Word2Vec, GloVe, fastText, ELMo, BERT, Natural Language Processing

Verbatim text

The exact text the LLM saw on the page (or the booklet text from the old import). This is what powers semantic search.
Comparative Analysis of Different Word Embedding Techniques 
 
In this project, we trained different word-embedding models using the Word2Vec, GloVe, fastText, 
ELMo and BERT techniques on Urdu and Roman Urdu corpora. Each model was trained using the 
same set of parameters. The embeddings learnt by these models were evaluated using a number of 
Natural Language Processing tasks. Our comparison consisted of both a Qualitative Analysis as well 
as a Quantitative Analysis. 
For the Qualitative analysis, we plotted clusters of words regarded to be similar by these models 
and drew conclusions based on some characteristics of those graphs. For the Quantitative Analysis, we used the models to perform a Word Similarity task on the WordSim-353 and SimLex-999 
benchmark datasets. A Spearman’s Correlation Coefficient was calculated between the similarity 
scores given in the datasets and the ones generated by these models. 
Furthermore, the Roman Urdu models were also evaluated using a Sentiment Analysis task over a 
Roman Urdu Tweets dataset. The embeddings from these models were fed to a Neural Network 
which was fine-tuned for this task. The BERT model trained over Urdu was also evaluated using a 
Sentence Classification task on the XNLI dataset. Our model was able to reach the baseline score of 
56.6, which was achieved by the Google Research Group’s Urdu model. 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Technology Used: 
Python, TensorFlow, Gensim,  
Google Colab, Google Cloud 
Supervisor Name: 
Ms. Mehreen Alam 
Group Members:   
Abubakar Ijaz (i16 - 0123) 
Ali Nauman Qureshi (i16 - 0138) 
Muhammad Raafey Tariq (i16 - 0259)
Provenance
Source file: Graduate Directory FAST School of Computing 2020 (Final Complete) (1).pdf
From job #23 page 188
Created: 1778223765