← Back to cohort

Farhan Ali Awan

COMSATS · 2025
Email
Phone
LinkedIn
https://pk.linkedin.com/in/farhan-ali-0b4282259
GitHub

Academic

Program
BACHELOR OF SCIENCE IN DATA SCIENCE
CGPA
Year
2025
Education
COMSATS University Islamabad
Address
Islamabad, Pakistan
DOB

Career

Current role
Data Engineer Intern (BLUTECH CONSULTING BTC)
Target role
Skills
Python, R, PostgreSQL, Microsoft Excel, Machine Learning, Data Analysis, Visualization, Power BI, Matplotlib, Seaborn, PySpark, Google Colab, ETL Pipelines, SQL, Mongo DB, Microsoft Word, Pandas, Numpy, Scipy, Data Visualization, NLP, CNN

Verbatim text

The exact text the LLM saw on the page (or the booklet text from the old import). This is what powers semantic search.
ABOUT ME 
I am a Data Science enthusiast with strong skills in Python, R, PostgreSQL, and Microsoft Excel. My expertise 
includes machine learning model development, data analysis, and visualization using tools such as Power BI, 
Matplotlib, and Seaborn, PySpark along with collaborative work in Google Colab. I am passionate about extracting 
actionable insights from data and building efficient, real-world solutions. With strong problem-solving abilities and a 
continuous learning mindset, I am committed to leveraging technology to make impactful contributions. 
 EDUCATION AND TRAINING 
07/02/2022 – CURRENT islamabad, Pakistan 
BACHELOR OF SCIENCE IN DATA SCIENCE Comsats University Islamabad 
Website https://www.comsats.edu.pk/  Level in EQF EQF level 6 
 
07/04/2019 – 06/06/2021 Islamabad, Pakistan 
FSC Scienta Vision College 
Website https://scientavision.edu.pk/  Level in EQF EQF level 4 
 WORK EXPERIENCE 
 BLUTECH CONSULTING BTC. – ISLAMABAD, PAKISTAN 
DATA ENGINEER INTERN – 13/07/2025 – 14/08/2025 
During my internship at BTC, I gained hands-on experience in data engineering by working with PostgreSQL, where I 
applied advanced SQL concepts such as window functions, indexing, and query optimization. I developed ETL 
pipelines using PySpark, incorporating techniques like Slowly Changing Dimensions (SCDs) and Change Data 
Capture (CDC) to manage historical and incremental data. This role strengthened my expertise in data warehousing, 
pipeline optimization, and big data processing, while improving my problem-solving and collaboration skills in a 
real-world environment. 
 SKILLS 
data extraction, transformation and loading tools 
ETL Pipelines 
machine learning 
Microsoft Word
 
Microsoft Excel Python (Pandas, Numpy, Matplotlib, Scipy, 
PySpark) 
 No SQL(Mongo DB) 
Languages 
PostgreSQL 
SQL 
R 
Python (computer programming) 
Data Visualization 
Power BI 
 PROJECTS 
20/03/2025 – 25/03/2025 
Twitter Sentiment Analysis Using CNN & Machine Learning Models 
Developed a sentiment analysis model to classify tweets as positive or negative using NLP techniques. The project 
involved data preprocessing, tokenization, stopword removal, lemmatization, and TF-IDF vectorization. Experimented with 
Bernoulli Naïve Bayes, SVM, and Logistic Regression, selecting the best-performing model. Additionally, implemented a 
CNN-based deep learning model to improve classification accuracy. The final system provides real-time insights into public 
sentiment on social media platforms. 
 
Farhan Ali Awan 
 
 Pakistani 
 Male 
 
 
 
 
 
Website:  https://pk.linkedin.com/in/farhan-ali-0b4282259 
Address: House No 39 St 93, 44220, Islamabad, Pakistan (Home) 
102

AI enrichment

Farhan Ali Awan is a Data Science undergraduate with internship experience in data engineering, specifically focusing on ETL pipelines and PostgreSQL optimization. He possesses practical skills in Python, SQL, and machine learning, demonstrated through academic projects involving sentiment analysis and deep learning models.
Skills (AI)
["Python", "SQL", "PostgreSQL", "PySpark", "Machine Learning", "ETL Pipelines", "Power BI", "R", "MongoDB", "Data Visualization", "NLP", "Deep Learning", "Pandas", "NumPy"]
Status: ai_done
Provenance
Source file: Graduate Directory CS Department Fall 2025.pdf
From job #242 page 102
Created: 1778157228