← Back to cohort

Ahmed Bhan

NUST · 2026

ahmedbhan141@gmail.com

Phone

03701206236

https://www.linkedin.com/in/ahmed-bhan-83267b1b8/

GitHub

—

Academic

Program

BEE

CGPA

3.38

Year

2026

Education

SEECS

Address

HOUSE NO:3 NEAR EAST RAILWAY CABIN LATIF TOWN TANDOJAM , Tandojam , Pakistan

DOB

—

Career

Current role

—

Target role

—

Skills

DevOps, AI Systems, MLOps, LLM agents, Large Action Models, Kubernetes, AWS, Reinforcement Learning, GenAI, CI/CD, Computer Vision, Deep Learning, Full-stack Engineering, NVIDIA Triton Inference Server, TensorRT, Docker, PyTorch, TensorFlow, ONNX, Hugging Face, LangChain, LangGraph, LlamaIndex, RAG, ChromaDB, MobileNet, FaceNet, rPPG, EKS, EC2, Lambda, S3, CloudFront, GitHub Actions, Microservices

Verbatim text

The exact text the LLM saw on the page (or the booklet text from the old import). This is what powers semantic search.

Ahmed Bhan
Cell: 03701206236 | Email: ahmedbhan141@gmail.com
LinkedIn: https://www.linkedin.com/in/ahmed-bhan-83267b1b8/
Address: HOUSE NO:3 NEAR EAST RAILWAY CABIN LATIF TOWN TANDOJAM , Tandojam , Pakistan
PROFESSIONAL PROFILE
DevOps and AI Systems Engineer building the future of intelligent infrastructure. Architect production MLOps pipelines, autonomous
LLM agents, and Large Action Models that bridge human intent with system execution. Expert in cloud-native orchestration
(Kubernetes, AWS), reinforcement learning for adaptive systems, advanced GenAI frameworks, and CI/CD automation - synthesizing
computer vision, deep learning, and full-stack engineering to create self-evolving infrastructure that learns, adapts, and scales.
EDUCATION
Bachelor of Electrical Engineering
SEECS , Islamabad , 3.38 (2026)
INTERNSHIP EXPERIENCE
3dim engineering solutions
17-Feb-2025 - 30-Jun-2025
Architected and deployed NVIDIA Triton Inference Server on Kubernetes clusters with auto-scaling policies, achieving 1000x latency
reduction through TensorRT optimization (INT8/FP16 quantization) Optimized LeYOLO for small-object detection on drone datasets,
deploying containerized workloads for edge devices with multi-stage Docker builds reducing image sizes by 60%
FINAL YEAR PROJECT
Baymax: Your Personal Healthcare Assistant
Architected multi-modal AI infrastructure on NVIDIA Jetson Nano, orchestrating Med-Gemma-4B via LangChain for context-aware
health summaries by fusing vision, language, and sensor data streams. Deployed MobileNet and FaceNet with TensorRT
optimization for real-time activity recognition and identity veriﬁcation, achieving less than 100ms inference latency through GPU
resource management. Implemented RAG pipeline using ChromaDB for patient history context with "Privacy-by-Design" ephemeral
data processing and secure credential handling. Built containerized rPPG pipeline for contactless vital sign extraction.
TECHNICAL EXPERTISE
Cloud Infrastructure & DevOps Engineering
Expert in AWS cloud services (EKS, EC2, Lambda, S3, CloudFront) and container orchestration using Kubernetes and Docker.
Proﬁcient in building CI/CD pipelines with GitHub Actions, and managing production deployments with zero-downtime rolling updates.
Experienced in microservices architecture.
MLOps & AI Infrastructure
Specialized in production ML deployment pipelines using NVIDIA Triton Inference Server, TensorRT optimization, and model
quantization (INT8/FP16). Proﬁcient in PyTorch, TensorFlow, ONNX, and Hugging Face frameworks. Expert in deploying ML models
on edge devices and cloud infrastructure with auto-scaling poli ...
Generative AI & Large Language Models
Advanced expertise in LLM orchestration using LangChain, LangGraph, and LlamaIndex for building autonomous agents and RAG
systems. Experienced with GPT, Llama, Gemini, Grok, and Gemma models. Skilled in developing Large Action Models (LAMs) for
complex automation, multi-agent systems, and context-aware AI app ...
Computer Vision & Deep Learning

AI enrichment

Ahmed Bhan is a Bachelor of Electrical Engineering student graduating in 2026 with a focus on DevOps and AI Systems Engineering. He has internship experience deploying optimized AI models on Kubernetes and edge devices, alongside a final year project involving multimodal healthcare AI infrastructure.

Skills (AI)

["Kubernetes", "AWS", "Docker", "NVIDIA Triton", "TensorRT", "LangChain", "PyTorch", "Computer Vision", "MLOps", "CI/CD", "Python", "RAG", "LLMs"]

Status: ai_done

Provenance

Source file: SEECS - Electrical Engineering-2026.pdf
From job #259 page 62
Created: 1778168427