From fine-tuning LLMs at Google to building autonomous perception systems and leading ML at scale — I architect intelligent systems that ship. 8+ years turning research papers into production-grade AI.
I'm Muhammad Hasnain Khan — a Lead Machine Learning Engineer who builds AI systems that work in the real world, not just in notebooks.
With a Master's in Computer Science (specialization in Deep Learning) from FAST-NUCES and a Bachelor's from the University of Bradford, I've spent 8+ years shipping ML systems across Google, FrontNow, COMPREDICT, and ikeGPS. My work spans LLMs, computer vision, autonomous vehicles, NLP, and multi-sensor fusion.
I've trained large language models at Google for visual storytelling, built RAG-powered conversational AI platforms that boosted engagement by 50%, and deployed real-time perception systems on autonomous vehicles with sub-50ms latency. I'm as comfortable fine-tuning a Qwen 32B model as I am wiring up Kafka streams for real-time inference.
From research prototypes to production-grade systems at scale
Fine-tuning, RAG pipelines, and vision-language models
Sensor fusion, perception, and real-time edge deployment
Leading development of fine-tuned LLMs for a multilingual conversational AI platform. Engineered a hybrid RAG pipeline with FAISS and knowledge graphs, boosting engagement by 50%. Built streaming inference with RabbitMQ and Apache Flink and established MLOps practices with A/B testing, cutting inference latency by 30%.
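The retrieval step of a hybrid RAG pipeline like this one can be sketched in a few lines. This is an illustrative minimal version only: it uses NumPy cosine similarity over toy 4-dimensional vectors in place of a production FAISS index and real sentence embeddings, and the `retrieve` helper and document ids are hypothetical names.

```python
import numpy as np

def retrieve(query_vec, doc_vecs, doc_ids, k=2):
    """Return the k document ids most similar to the query (cosine similarity)."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    scores = d @ q                      # cosine similarity per document
    top = np.argsort(scores)[::-1][:k]  # indices of the k highest scores
    return [doc_ids[i] for i in top]

# Toy embeddings standing in for real encoder output.
docs = np.array([[1.0, 0.0, 0.0, 0.0],
                 [0.0, 1.0, 0.0, 0.0],
                 [0.9, 0.1, 0.0, 0.0]])
ids = ["doc_a", "doc_b", "doc_c"]
hits = retrieve(np.array([1.0, 0.05, 0.0, 0.0]), docs, ids)  # → ['doc_a', 'doc_c']
```

In production the brute-force scan would be replaced by a FAISS index, and the retrieved passages would be merged with knowledge-graph context before generation.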
Built virtual sensor systems for automotive diagnostics using CAN bus and OBD-II data. Predicted and diagnosed vehicle faults to reduce maintenance costs and increase reliability. Integrated ML-powered virtual sensors with cross-functional automotive platforms.
Trained large language models to extract semantics from natural language for visual storytelling. Devised transformer-based pipelines that aggregate multilingual corpora into latent scene embeddings driving 3D GAN-based generation, reducing annotation overhead by 30%.
Built visual attention-based models in PyTorch for novel webpage object detection, leveraging contextual features from ordered web elements via a ResNet101 backbone. Achieved 95% accuracy for product price detection, 8.5% above a Fast R-CNN baseline.
Designed architecture and implemented core features for AR applications. Transformed design specifications into functional apps and established development pipelines and strategy.
Research-driven projects spanning LLMs, autonomous systems, computer vision, and multimodal AI
Engineered a unified perception pipeline fusing camera, LiDAR, and radar via a cross-modal BEV-centric architecture with attention-based alignment and 3D detection heads. Deployed on an autonomous vehicle testbed with model quantization for sub-50ms latency.
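The attention-based fusion at the core of such a pipeline can be illustrated with a minimal sketch: per-modality features are weighted by softmax attention scores and summed. This is a toy NumPy version under stated assumptions, with hypothetical function names and scalar relevance logits standing in for learned attention; a real BEV fusion model operates on full feature maps.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))  # subtract max for numerical stability
    return e / e.sum()

def fuse_modalities(feats, logits):
    """Fuse per-modality feature vectors by softmax-weighted sum.

    feats:  dict of modality name -> feature vector (same length)
    logits: dict of modality name -> scalar relevance score
    """
    names = list(feats)
    w = softmax(np.array([logits[n] for n in names]))
    fused = sum(w[i] * feats[n] for i, n in enumerate(names))
    return fused, dict(zip(names, w))

feats = {"camera": np.array([1.0, 0.0]),
         "lidar":  np.array([0.0, 1.0]),
         "radar":  np.array([0.5, 0.5])}
fused, weights = fuse_modalities(feats, {"camera": 2.0, "lidar": 1.0, "radar": 0.0})
```

The same re-weighting idea lets the network lean on radar when camera and LiDAR degrade, which is what makes the approach robust across conditions.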
Architected a Multimodal LLM integrating a Vision Transformer with a decoder-only LLM. Implemented a projection module to align visual features with word embeddings, enabling conversational VQA, referring expression generation, and multimodal grounding.
Built a novel framework enhancing factual reasoning by fine-tuning a decoder-only LLM with a domain-specific Knowledge Graph. Used PEFT with a RAG pipeline injecting relevant sub-graphs and Chain-of-Thought prompting, reducing hallucination significantly.
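The sub-graph injection step of such a pipeline amounts to grounding the prompt in retrieved triples before Chain-of-Thought generation. A minimal sketch, assuming the retrieval has already happened; the `build_prompt` helper, the example triples, and the prompt wording are all illustrative, not the actual system.

```python
def build_prompt(question, subgraph_triples):
    """Assemble a chain-of-thought prompt grounded in retrieved KG triples.

    subgraph_triples: list of (subject, relation, object) tuples pulled
    from a domain knowledge graph for this question.
    """
    facts = "\n".join(f"- {s} {r} {o}." for s, r, o in subgraph_triples)
    return (
        "Use only the facts below when answering.\n"
        f"Facts:\n{facts}\n\n"
        f"Question: {question}\n"
        "Let's think step by step."
    )

prompt = build_prompt(
    "What class of drug is metformin?",
    [("metformin", "is_a", "biguanide"),
     ("biguanide", "treats", "type 2 diabetes")],
)
```

Constraining generation to retrieved facts is what drives the hallucination reduction: the model reasons over grounded triples rather than its parametric memory alone.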
Fine-tuned Qwen 2.5-32B on 1,000 math problems using a novel "Wait" token technique to extend reasoning. Generated chain-of-thought traces in a self-supervised loop for iterative training. Achieved 56.7% on AIME 2024. Deployed on Google Cloud Run for real-time inference.
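The core of the "Wait" technique is a decoding-time intervention: when the model tries to end its reasoning, the end-of-thinking marker is replaced with "Wait", forcing it to keep going. A minimal sketch with a stub generator; `step_fn`, the `</think>` marker, and the toy reasoning steps are assumptions for illustration, not the real decoding loop.

```python
def generate_with_wait(step_fn, prompt, max_extensions=2, stop="</think>"):
    """Extend reasoning by swapping the end-of-thinking marker for "Wait".

    step_fn: callable taking the text so far and returning the next chunk
             (a stand-in for a real token-by-token decoding loop).
    """
    text, extensions = prompt, 0
    while True:
        chunk = step_fn(text)
        if stop in chunk and extensions < max_extensions:
            chunk = chunk.replace(stop, "Wait,")  # suppress stop, keep reasoning
            extensions += 1
        text += chunk
        if stop in chunk:  # budget exhausted: let the model finish
            return text

# Stub "model" that tries to stop after each short reasoning step.
steps = iter(["Try x=3. </think>",
              " Check: 3*3=9, not 10. </think>",
              " Then x must satisfy x*x=10, so x≈3.16. </think>"])
out = generate_with_wait(lambda t: next(steps), "Solve x*x=10. ")
```

Each suppressed stop buys the model another round of self-correction, which is where the gains on hard math benchmarks come from.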
Designed a multimodal emotion recognition system combining computer vision, speech processing, and NLP. Used CNNs, LSTMs, and attention mechanisms to capture temporal and spatial dynamics of human emotions and integrated with a robotic platform.
Built a self-supervised audio-visual fusion system using footstep sounds and camera imaging for real-time pedestrian detection. Attention-based multimodal network achieves LiDAR-comparable performance at lower cost. Deployed on Jetson Orin Nano.
Developed a condition-aware BEV perception stack that dynamically re-weights camera, LiDAR, radar, and map priors across rain, fog, and nighttime scenes. Optimized for edge deployment with consistent low-latency inference under dense traffic.
Built a grounded enterprise QA copilot over PDFs, SOPs, and internal docs using OCR-aware chunking, retrieval, and citation-backed generation. Improved answer reliability while keeping response time fast enough for daily analyst workflows.
Built a creative eval and guardrail lab for agentic workflows with scenario stress tests, trace-based judges, and release gates. Visual diagnostics highlight failure concentration, drift, cost-quality tradeoffs, and deployment risk in one place.
Developed a city-scale digital twin that fuses traffic streams, graph topology, and event context to forecast congestion and simulate policy interventions. Includes rich visual mapping and frontier plots for practical planning decisions.
Built an advanced RAG platform over text, tables, charts, and images with modality-aware retrieval orchestration, citation validation, and hard regression gates. Includes deeply instrumented evaluation for correctness, faithfulness, robustness, latency, and cost.
Specialization in Deep Learning
Teaching: NLP, Neural Networks, Discrete Structures
Foundation in computer science, software engineering, and mathematics
"Hasnain's RAG pipeline and semantic search engine completely transformed our conversational platform. User engagement jumped 50%. He understands both the research and the engineering side — a rare combination."
"Hasnain built our Visual Attention model from scratch and beat the Fast R-CNN baseline by 8.5%. His PyTorch expertise and ability to deliver production-ready ML systems made all the difference."
"His work on virtual sensor systems and predictive diagnostics helped us cut maintenance costs significantly. Hasnain bridges the gap between automotive hardware and machine learning seamlessly."
Have a project in mind? I'd love to hear about it. Let's discuss how we can work together.