Available for new projects

Hello, I'm Hasnain Khan

Machine Learning Engineer

From fine-tuning LLMs at Google to building autonomous perception systems and leading ML at scale — I architect intelligent systems that ship. 8+ years turning research papers into production-grade AI.

0+ Years Experience
0 Companies
0+ ML Systems Shipped
Scroll to explore

Turning Data Into Intelligence

8+ Years of
Experience

I'm Muhammad Hasnain Khan — a Lead Machine Learning Engineer who builds AI systems that work in the real world, not just in notebooks.

With a Master's in Computer Science (specialization in Deep Learning) from FAST Nuces and a Bachelor's from the University of Bradford, I've spent 8+ years shipping ML systems across Google, FrontNow, COMPREDICT, and ikeGPS. My work spans LLMs, computer vision, autonomous vehicles, NLP, and multi-sensor fusion.

I've trained large language models at Google for visual storytelling, built RAG-powered conversational AI platforms boosting engagement by 50%, and deployed real-time perception systems on autonomous vehicles with sub-50ms latency. I'm equally comfortable fine-tuning a Qwen 32B model as I am wiring up Kafka streams for real-time inference.

End-to-End ML

From research prototypes to production-grade systems at scale

LLMs & Multimodal AI

Fine-tuning, RAG pipelines, and vision-language models

Autonomous Systems

Sensor fusion, perception, and real-time edge deployment

Professional Journey

Sep 2023 — Present

Lead Machine Learning Engineer

FrontNow

Leading development of fine-tuned LLMs for a multilingual conversational AI platform. Engineered a hybrid RAG pipeline with FAISS and knowledge graphs, boosting engagement by 50%. Built streaming inference with RabbitMQ & Apache Flink and established MLOps with A/B testing, cutting inference latency by 30%.

LLMsRAGFAISSApache FlinkMLOps
Sep 2022 — Mar 2023

Machine Learning Engineer III

COMPREDICT

Built virtual sensor systems for automotive diagnostics using CAN bus and OBD-II data. Predicted and diagnosed vehicle faults to reduce maintenance costs and increase reliability. Integrated ML-powered virtual sensors with cross-functional automotive platforms.

Automotive MLVirtual SensorsCAN BusPredictive Maintenance
Jul 2021 — Aug 2022

Machine Learning Engineer II

Google

Trained large language models for semantic extraction from natural language for visual storytelling. Devised transformer-based pipelines aggregating multilingual corpora into latent scene-embeddings for 3D GAN-driven visual storytelling, reducing annotation overhead by 30%.

LLMsTransformers3D GANsNLPVisual Storytelling
Sep 2019 — Dec 2021

Machine Learning Engineer

ikeGPS

Built Visual Attention-based models in PyTorch for novel Webpage Object Detection. Utilized contextual features from ordered web elements via ResNet101. Achieved 95% accuracy for product price detection — 8.5% above Fast R-CNN baseline.

PyTorchComputer VisionAttention ModelsResNet
Jun 2018 — Jul 2019

Software Engineer (AR)

Ozi Technology (Gamerz Studio)

Designed architecture and implemented core features for AR applications. Transformed design specifications into functional apps and established development pipelines and strategy.

Unity3DC#ARSoftware Architecture

Featured Projects

Research-driven projects spanning LLMs, autonomous systems, computer vision, and multimodal AI

Real-Time Multi-Sensor Fusion for Autonomous Perception

Engineered a unified perception pipeline fusing camera, LiDAR, and radar via a cross-modal BEV-centric architecture with attention-based alignment and 3D detection heads. Deployed on an autonomous vehicle testbed with model quantization for sub-50ms latency.

+17% Object Recall
<50ms Latency
BEV FusionLiDARAttentionQuantizationEmbedded GPU
View Details

Instruction-Tuned Multimodal LLM for Scene Understanding

Architected a Multimodal LLM integrating a Vision Transformer with a decoder-only LLM. Implemented a projection module to align visual features with word embeddings, enabling conversational VQA, referring expression generation, and multimodal grounding.

ViT + LLM Architecture
Zero-shot VQA Capable
Vision TransformerInstruction TuningVQAGrounding
View Details

Knowledge-Augmented Reasoning Engine via Fine-Tuned LLM

Built a novel framework enhancing factual reasoning by fine-tuning a decoder-only LLM with a domain-specific Knowledge Graph. Used PEFT with a RAG pipeline injecting relevant sub-graphs and Chain-of-Thought prompting, reducing hallucination significantly.

PEFT + RAG Methodology
CoT Multi-hop QA
PEFTRAGKnowledge GraphChain-of-Thought
View Details

Enhancing Math Reasoning in LLMs via Self-Supervised Fine-Tuning

Fine-tuned Qwen 2.5-32B on 1,000 math problems using a novel "Wait" token technique to extend reasoning. Self-supervised CoT generation for iterative training. Achieved 56.7% on AIME 2024. Deployed on Google Cloud Run for real-time inference.

56.7% AIME 2024
32B Parameters
Qwen 2.5Self-SupervisedCoTCloud Run
View Details

Multimodal Emotion Recognition for Human-Robot Interaction

Designed a multimodal emotion recognition system combining computer vision, speech processing, and NLP. Used CNNs, LSTMs, and attention mechanisms to capture temporal and spatial dynamics of human emotions and integrated with a robotic platform.

3 Modalities
Real-time Inference
CNNsLSTMsAttentionSpeechRobotics
View Details

Audio-Visual Fusion for Dynamic Pedestrian Awareness

Built a self-supervised audio-visual fusion system using footstep sounds and camera imaging for real-time pedestrian detection. Attention-based multimodal network achieves LiDAR-comparable performance at lower cost. Deployed on Jetson Orin Nano.

LiDAR-level Performance
Edge Deployed
Self-SupervisedAudio-VisualJetson OrinQuantized
View Details

Weather-Resilient Multi-Sensor Perception for Autonomous Driving

Developed a condition-aware BEV perception stack that dynamically re-weights camera, LiDAR, radar, and map priors across rain, fog, and nighttime scenes. Optimized for edge deployment with consistent low-latency inference under dense traffic.

+24% Recall Gain
<55ms Latency
BEV FusionAdverse WeatherRadarTensorRT
View Details

Document Intelligence Copilot for Enterprise Knowledge Work

Built a grounded enterprise QA copilot over PDFs, SOPs, and internal docs using OCR-aware chunking, retrieval, and citation-backed generation. Improved answer reliability while keeping response time fast enough for daily analyst workflows.

+29% Answer Accuracy
<1.5s Median Response
Document AIRAGOCRCitations
View Details

Agent Reliability Lab for Tool-Using LLM Systems

Built a creative eval and guardrail lab for agentic workflows with scenario stress tests, trace-based judges, and release gates. Visual diagnostics highlight failure concentration, drift, cost-quality tradeoffs, and deployment risk in one place.

+33% Task Success
-58% Critical Failures
Agent EvalsGuardrailsObservabilityTool Use
View Details

Neural City Digital Twin for Traffic Forecasting and Planning

Developed a city-scale digital twin that fuses traffic streams, graph topology, and event context to forecast congestion and simulate policy interventions. Includes rich visual mapping and frontier plots for practical planning decisions.

-27% Peak Congestion
4.3x Planning Throughput
GNNSimulationTime SeriesUrban AI
View Details

Advanced Multimodal RAG with End-to-End Evaluation Framework

Built an advanced RAG platform over text, tables, charts, and images with modality-aware retrieval orchestration, citation validation, and hard regression gates. Includes deeply instrumented evaluation for correctness, faithfulness, robustness, latency, and cost.

+37% Correctness Lift
91% Citation Faithfulness
Multimodal RAGRerankingEvalOpsGrounded QA
View Details

Skills & Expertise

Core ML & AI

Deep Learning
LLMs / Fine-Tuning
Computer Vision
NLP / Transformers
RAG / Knowledge Graphs
Autonomous Vehicles

Frameworks & Libraries

PyTorch
TensorFlow / Keras
Scikit-learn
LangChain / LlamaIndex
OpenCV / CoreML
PySpark / Hadoop

MLOps & Cloud

Docker / Kubernetes
AWS / GCP
CI/CD / Jenkins
Databricks
RabbitMQ / Kafka
FastAPI / GraphQL

Programming & Data

Python
C++ / C
SQL / NoSQL
C# / Unity3D
Java
Git / Pytest

Academic Background

2019 — 2021

Master's in Computer Science

FAST NUCES, Islamabad

Specialization in Deep Learning

Computer VisionGANsNLPAutonomous VehiclesInformation Retrieval

Teaching: NLP, Neural Networks, Discrete Structures

2015 — 2018

Bachelor's in Computer Science

University of Bradford, UK

Foundation in computer science, software engineering, and mathematics

What Clients Say

"Hasnain's RAG pipeline and semantic search engine completely transformed our conversational platform. User engagement jumped 50%. He understands both the research and the engineering side — a rare combination."

FN
FrontNow Team AI Platform, Germany

"Hasnain built our Visual Attention model from scratch and beat the Fast R-CNN baseline by 8.5%. His PyTorch expertise and ability to deliver production-ready ML systems made all the difference."

iK
ikeGPS Team Computer Vision, New Zealand

"His work on virtual sensor systems and predictive diagnostics helped us cut maintenance costs significantly. Hasnain bridges the gap between automotive hardware and machine learning seamlessly."

CP
COMPREDICT Team Automotive AI, Germany

Let's Build Something Amazing

Have a project in mind? I'd love to hear about it. Let's discuss how we can work together.

Location

Germany — Available Worldwide