Abdul Hameed Azeemi
I am a machine learning researcher and an AI engineer. My research interests span data-efficient machine learning, LLM agents, speech processing, and low-resource languages.
I completed my PhD in machine learning at LUMS, advised by Dr. Agha Ali Raza and co-advised by Dr. Ihsan Ayyub Qazi.
I am also the founding member of ExactBuyer, where I led the development of an AI-powered go-to-market (GTM) platform. Prior to this, I was a tech lead at Educative, where I worked on payments, content discovery, and B2B integrations.
News
2026-01
Defended my PhD dissertation
Successfully defended my PhD dissertation on 'Enabling efficient speech and language processing using data selection and pruning'. Thanks to my advisor, Dr. Agha Ali Raza, co-advisor, Dr. Ihsan Ayyub Qazi, and committee members from CMU, ITU, and LUMS for their invaluable support and feedback. Especially grateful to my lifelong mentor, Khawaja Shamsuddin Azeemi, whose continuous guidance has been truly transformative and has made this journey possible.
2025-08
Talk: Deconstructing Coding Agents
Delivered a talk on the architecture and capabilities of modern coding agents to C-suite executives and engineering leadership in Lahore.
2025-07
Released dPrune: A Framework for Data Pruning
dPrune is a Python library designed to make data selection and pruning simple and accessible for NLP and speech tasks.
2024-05
LLMs in Development Workflows - PyCon 2024
Presented at PyCon 2024 (PK) on integrating Large Language Models into software development workflows.
Publications
EACL 2026 (Findings)
Language Model-Driven Data Pruning Enables Efficient Active Learning
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
ACL 2025 Workshop on GEM
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation
Samee Arif, Sualeha Farid, Abdul Hameed Azeemi, Awais Athar, Agha Ali Raza
COLING 2025
To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
EMNLP 2024 (Findings)
Generalists vs. Specialists: Evaluating Large Language Models for Urdu
Samee Arif, Abdul Hameed Azeemi, Agha Ali Raza, Awais Athar
ACL 2024 Workshop on Low-Resource Machine Translation
Challenges in Urdu Machine Translation
Abdul Basit, Abdul Hameed Azeemi, Agha Ali Raza
NeurIPS 2023 Workshop on Efficient Speech and Natural Language Processing
Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
EMNLP 2023 (Findings)
Data Pruning for Efficient Model Pruning in Neural Machine Translation
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
INTERSPEECH 2023
Self-Supervised Dataset Pruning for Efficient Training in Audio Anti-spoofing
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
INTERSPEECH 2022
Dataset Pruning for Resource-constrained Spoofed Audio Detection
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza