Stockholm university

Mahbub Ul Alam

About me

My passion lies in the power of generative AI, deep learning, and machine learning to transform healthcare and language technology.

My journey began with computational linguistics at the University of Stuttgart, Germany, and led to deep dives into AI and data science at Stockholm University. My doctoral work is dedicated to pushing the boundaries of clinical decision support through machine learning and the Internet of Medical Things (IoMT), with a sharp focus on improving the detection of COVID-19 and early sepsis.

The path I've traveled blends my academic background in computational linguistics with practical software engineering skills honed at Samsung Research. This blend has enabled me to contribute effectively to healthcare AI, as acknowledged by the best paper awards I'm honored to have received. I find the convergence of natural language processing (NLP) and healthcare AI especially compelling.

Looking ahead, my vision is to utilize large language models (LLMs), Retrieval Augmented Generation (RAG), LangChain, Quantization, and matryoshka embedding models to create scalable, remote, and cost-effective systems. These systems could be non-invasive and align with various language technologies and a patient-centric approach to healthcare.

Described by my peers as friendly, supportive, and humble, I try my best to bring these traits to every team I'm part of. Driven by a desire to explore the frontiers of generative AI, NLP, and healthcare AI, I am committed to developing intelligent, compassionate, and practical solutions in these dynamic fields.

If you're interested in collaboration, discussion, or providing guidance, I'm open to connecting. Please reach out on LinkedIn (https://www.linkedin.com/in/anondo) or drop me an email at mahbub.ul.alam.anondo@gmail.com.

Together, we can harness AI's vast potential to create a healthier world for all.

Thank you for taking the time to read about me!

Let all of us prosper together!

-------------------------------------------------------------------------------------------------------------------------

You can read my doctoral thesis clicking here.

 

Doctoral Thesis Title:

Advancing Clinical Decision Support Using Machine Learning & the Internet of Medical Things: Enhancing COVID-19 & Early Sepsis Detection

Abstract:

This thesis presents a critical examination of the positive impact of Machine Learning (ML) and the Internet of Medical Things (IoMT) for advancing the Clinical Decision Support System (CDSS) in the context of COVID-19 and early sepsis detection.

It emphasizes the transition towards patient-centric healthcare systems, which necessitate personalized and participatory care—a transition that could be facilitated by these emerging fields. The thesis accentuates how IoMT could serve as a robust platform for data aggregation, analysis, and transmission, which could empower healthcare providers to deliver more effective care. The COVID-19 pandemic has particularly stressed the importance of such patient-centric systems for remote patient monitoring and disease management.

The integration of ML-driven CDSSs with IoMT is viewed as an extremely important step in healthcare systems that could offer real-time decision-making support and enhance patient health outcomes. The thesis investigates ML's capability to analyze complex medical datasets, identify patterns and correlations, and adapt to changing conditions, thereby enhancing its predictive capabilities. It specifically focuses on the development of IoMT-based CDSSs for COVID-19 and early sepsis detection, using advanced ML methods and medical data.

Key issues addressed cover data annotation scarcity, data sparsity, and data heterogeneity, along with the aspects of security, privacy, and accessibility. The thesis also intends to enhance the interpretability of ML prediction model-based CDSSs. Ethical considerations are prioritized to ensure adherence to the highest standards.

The thesis demonstrates the potential and efficacy of combining ML with IoMT to enhance CDSSs by emphasizing the importance of model interpretability, system compatibility, and the integration of multimodal medical data for an effective CDSS.

Overall, this thesis makes a significant contribution to the fields of ML and IoMT in healthcare, featuring their combined potential to enhance CDSSs, particularly in the areas of COVID-19 and early sepsis detection.

The thesis hopes to enhance understanding among medical stakeholders and acknowledges the need for continuous development in this sector.

Keywords:

Internet of Medical Things, Patient-Centric Healthcare, Clinical Decision Support System, Predictive Modeling in Healthcare, Health Informatics, Healthcare analytics, COVID-19, Sepsis, COVID-19 Detection, Early Sepsis Detection, Lung Segmentation Detection, Medical Data Annotation Scarcity, Medical Data Sparsity, Medical Data Heterogeneity, Medical Data Security & Privacy, Practical Usability Enhancement, Low-End Device Adaptability, Medical Significance, Interpretability, Visualization, LIME, SHAP, Grad-CAM, LRP, Electronic Health Records, Thermal Image, Tabular Medical Data, Chest X-ray, Machine Learning, Deep Learning, Federated Learning, Semi-Supervised Machine Learning, Multi-Task Learning, Transfer Learning, Multi-Modality, Natural Language Processing, ClinicalBERT, GAN

Teaching

Research

My vision is to utilize large language models (LLMs), Retrieval Augmented Generation (RAG), LangChain, Quantization, and matryoshka embedding models to create scalable, remote, and cost-effective systems. These systems could be non-invasive and align with various language technologies and a patient-centric approach to healthcare.

Research projects

Publications

A selection from Stockholm University publication database

  • Advancing Clinical Decision Support Using Machine Learning & the Internet of Medical Things: Enhancing COVID-19 & Early Sepsis Detection

    2024. Mahbub Ul Alam.

    Thesis (Doc)

    This thesis presents a critical examination of the positive impact of Machine Learning (ML) and the Internet of Medical Things (IoMT) for advancing the Clinical Decision Support System (CDSS) in the context of COVID-19 and early sepsis detection.

    It emphasizes the transition towards patient-centric healthcare systems, which necessitate personalized and participatory care—a transition that could be facilitated by these emerging fields. The thesis accentuates how IoMT could serve as a robust platform for data aggregation, analysis, and transmission, which could empower healthcare providers to deliver more effective care. The COVID-19 pandemic has particularly stressed the importance of such patient-centric systems for remote patient monitoring and disease management.

    The integration of ML-driven CDSSs with IoMT is viewed as an extremely important step in healthcare systems that could offer real-time decision-making support and enhance patient health outcomes. The thesis investigates ML's capability to analyze complex medical datasets, identify patterns and correlations, and adapt to changing conditions, thereby enhancing its predictive capabilities. It specifically focuses on the development of IoMT-based CDSSs for COVID-19 and early sepsis detection, using advanced ML methods and medical data.

    Key issues addressed cover data annotation scarcity, data sparsity, and data heterogeneity, along with the aspects of security, privacy, and accessibility. The thesis also intends to enhance the interpretability of ML prediction model-based CDSSs. Ethical considerations are prioritized to ensure adherence to the highest standards.

    The thesis demonstrates the potential and efficacy of combining ML with IoMT to enhance CDSSs by emphasizing the importance of model interpretability, system compatibility, and the integration of multimodal medical data for an effective CDSS.

    Overall, this thesis makes a significant contribution to the fields of ML and IoMT in healthcare, featuring their combined potential to enhance CDSSs, particularly in the areas of COVID-19 and early sepsis detection.

    The thesis hopes to enhance understanding among medical stakeholders and acknowledges the need for continuous development in this sector.

    Read more about Advancing Clinical Decision Support Using Machine Learning & the Internet of Medical Things
  • SHAMSUL: Systematic Holistic Analysis to investigate Medical Significance Utilizing Local interpretability methods in deep learning for chest radiography pathology prediction

    2023. Mahbub Ul Alam (et al.). Nordic Machine Intelligence 3, 27-47

    Article

    The interpretability of deep neural networks has become a subject of great interest within the medical and healthcare domain. This attention stems from concerns regarding transparency, legal and ethical considerations, and the medical significance of predictions generated by these deep neural networks in clinical decision support systems. To address this matter, our study delves into the application of four well-established interpretability methods: Local Interpretable Model-agnostic Explanations (LIME), Shapley Additive exPlanations (SHAP), Gradient-weighted Class Activation Mapping (Grad-CAM), and Layer-wise Relevance Propagation (LRP). Leveraging the approach of transfer learning with a multi-label-multi-class chest radiography dataset, we aim to interpret predictions pertaining to specific pathology classes. Our analysis encompasses both single-label and multi-label predictions, providing a comprehensive and unbiased assessment through quantitative and qualitative investigations, which are compared against human expert annotation. Notably, Grad-CAM demonstrates the most favorable performance in quantitative evaluation, while the LIME heatmap score segmentation visualization exhibits the highest level of medical significance. Our research underscores both the outcomes and the challenges faced in the holistic approach adopted for assessing these interpretability methods and suggests that a multimodal-based approach, incorporating diverse sources of information beyond chest radiography images, could offer additional insights for enhancing interpretability in the medical domain.

    Read more about SHAMSUL: Systematic Holistic Analysis to investigate Medical Significance Utilizing Local interpretability methods in deep learning for chest radiography pathology prediction
  • COVID-19 detection from thermal image and tabular medical data utilizing multi-modal machine learning

    2023. Mahbub Ul Alam, Jaakko Hollmén, Rahim Rahmani Chianeh. 2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), 646-653

    Conference

    COVID-19 is a viral infectious disease that has created a global pandemic, resulting in millions of deaths and disrupting the world order. Different machine learning and deep learning approaches were considered to detect it utilizing different medical data. Thermal imaging is a promising option for detecting COVID-19 as it is low-cost, non-invasive, and can be maintained remotely. This work explores the COVID-19 detection issue using the thermal image and associated tabular medical data obtained from a publicly available dataset. We incorporate a multi-modal machine learning approach where we investigate the different combinations of medical and data type modalities to get an improved result. We use different machine learning and deep learning methods, namely random forests, Extreme Gradient Boosting (XGBoost), Multilayer Perceptron (MLP), and Convolutional Neural Network (CNN). Overall multi-modal results outperform any single modalities, and it is observed that the thermal image is a crucial factor in achieving it. XGBoost provided the best result with the area under the receiver operating characteristic curve (AUROC) score of 0.91 and the area under the precision-recall curve (AUPRC) score of 0.81. We also report the average of leave-one-positive-instance-out cross- validation evaluation scores. This average score is consistent with the test evaluation score for random forests and XGBoost methods. Our results suggest that utilizing thermal image with associated tabular medical data could be a viable option to detect COVID-19, and it should be explored further to create and test a real-time, secure, private, and remote COVID-19 detection application in the future.

    Read more about COVID-19 detection from thermal image and tabular medical data utilizing multi-modal machine learning
  • FedSepsis: A Federated Multi-Modal Deep Learning-Based Internet of Medical Things Application for Early Detection of Sepsis from Electronic Health Records Using Raspberry Pi and Jetson Nano Devices

    2023. Mahbub Ul Alam, Rahim Rahmani. Sensors 23 (2)

    Article

    The concept of the Internet of Medical Things brings a promising option to utilize various electronic health records stored in different medical devices and servers to create practical but secure clinical decision support systems. To achieve such a system, we need to focus on several aspects, most notably the usability aspect of deploying it using low-end devices. This study introduces one such application, namely FedSepsis, for the early detection of sepsis using electronic health records. We incorporate several cutting-edge deep learning techniques for the prediction and natural-language processing tasks. We also explore the multimodality aspect for the better use of electronic health records. A secure distributed machine learning mechanism is essential to building such a practical internet of medical things application. To address this, we analyze two federated learning techniques. Moreover, we use two different kinds of low-computational edge devices, namely Raspberry Pi and Jetson Nano, to address the challenges of using such a system in a practical setting and report the comparisons. We report several critical system-level information about the devices, namely CPU utilization, disk utilization, process CPU threads in use, process memory in use (non-swap), process memory available (non-swap), system memory utilization, temperature, and network traffic. We publish the prediction results with the evaluation metrics area under the receiver operating characteristic curve, the area under the precision–recall curve, and the earliness to predict sepsis in hours. Our results show that the performance is satisfactory, and with a moderate amount of devices, the federated learning setting results are similar to the single server-centric setting. Multimodality provides the best results compared to any single modality in the input features obtained from the electronic health records. Generative adversarial neural networks provide a clear superiority in handling the sparsity of electronic health records. Multimodality with the generative adversarial neural networks provides the best result: the area under the precision–recall curve is 96.55%, the area under the receiver operating characteristic curve is 99.35%, and earliness is 4.56 h. FedSepsis suggests that incorporating such a concept together with low-end computational devices could be beneficial for all the medical sector stakeholders and should be explored further.

    Read more about FedSepsis: A Federated Multi-Modal Deep Learning-Based Internet of Medical Things Application for Early Detection of Sepsis from Electronic Health Records Using Raspberry Pi and Jetson Nano Devices
  • Exploring LRP and Grad-CAM visualization to interpret multi-label-multi-class pathology prediction using chest radiography

    2022. Mahbub Ul Alam, Jón Rúnar Baldvinsson, Yuxia Wang. 2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS), 258-263

    Conference

    The area of interpretable deep neural networks has received increased attention in recent years due to the need for transparency in various fields, including medicine, healthcare, stock market analysis, compliance with legislation, and law. Layer-wise Relevance Propagation (LRP) and Gradient-weighted Class Activation Mapping (Grad-CAM) are two widely used algorithms to interpret deep neural networks. In this work, we investigated the applicability of these two algorithms in the sensitive application area of interpreting chest radiography images. In order to get a more nuanced and balanced outcome, we use a multi-label classification-based dataset and analyze the model prediction by visualizing the outcome of LRP and Grad-CAM on the chest radiography images. The results show that LRP provides more granular heatmaps than Grad-CAM when applied to the CheXpert dataset classification model. We posit that this is due to the inherent construction difference of these algorithms (LRP is layer-wise accumulation, whereas Grad-CAM focuses primarily on the final sections in the model's architecture). Both can be useful for understanding the classification from a micro or macro level to get a superior and interpretable clinical decision support system.

    Read more about Exploring LRP and Grad-CAM visualization to interpret multi-label-multi-class pathology prediction using chest radiography
  • Federated Semi-Supervised Multi-Task Learning to Detect COVID-19 and Lungs Segmentation Marking Using Chest Radiography Images and Raspberry Pi Devices: An Internet of Medical Things Application

    2021. Mahbub Ul Alam, Rahim Rahmani. Sensors 21 (15)

    Article

    Internet of Medical Things (IoMT) provides an excellent opportunity to investigate better automatic medical decision support tools with the effective integration of various medical equipment and associated data. This study explores two such medical decision-making tasks, namely COVID-19 detection and lung area segmentation detection, using chest radiography images. We also explore different cutting-edge machine learning techniques, such as federated learning, semi-supervised learning, transfer learning, and multi-task learning to explore the issue. To analyze the applicability of computationally less capable edge devices in the IoMT system, we report the results using Raspberry Pi devices as accuracy, precision, recall, Fscore for COVID-19 detection, and average dice score for lung segmentation detection tasks. We also publish the results obtained through server-centric simulation for comparison. The results show that Raspberry Pi-centric devices provide better performance in lung segmentation detection, and server-centric experiments provide better results in COVID-19 detection. We also discuss the IoMT application-centric settings, utilizing medical data and decision support systems, and posit that such a system could benefit all the stakeholders in the IoMT domain.

    Read more about Federated Semi-Supervised Multi-Task Learning to Detect COVID-19 and Lungs Segmentation Marking Using Chest Radiography Images and Raspberry Pi Devices: An Internet of Medical Things Application
  • Cognitive Internet of Medical Things Architecture for Decision Support Tool to Detect Early Sepsis Using Deep Learning

    2021. Mahbub Ul Alam, Rahim Rahmani. Biomedical Engineering Systems and Technologies, 366-384

    Chapter

    The internet of medical things (IoMT) is a relatively new territory for the internet of things (IoT) platforms where we can obtain a significant amount of potential benefits with the combination of cognitive computing. Effective utilization of the healthcare data is the critical factor in achieving such potential, which can be a significant challenge as the medical data is extraordinarily heterogeneous and spread across different devices with different degrees of importance and authority. To address this issue, in this paper, we introduce a cognitive internet of medical things architecture with a use case of early sepsis detection using electronic health records. We discuss the various aspects of IoMT architecture. Based on the discussion, we posit that the proposed architecture could improve the overall performance and usability in the IoMT platforms in particular for different IoMT based services and applications. The use of an RNN-LSTM network for early prediction of sepsis according to Sepsis-3 criteria is evaluated with the empirical investigation using six different time window sizes. The best result is obtained from a model using a four-hour window with the assumption that data is missing-not-at-random. It is observed that when learning from heterogeneous sequences of sparse medical data for early prediction of sepsis, the size of the time window has a considerable impact on predictive performance.

    Read more about Cognitive Internet of Medical Things Architecture for Decision Support Tool to Detect Early Sepsis Using Deep Learning
  • Terminology Expansion with Prototype Embeddings: Extracting Symptoms of Urinary Tract Infection from Clinical Text

    2021. Mahbub Ul Alam (et al.). Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF, 47-57

    Conference

    Many natural language processing applications rely on the availability of domain-specific terminologies containing synonyms. To that end, semi-automatic methods for extracting additional synonyms of a given concept from corpora are useful, especially in low-resource domains and noisy genres such as clinical text, where nonstandard language use and misspellings are prevalent. In this study, prototype embeddings based on seed words were used to create representations for (i) specific urinary tract infection (UTI) symptoms and (ii) UTI symptoms in general. Four word embedding methods and two phrase detection methods were evaluated using clinical data from Karolinska University Hospital. It is shown that prototype embeddings can effectively capture semantic information related to UTI symptoms. Using prototype embeddings for specific UTI symptoms led to the extraction of more symptom terms compared to using prototype embeddings for UTI symptoms in general. Overall, 142 additional UTI symp tom terms were identified, yielding a more than 100% increment compared to the initial seed set. The mean average precision across all UTI symptoms was 0.51, and as high as 0.86 for one specific UTI symptom. This study provides an effective and cost-effective solution to terminology expansion with small amounts of labeled data.

    Read more about Terminology Expansion with Prototype Embeddings
  • The accuracy of fully automated algorithms for surveillance of healthcare-associated urinary tract infections in hospitalized patients

    2021. Suzanne Desirée van der Werff (et al.). Journal of Hospital Infection 110, 139-147

    Article

    Background: Surveillance for healthcare-associated infections such as healthcareassociated urinary tract infections (HA-UTI) is important for directing resources and evaluating interventions. However, traditional surveillance methods are resourceintensive and subject to bias.

    Aim: To develop and validate a fully automated surveillance algorithm for HA-UTI using electronic health record (EHR) data.

    Methods: Five algorithms were developed using EHR data from 2979 admissions at Karolinska University Hospital from 2010 to 2011: (1) positive urine culture (UCx); (2) positive UCx + UTI codes (International Statistical Classification of Diseases and Related Health Problems, 10th revision); (3) positive UCx + UTI-specific antibiotics; (4) positive UCx + fever and/or UTI symptoms; (5) algorithm 4 with negation for fever without UTI symptoms. Natural language processing (NLP) was used for processing free-text medical notes. The algorithms were validated in 1258 potential UTI episodes from January to March 2012 and results extrapolated to all UTI episodes within this period (N 1/4 16,712). The reference standard for HA-UTIs was manual record review according to the European Centre for Disease Prevention and Control (and US Centers for Disease Control and Prevention) definitions by trained healthcare personnel.

    Findings: Of the 1258 UTI episodes, 163 fulfilled the ECDC HA-UTI definition and the algorithms classified 391, 150, 189, 194, and 153 UTI episodes, respectively, as HA-UTI. Algorithms 1, 2, and 3 had insufficient performances. Algorithm 4 achieved better performance and algorithm 5 performed best for surveillance purposes with sensitivity 0.667 (95% confidence interval: 0.594-0.733), specificity 0.997 (0.996-0.998), positive predictive value 0.719 (0.624-0.807) and negative predictive value 0.997 (0.996-0.997).

    Conclusion: A fully automated surveillance algorithm based on NLP to find UTI symptoms in free-text had acceptable performance to detect HA-UTI compared to manual record review. Algorithms based on administrative and microbiology data only were not sufficient.

    Read more about The accuracy of fully automated algorithms for surveillance of healthcare-associated urinary tract infections in hospitalized patients
  • Intelligent context-based healthcare metadata aggregator in internet of medical things platform

    2020. Mahbub Ul Alam, Rahim Rahmani. Proceedings of the 15th International Conference on Future Networks and Communications (FNC), Procedia Computer Science, 411-418

    Conference

    The internet of medical things (IoMT) is relatively new territory for the internet of things (IoT) platforms where we can obtain a significant amount of potential benefits in terms of smart future network computing and intelligent health-care systems. Effective utilization of the health-care data is the key factor here in achieving such potential, which can be a significant challenge as the data is extraordinarily heterogeneous and spread across different devices with different degrees of importance and authority to access it. To address this issue, in this paper, we introduce an intelligent context-based metadata aggregator in the decentralized and distributed edge-based IoMT platform with a use case of early sepsis detection using clinical data. We thoroughly discuss the various aspects of the metadata aggregator and the overall IoMT architecture. Based on the discussion, we posit that the proposed architecture could improve the overall performance and usability in the IoMT platforms in particular for different IoMT based services and applications.

    Read more about Intelligent context-based healthcare metadata aggregator in internet of medical things platform
  • Deep Learning from Heterogeneous Sequences of Sparse Medical Data for Early Prediction of Sepsis

    2020. Mahbub Ul Alam (et al.). Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies, Volume 5: HEALTHINF, 45-55

    Conference

    Sepsis is a life-threatening complication to infections, and early treatment is key for survival. Symptoms of sepsis are difficult to recognize, but prediction models using data from electronic health records (EHRs) can facilitate early detection and intervention. Recently, deep learning architectures have been proposed for the early prediction of sepsis. However, most efforts rely on high-resolution data from intensive care units (ICUs). Prediction of sepsis in the non-ICU setting, where hospitalization periods vary greatly in length and data is more sparse, is not as well studied. It is also not clear how to learn effectively from longitudinal EHR data, which can be represented as a sequence of time windows. In this article, we evaluate the use of an LSTM network for early prediction of sepsis according to Sepsis-3 criteria in a general hospital population. An empirical investigation using six different time window sizes is conducted. The best model uses a two-hour window and assumes data is missing not at random, clearly outperforming scoring systems commonly used in healthcare today. It is concluded that the size of the time window has a considerable impact on predictive performance when learning from heterogeneous sequences of sparse medical data for early prediction of sepsis.

    Read more about Deep Learning from Heterogeneous Sequences of Sparse Medical Data for Early Prediction of Sepsis

Show all publications by Mahbub Ul Alam at Stockholm University