Advancements in Machine Learning Applications for The Pharmaceutical, Biomedical, And Healthcare Industries

Parankush Koul, Dr. Indu B. Koul,

doi:10.5281/zenodo.15204262

Review Paper | Open Access
Volume 03 | Issue 04 | Article Id IJPS/250304145

Advancements in Machine Learning Applications for The Pharmaceutical, Biomedical, And Healthcare Industries
Parankush Koul* ¹ Dr. Indu B. Koul
¹Department of Mechanical and Aerospace Engineering, Illinois Institute of Technology, 3201 South State Street, Chicago, 60616, Illinois, United States of America
²Department of Biochemistry, Postgraduate Institute of Medical Education and Research, Sector-12, Chandigarh, 160012, India

Abstract

This paper investigates the developments in machine learning (ML) applications within the pharmaceutical industry along with biomedical and healthcare sectors. ML demonstrates its crucial impact through biomarker identification and improvements in drug discovery and diagnostic accuracy. The biomedical sector has implemented multiple ML algorithms, including Support Vector Machines (SVMs) and Random Forests (RFs), to detect microRNA (miRNA) biomarkers for cancer with excellent classification performance. Automated learning tools, including BioAutoML, have optimized feature extraction and model selection to surpass traditional methods regarding predictive performance. The pharmaceutical sector has benefited from ML-integrated high-content screening platforms that have resulted in the discovery of new antibacterial agents. Metrics such as the lowest effective dose (LOED) have broadened the scope of antibiotic discovery. The paper demonstrates how ML brings transformative enhancements to efficiency and accuracy along with innovative advancements in critical industry sectors.

Keywords

Machine Learning, Biomarkers, Automated Learning, Drug Discovery, Biomedical Imaging

Introduction

Recent years have witnessed significant progress in the adoption of ML technologies across pharmaceutical, biomedical, and healthcare industries. As a specialized branch of artificial intelligence (AI), ML has transformed pharmaceutical and healthcare sectors through its impact on operational efficiency as well as accuracy and innovation within multiple processes that include drug discovery and patient care. ML algorithms allow industries to analyze massive datasets with efficiency, which previously required extensive labor and time, thus accelerating decision-making and enhancing performance outcomes [1]. The pharmaceutical sector has experienced a major transformation in drug discovery and development because of ML advancements. Traditional drug development methods demand extensive time and resources for trials, yet ML streamlines these processes by predicting biological activity and optimizing drug formulations. The progress in ML has enabled high-throughput screening, which helps researchers discover potential drug candidates with greater efficiency. The application of ML proves crucial for solving the problems caused by high attrition rates in clinical trials since many candidates do not advance in later stages because of unexpected biological interactions [2, 3].

ML applications reach other fields beyond the pharmaceutical sector. ML methods deployed in the biomedical domain analyze complex biological datasets and facilitate genomic research while also enhancing diagnostic precision. These algorithms enable real-time biological process modeling, which provides insights previously inaccessible with conventional methods. ML has become vital in creating personalized medical treatments by combining genetic data with demographic information, which enables healthcare providers to develop treatment plans suited to each patient's unique profile [4].

The healthcare sector has adopted ML technologies more frequently to improve both patient care quality and operational effectiveness. ML demonstrates cost reduction and better patient outcomes through its applications in patient monitoring, predictive analytics, and administrative task automation. AI-driven applications equip clinicians with real-time decision support mechanisms that enable faster patient responses and treatment pathway optimization [5]. The review highlights how advancements in ML applications have created a substantial transformative effect across pharmaceutical, biomedical, and healthcare sectors. The review demonstrates how ML applications boost productivity and effectiveness in pharmaceutical and healthcare sectors and underscores the importance of ongoing research to solve current challenges. The integration of ML with established methods will transform pharmaceutical and healthcare advancements to deliver enhanced solutions that will serve humanity [6, 7].

Methodologies for Implementing ML in the Pharmaceutical, Biomedical, and Healthcare Sectors

Identifying Use Cases: The initial phase requires the identification of precise scenarios that will benefit from ML solutions. Potential applications of ML cover fields like drug discovery and patient outcome prediction as well as personalized medicine and operational efficiency. Stakeholders can establish priorities for ML initiatives by analyzing both business requirements and operational obstacles. ML enables the pharmaceutical industry to perform real-time analysis using Process Analytical Technology (PAT), which supports flexible production and enhanced quality control (QC) during drug manufacturing [8].
Data Collection and Preparation: The collection and preparation of data play a fundamental role in achieving ML project success. ML projects need data from multiple sources like clinical trials as well as electronic health records (EHRs) and laboratory tests. The aggregation of large datasets stands as a fundamental step to extract valuable insights. Data must undergo thorough cleaning and standardization and receive proper labeling to guarantee precise training of ML models. The pharmaceutical industry uses near-infrared spectroscopy (NIRS) to monitor processes in real time through reliable sensor data collection [9].
Choosing the Right ML Algorithms: After preprocessing data for quality assurance, it is essential to select suitable ML algorithms that match the problem requirements. The effectiveness of ML algorithms like neural networks, decision trees, and SVMs varies according to both the complexity and type of the data used. Artificial neural networks (ANNs) excel at managing complex tasks and demonstrate potential benefits for smart manufacturing methodologies [8].
Model Training and Testing: The model training process requires the prepared dataset to instruct the chosen ML algorithm on how to generate predictions or decisions. The model requires comprehensive testing to assess its performance accuracy after this process. Researchers commonly use cross-validation methods to both prevent overfitting and verify that ML models perform effectively on new datasets. The application of Principal Component Analysis and Partial Least Squares (PLS) Regression models improves outcome prediction accuracy in pharmaceutical processes [9].
Implementation and Integration: After completing training and validation processes for the model implementation, the subsequent step. The process involves embedding ML models into the organization's current workflows and systems. Proper training of system-interacting staff members remains crucial for achieving an efficient transition process. Maintaining regulatory compliance and data privacy protection within the system is essential and particularly important in healthcare operations. The automation of the CDISC Study Data Tabulation Model (SDTM) migration demonstrates the application of ML to improve data standardization for regulatory submissions while saving time and resources [10].
Monitoring and Maintenance: Continuous monitoring and maintenance of ML models post-implementation are essential to maintain their effectiveness over time. The maintenance process requires periodic updates to the ML models using new data, which improves their accuracy while also fixing any existing problems. Evaluation against set metrics allows for regular assessment of model performance and helps identify necessary adjustments. QC processes provide assurance that results remain compliant with established regulations [10].
Future Scalability and Enhancement: Organizations need to evaluate how ML applications will scale in the future. The development of scalable ML systems becomes crucial when expanding datasets and innovative methodologies emerge to support future advancements. The ability to adapt facilitates progressive improvements across clinical research initiatives and operational processes as well as patient care standards in pharmaceutical and healthcare sectors. By recognizing ML's support for innovation across these sectors, we achieve better patient outcomes along with operational excellence [8, 10].

Benefits and Applications of ML for the Pharmaceutical, Biomedical, and Healthcare Industries

The pharmaceutical, biomedical, and healthcare industries have transformed through ML, which delivers numerous benefits and applications that improve operational efficiency alongside drug discovery and patient care.

BENEFITS

ML technology enables these industries to speed up their drug discovery processes, which is considered one of its most significant advantages. Pharmaceutical companies can use ML algorithms to process extensive datasets for discovering potential drug candidates in both efficient and effective ways [11]. The accelerated identification process cuts down both development time and research and development expenditure [2]. ML reduces dependency on conventional trial-and-error techniques through the implementation of more focused drug testing strategies. The advanced predictive analytics power of ML proves to be extremely valuable in clinical trials. ML algorithms forecast clinical trial results through analysis of past trial data which boosts success likelihood while decreasing drug development financial risks. ML algorithms enable the identification of patient groups more likely to benefit from particular treatments thereby enhancing personalized medical practices [2]. ML addresses the big data analysis difficulties faced in biomedical research. The exponential growth of healthcare data allows ML algorithms to analyze this information for hidden insights which can enhance patient care and improve healthcare systems [12]. In genomics research genetic data analysis through ML enables identification of disease susceptibility factors which supports timely medical interventions.

1. Applications of ML

ML technology finds many different uses within healthcare and pharmaceutical sectors while these applications consistently develop. The pharmaceutical research and development sector utilize ML for both discovering new drugs and optimizing them. The latest algorithms evaluate chemical substances and biological reactions by modeling potential drug-body interactions [13]. Leading pharmaceutical corporations Novartis and AstraZeneca use ML to optimize drug development by deploying algorithms that forecast both drug efficacy and safety results [11]. ML applications in clinical environments serve diagnostic purposes and support treatment planning procedures. ML models are now widely used to evaluate imaging data including X-rays and Magnetic Resonance Imaging (MRI) scans which helps healthcare professionals diagnose cancer faster and with greater precision. Through wearables and Internet of Things (IoT) devices ML enables patient remote monitoring which improves patient engagement and facilitates proactive healthcare management using real-time data analysis [11]. ML plays a vital role in pharmacovigilance as an essential tool for monitoring drug safety after they have been released to the market. ML algorithms have the capability to examine healthcare database reports of adverse drug events to detect possible safety signals while assisting with regulatory compliance requirements [14]. Through its application, patient safety becomes guaranteed while drug development receives optimized feedback regarding potential marketing phase issues. ML technologies are currently utilized to refine and optimize clinical trial designs. ML algorithms use historical trial data to create more resource-efficient trials by developing superior recruitment strategies and endpoints [15]. ML integration within healthcare processes creates valuable insights while enhancing patient care and stimulating innovation throughout the pharmaceutical, biomedical, and healthcare industries.

Companies Leveraging ML in Pharmaceutical, Biomedical, and Healthcare Industries
1. Pharmaceutical Companies

PhaseV: Phase V utilizes ML platforms to enhance the efficiency of drug development procedures. The AdaptV Platform from PhaseV supports the design of adaptive clinical trials by enabling sponsors to make real-time adjustments according to emerging data. The Causal Platform applies causal ML techniques to uncover unseen patterns in clinical data, which improves therapeutic indication selection and expansion across multiple therapeutic fields with a specific focus on oncology and CNS disorders [16].
Pfizer: Through collaborations with firms such as CytoReason, Pfizer makes use of AI technologies to boost pharmaceutical discovery. Their research efforts target immune-mediated and oncology diseases, where they use AI-powered analysis to expedite their research activities [17].
AstraZeneca: AstraZeneca combines the expertise of BenevolentAI and Schrödinger to enhance drug discovery through ML for target identification and drug design. Through these technological partnerships, the company focuses on enhancing their R&D process efficiency [17].
Roche: Roche leads in the application of ML technologies to discover new drugs in the fields of neuroscience and oncology. Roche engages with AI-focused startups, including Recursion Pharmaceuticals, to enhance drug development speed through advanced data analysis and predictive modeling techniques [17].
Merck: Through strategic partnerships with AI companies Atomwise and Iktos, Merck utilizes ML capabilities. The organization concentrates on refining new drug candidate identification while improving the drug discovery process to showcase the pharmaceutical industry’s increasing adoption of AI technology [17].
1. Biomedical Companies

Atomwise: Atomwise employs ML techniques to discover small molecule drugs, which enables the transition from conventional methods to data-centered drug discovery processes. The AtomNet platform allows them to scan over three trillion compounds for potential drug candidates. Their collaboration with Sanofi demonstrates how cooperative innovation plays an essential role in drug development processes [18].
Exscientia: Exscientia leads the field of AI-driven precision medicine by applying ML to speed up drug discovery processes. The partnership between Exscientia and Sanofi demonstrates their dedication to AI applications in biopharmaceuticals through the development of bispecific small molecules for treating various diseases [18].
Iktos: This company employs ML to quickly find small molecules that can move into clinical trials. The proprietary AI technology they developed helped major pharmaceutical companies to streamline their drug discovery processes through successful collaboration. These advanced platforms greatly enhance both the efficiency of drug development processes and the chances of achieving success with drug candidates [18].
Insilico Medicine: AI technology powers every phase of Insilico Medicine's pharmaceutical research activities, including target identification and clinical trial evaluation. The Pharma.AI platform provides comprehensive support that speeds up new medicine discovery and development while reducing time and costs [18].
Cradle: The use of generative AI by Cradle facilitates protein design, which improves the development process of bio-based products. They work to speed up research and development processes while enabling biologists to efficiently understand complex protein structures [18].
1. Healthcare Companies

Definitive Healthcare: This business focuses on commercial intelligence by applying ML techniques through its Atlas AI platform, which generates insights from extensive datasets for healthcare providers and medical device organizations. Through health-related data analysis, they enable organizations to make informed decisions thanks to actionable insights [19].
GRAIL: GRAIL utilizes ML algorithms for early cancer detection by examining DNA fragments extracted from blood samples. The Galleri test developed by GRAIL demonstrates diagnostic innovation through its scalable approach to early cancer detection [19].
Microsoft: Through its Project InnerEye initiative, Microsoft uses computer vision and ML techniques to improve radiological imaging, which helps healthcare providers to diagnose cancer accurately and plan treatments. The tools they develop analyze huge datasets to make personalized recommendations for patients [19].
Novo Nordisk: The global pharmaceutical company uses ML in its Modelling and Predictive Technologies department to streamline drug development by automating complex cognitive tasks, which leads to enhanced efficiency [19].
Ciox Health: Ciox Health deploys ML systems that allow healthcare professionals to obtain patient data more quickly. Through their Datavant Switchboard platform, AI tools streamline EHR management and analysis and maintain privacy regulation compliance [19].

Overview of Past Research in Advancements in ML Applications for the Pharmaceutical, Biomedical, and Healthcare Industries

The number of articles covered in this review on the Advancements in ML Applications for the Pharmaceutical, Biomedical, and Healthcare Industries are shown in Figure 1 from 2019 through 2024.

Fig. 1. Number of articles on advancements in ML applications for the Pharmaceutical, Biomedical, and Healthcare Industries vs. Year

1. Pharmaceutical Industries

Table 1 below shows a quantitative distribution by publisher of the number of articles related to the advancements in ML applications for the Pharmaceutical Industries.

Table 1. Number of articles from different publishers reviewed on the advancements in ML applications for the Pharmaceutical Industries

Publisher	Number of Articles Reviewed
Springer	9
MDPI	5
Elsevier	4
ACS Publications	2
Frontiers	2
IEEE	2
Oxford University Press	2
PLOS	2
ETFLIN	1
Journal of Advanced Zoology	1
LPPM ISB Atma Luhur	1
Royal Society of Chemistry	1
The American Association for Cancer Research (AACR)	1
Total	33

Zoffmann and his team (2019) designed a semi-automated system that merges high-content screening with ML to process phenotypic data from 1.5 million compounds, which resulted in the discovery of new antibacterial agents with unique mechanisms of action. The development of the LOED metric showed that antibacterial effects could be detected at concentrations below the minimum inhibitory concentration (MIC), which broadened the chemical space available for antibiotic discovery [20]. The study by Galata et al. (2019) utilized ANNs to estimate the dissolution profiles of extended-release tablets through spectroscopy data analysis. The ANN model achieved better accuracy than PLS models by utilizing near-infrared and Raman spectroscopy data, which led to enhanced precision in pharmaceutical formulation analysis [21]. Ruano-Ordás et al. (2019) developed the Drug Discovery Multiple Classifier System (D2-MCS) tool, which addressed high-dimensional datasets in drug discovery by segmenting molecular data into feature-based groups and selecting the best classifier for each group. The multi-classifier system achieved superior predictive results compared to single-model approaches, which improved the discovery of biologically active compounds [22]. The research of Abbas et al. (2020) introduced a blockchain-based drug supply chain management and recommendation system that used N-gram and LightGBM models trained on drug reviews to build consumer trust while reducing counterfeit drug circulation. The research showed how ML can enhance both transparency and recommendation precision in pharmaceutical supply chains [23]. Sturm et al. (2020) investigated deep learning methods to predict drug targets using the ExCAPE-DB dataset and demonstrated the superiority of these methods compared to conventional ML techniques. The ML models used by researchers achieved high predictive accuracy through knowledge transfer from public datasets to internal pharmaceutical datasets, specifically in protein-ligand interaction identification [24]. Park et al. (2020) used ML to enhance lead optimization of anticancer drugs that failed during phase III trials. The researchers created a deep learning framework that combines long short-term memory (LSTM) networks with convolutional neural network (CNN) architectures together with Molecule Deep Q-Networks to enhance molecular attributes, including binding affinity and toxicity. Researchers optimized failed drug candidates by enhancing both drug-likeness and synthetic accessibility scores, which showed potential for clinical success [25]. The research team led by Ong in 2020 created Vaxign-ML, which is a supervised ML system that uses five different ML techniques to enhance the prediction of bacterial protective antigens (BPAgs) for vaccine development while they validated their results through cross-validation methods. The research showed that eXtreme Gradient Boosting (XGBoost) outperformed existing predictive tools for BPAgs, and the model was released to the public via a web server and GitHub [26]. Mohsen et al. (2021) used deep learning techniques to create models that predicted adverse drug reactions (ADRs) by integrating Open TG-GATEs gene expression data and FAERS-reported ADRs. The deep neural network (DNN)-trained model reached a mean accuracy of 89.94% across 14 predictive models and successfully identified drug-induced duodenal ulcers and hepatitis fulminant with area under the curve (AUC) values between 0.76 and 0.99 [27]. Masumshah and his team (2021) developed the Neural Network-based Polypharmacy Side Effects Prediction (NNPS) model, which utilizes mono side effects and drug-protein interaction data to evaluate drug-drug interactions (DDIs). The NNPS model surpassed five existing approaches by increasing the Area Under the Receiver-Operating Characteristic (AUROC) by 9.2% while cutting computation time down from 15 days to 8 hours [28]. A Bayesian optimization algorithm was used by Narayanan et al. (2021) to enhance biopharmaceutical formulation processes, which minimized experimental demands while facilitating concurrent optimization of various properties. Their approach cut down the needed experiments to a third of standard methods, which led to improved developability of biologic drugs [29]. Pandi and colleagues (2021) created a ML system using RF as the main algorithm to categorize pharmacogenomic variants according to their functional effects, which achieved 85% accuracy and an AUC score of 0.92. The researchers demonstrated their model's effectiveness for both whole genome sequencing and targeted pharmacogenomic data, which showed promise for personalized medicine by prioritizing genetic variants [30]. The research by Wang et al. (2021) presented DeepDRK as a deep learning platform that merges multi-omics data to find new use cases for existing drugs in cancer therapy. DeepDRK demonstrated high predictive accuracy through the analysis of more than 20,000 drug-cell line pairs by achieving an AUC of 0.84, which helped identify effective drug repurposing candidates [31]. Wang et al. (2022) implemented ML algorithms RF and XGBoost to forecast metabolic drug interactions concerning cytochrome P450 isozymes, which improved DDI assessment and assisted clinical pharmacy. The study's methodology combined multiple chemical descriptors with consensus-based predictions to achieve 80% internal validation accuracy and 79.5% external validation accuracy, which identified 54,013 potential drug interaction pairs [32]. Zhu and Dupuy (2022) utilized a machine-learning system that combined biological knowledge to examine drug response mechanisms in cancer while highlighting the pathways that determine drug sensitivity and resistance. The researchers found critical biological elements that influence treatment outcomes for GPX4, BRAF, and microtubule inhibitors and discovered new resistance pathways like NOTCH3/PAX8 signaling during paclitaxel therapy [33]. Han et al. (2022) developed a model using XGBoost to identify new target-disease connections in the Open Targets platform by combining various biological features such as tissue specificity and protein-protein interactions, which achieved an area under the precision-recall curve of 0.73 during validation [34]. Qureshi et al. (2022) implemented an XGBoost classifier to build a personalized drug response prediction model that combines molecular dynamics simulations and clinical data for predicting lung cancer patients' reactions to targeted therapy. The model obtained a 97.5% accuracy rate, which demonstrated the importance of geometric features for drug-target interactions [35]. Using ML techniques, Goldwaser et al. (2022) successfully identified inhibitors that target the cytochrome P450 2C9 (CYP2C9) enzyme to prevent harmful DDIs. The predictive models developed from public databases demonstrated about 80% accuracy when validated in vitro assays confirmed the inhibitory properties of the identified compounds [36]. The study conducted by Rahman et al. (2022) improved antibacterial drug discovery through directed-message passing neural networks, which boosted hit rates for FDA-approved compounds and natural products by over 14 times compared to conventional screening methods [37]. Badwan et al. (2023) examined how ML algorithms can be used in oncology to improve drug efficacy and toxicity predictions through enhanced disease state and therapeutic agent representation. The research findings demonstrated the expanding influence of ML tools in the fields of drug discovery and repurposing while emphasizing the need for a comprehensive understanding of ML techniques to enhance cancer therapy approaches [38]. The research conducted by Bannigan et al. (2023) used ML to speed up polymeric long-acting injectables (LAIs) development and found the Light Gradient Boosting Machine (LGBM) model to be the most precise in forecasting drug release with a mean absolute error of 0.125. They demonstrated that ML-driven predictive models can make pharmaceutical manufacturing more efficient by cutting down both development time and costs compared to traditional formulation development methods [39]. The research team led by Hou (2023) developed a ML-based data analysis approach for identifying ligands with DNA-encoded libraries (DELs) in cell-based selection processes. The research used a Maximum A Posteriori (MAP) loss function to lessen noisy data effects, which improved hit identification and structure–activity relationships (SAR) accuracy for therapeutic compound research [40]. A ML model built by Vojjala et al. (2023) fills gaps in pharmacy cost information within claims data to boost data completeness and refine healthcare cost evaluations. The ML model constructed from a fully informative dataset demonstrated superior performance compared to conventional imputation methods through enhanced prediction accuracy and reliability for pharmacy costs [41]. Patel and colleagues (2023) developed DE-INTERACT, which uses ML to predict interactions between drugs and excipients during pharmaceutical development. Through experimental studies, researchers validated the tool using paracetamol and vanillin as case studies, which confirmed its capability to predict significant drug-excitement interactions important for drug formulation [42]. Pirzada et al. (2023) used ML techniques to discover small-molecule glycogen synthase kinase 3 (GSK3) inhibitors, which could serve as COVID-19 treatment options. Their analysis of ChEMBL database datasets enabled predictive models to select selinexor and ruboxistaurin as GSK3 inhibitors, while molecular dynamics simulations validated their stability and potential efficacy [43]. Shin and colleagues (2023) applied ML techniques to create structure-activity relationship models, which helped identify new phytochemicals that block the glucocorticoid receptor and showed potential to fight obesity. The two-step workflow produced 65 potential compounds, with nine receiving validation and demethylzeylasteral emerging as a promising therapeutic agent [44]. Asha et al. (2024) merged ML techniques with blockchain systems to advance drug discovery and development processes by using generative adversarial networks (GANs) for creating molecules and deep learning methods to predict drug targets alongside reinforcement learning strategies for designing clinical trials. The research demonstrated advancements in precision and efficiency and increased security in drug research that may transform pharmaceutical innovation processes [45]. The study by Arunkumar and Baskaran (2024) implemented ANNs for predicting DDIs by analyzing pharmacokinetic and pharmacodynamic data obtained from Lexi-Comp and Vidal databases. The research showed how ML could improve pharmacovigilance with multi-layer perceptron models reaching an F1 score of 82% for minor interactions and 54% for major interactions [46]. The research conducted by Singh and Kaewprapha (2024) utilized the You Only Look Once (YOLOv7) model to improve real-time detection of defective and QC-approved tablets during pharmaceutical production, which resulted in 97.5% accuracy, thus showcasing ML capabilities for QC [47]. Bello et al. (2024) developed a ML approach using speckle pattern imaging to categorize parenteral artificial nutrition pharmaceutical suspensions with RF and Multi-Layer Perceptron algorithms to enhance traditional optical analysis methods. The research demonstrated that statistical imaging techniques combined with ML algorithms provide accurate drug classification methods [48]. Cysewski et al. (2024) investigated active pharmaceutical ingredient solubility in choline- and betaine-based deep eutectic solvents using 8014 data points and Nu Support Vector Regression (nuSVR)-based predictive models to improve solubility predictions [49]. Mustapa and Tjahyanto (2024) conducted research on ML methods to improve Total Organic Carbon (TOC) level predictions in pharmaceutical water treatment systems because accurate predictions are crucial for product integrity protection. The researchers compared linear regression, RF, and multilayer perceptron models and found that RF achieved the best predictive accuracy of 95%, which enhanced monitoring and maintenance processes [50]. Nhlapho et al. used ML to classify drug compounds based on Lipinski’s Rule of Five in 2024. The authors achieved near-perfect classification results by using RF, Extreme Gradient Boost, and Decision Tree classifiers, with RF achieving 99.94%. The research led to the creation of DrugCheckMaster, which enables efficient screening of compounds [51]. Kalaichelvan and team (2024) enhanced pharmaceutical inventory management by applying fuzzy theory and ML techniques, which combined pentagonal fuzzy numbers with the naive Bayes classifier. The proposed method demonstrated 95.9% classification accuracy and successfully tackled storage limitations while reducing inventory expenses [52].

1. Biomedical Industries

Table 2 below shows a quantitative distribution by publisher of the number of articles related to the advancements in ML applications for the Biomedical Industries.

Table 2. Number of articles on the advancements in ML applications for the Biomedical Industries by Publisher

Publisher	Number of Articles Reviewed
Springer	13
Frontiers	3
IEEE	3
MDPI	3
Elsevier	2
Oxford University Press	2
Wiley	2
arXiv (Cornell University)	1
European Alliance for Innovation (EAI)	1
European Association of Percutaneous Cardiovascular Interventions (EAPCI)	1
Massachusetts Medical Society	1
PLOS	1
Total	33

Kim et al. (2019) improved biomedical named entity recognition (BioNER) performance by employing a bootstrapping approach that combined Conditional Random Fields (CRFs) with LSTM networks, resulting in a 23.69% F1-score enhancement over traditional methods. A repeated machine-generated corpus labeling approach resulted in substantial improvements in entity recognition performance across numerous biomedical sub-domains [53]. The BioWordVec word embedding model developed by Zhang et al. (2019) addresses the shortcomings in biomedical word representations by combining subword information with Medical Subject Headings (MeSH). In biomedical natural language processing (BioNLP) tasks such as relation extraction and semantic similarity computations, their model achieved better results compared to existing word embedding methods [54]. The researchers Hathaway and colleagues (2019) used ML algorithms to categorize type 2 diabetes mellitus (T2DM) patients based on cardiac biomarkers and integrative genomics and achieved an 84% accuracy rate through Classification and Regression Trees (CART) and SHapley Additive exPlanations (SHAP). The researchers discovered meaningful associations between nuclear methylation levels and mitochondrial functionality, which relate to diabetic status, and identified potential new biomarkers for diabetes diagnostics [55]. Richens and colleagues (2020) enhanced diagnostic precision by redefining medical diagnosis as a counterfactual inference task, which allowed causal ML techniques to surpass both associative algorithms and expert clinicians in rare disease detection [56]. Martino et al. (2020) created a ML method that predicts Ki67/MIB1 labeling indices using hematoxylin and eosin-stained sections and determined nuclear hematoxylin mean optical density as the principal distinguishing element. The team developed a method that allowed fast quantitative analysis of tumor growth and enhanced pathology processing capabilities [57]. Hazra and Byun (2020) presented SynSigGAN, which is a GAN created to produce synthetic biomedical signals for building extensive and varied datasets while maintaining patient privacy. The new model demonstrated superior performance when compared to existing methods in producing high-quality electrocardiograms, electroencephalograms, electromyography, and photoplethysmography signals, which shows its usefulness for medical education and diagnostic purposes [58]. Marcinkiewicz-Siemion et al. (2020) introduced a new diagnostic panel for heart failure that combines ML with untargeted metabolomics to show that metabolite-based models reached 0.85 accuracy, matching conventional B-type natriuretic peptide (BNP) biomarkers. The study showed how ML can discover new biomarkers while underlining their clinical importance and the need for additional confirmation [59]. Gandouz et al. (2021) created a solution for biomedical decision-making challenges where they introduced asymmetric abstention intervals that led to better classification accuracy with lower rejection rates in imbalanced datasets, especially in cancer diagnostics. The research showed that ML models can be improved for better uncertainty management, which leads to increased reliability in critical medical decisions [60]. The research by Gu et al. (2021) produced Galaxy-ML, which delivers scalability and user-friendly integration of multiple ML tools to enhance both accessibility and reproducibility in biomedical ML applications. A benchmark analysis of 4,028 models across 276 datasets highlighted boosted tree models as top performers and demonstrated platform versatility through drug response prediction and deep learning validation applications [61]. The 2021 study by Du et al. introduced a deep learning method specifically designed for analyzing coronary angiography, which combines clinical and imaging data from multiple sources to improve diagnostic accuracy. The model demonstrated excellent performance with 98.4% accuracy in coronary segment recognition and strong F1 scores between 0.802 and 0.854 for lesion morphology detection, which enhanced diagnostic efficiency [62]. Akazawa et al. (2021) applied ML techniques to develop several predictive models for postpartum hemorrhage (PPH) in vaginal births based on clinical data. The top model's performance achieved a moderate AUC score of 0.708 yet faced limitations due to high false positive and false negative rates, which suggests that larger datasets and more predictive variables are necessary to improve results [63]. Feng et al. (2021) used ML techniques to identify substantial fibrosis in non-alcoholic fatty liver disease (NAFLD) patients, which resulted in better diagnostic performance than traditional fibrosis tests. The ML algorithm performed significantly better than logistic regression (LR) and traditional biomarkers with an AUROC score of 0.902 in the training cohort and 0.893 in the validation cohort [64]. The research by Kim (2022) implemented ML techniques using radiomic features to distinguish COVID-19 from pneumonia through chest X-ray imaging, which showcased the capabilities of automated diagnostic systems in biomedical imaging. Researchers found four primary radiomic features, which multiple classifiers analyzed, and LGBM reached the top AUC score of 0.900, showing its strong ability to differentiate between the conditions [65]. Bonidia et al. (2022) introduced BioAutoML to enhance automated learning within bioinformatics through an automated pipeline that handles feature engineering and model selection for bacterial noncoding ribonucleic acids (RNAs) prediction. This tool enhanced feature extraction and algorithm recommendation processes and perfected hyperparameter tuning to surpass established Automated ML (AutoML) frameworks RECIPE and TPOT in both efficiency and predictive accuracy [66]. Fu et al. (2022) successfully used ML strategies such as Least Absolute Shrinkage and Selection Operator (LASSO), SVM Recursive Feature Elimination (SVM-RFE), and RFs to detect diagnostic biomarkers for diabetic kidney disease (DKD). The examination of differentially expressed genes from microarray datasets allowed researchers to find potential diagnostic markers DUSP1 and PRKAR2B, which were confirmed using ROC analysis and then connected to immune cell infiltration patterns seen in DKD patients [67]. The research conducted by Zeng et al. (2022) resulted in KV-PLM, which combines molecular structures and biomedical text for enhanced molecular property prediction and drug discovery through deep learning. The KV-PLM system attained a molecular comprehension accuracy of 0.83 by analyzing Simplified Molecular Input Line Entry System (SMILES) strings and biomedical text together, which exceeded human professional performance [68]. Zhang et al. (2022) used ML approaches like weighted gene co-expression network analysis (WGCNA) and LASSO to find metabolism-related biomarkers for diabetic nephropathy (DN). Research demonstrated that the genes ADI1 and POLR2B are critical to DN development and show links to immune cell infiltration, which indicates promising avenues for diagnosis and treatment strategies [69]. Akatsuka et al. (2022) combined ultrasound imaging with clinical data and used ML techniques to improve prostate cancer detection accuracy. The integration of ultrasound imaging with clinical data in their model increased the AUC from 0.691 to 0.835, which showed enhanced detection of high-grade prostate cancer [70]. Jan et al. (2023) introduced an AI model that merged radiomics and deep learning features from computerized tomography (CT) images to classify ovarian tumors as benign or malignant and achieved 82% accuracy, which surpassed the performance of junior radiologists. Their study revealed AI's capability to improve diagnostic accuracy in medical imaging [71]. Through the application of ML to transcriptomic data from bovine embryos, Rabaglino et al. (2023) successfully detected crucial gene patterns that can predict embryonic competence with more than 85% accuracy across different datasets. The research offered important findings about reproductive efficiency via ML, which showed how large biological datasets could be combined for prediction tasks [72]. Rana and Bhushan (2023) performed a systematic evaluation of ML and deep learning (DL) methods for medical image analysis and found that DL techniques like CNNs demonstrated exceptional classification accuracy of 97.6% for MRI-based disease detection [73]. Jungo and Hewer (2023) demonstrated how ML techniques could be applied to histopathology using Microsoft Custom Vision and Google AutoML as code-free platforms to achieve precision and recall rates up to 98.4% while classifying central nervous system tumor images. The research results demonstrated ML tools' usability for non-experts and highlighted external validation as a method to prevent accuracy overestimation [74]. In their research from 2023, Shuryak et al. used a RF algorithm to process radiation-responsive biomarkers along with blood cell counts, which improved biodosimetry techniques to distinguish radiation exposure between partial-body and complete-body exposures. The model achieved high accuracy as shown by an AUROC of 0.944, which supports its application in radiological emergency response [75]. Sun et al. (2023) combined bioinformatics with ML to discover shared biomarkers between chronic obstructive pulmonary disease and atrial fibrillation and identified cyclin-dependent kinase 8 (CDK8) as a critical biomarker and a possible therapeutic target. The research team used gene co-expression analysis alongside immune cell infiltration assessment and drug prediction to discover 20 drugs that may target CDK8 [76]. Azari et al. (2023) used SVM, RF, and k-Nearest Neighbors (KNNs) to analyze The Cancer Genome Atlas (TCGA) data for gastric cancer miRNA biomarker identification, which achieved its highest classification accuracy through SVM at 93%. Through their study, researchers showed how miRNA biomarkers contribute to early detection and prognosis by associating their dysregulation with cancer pathways, including Wnt signaling [77]. Su et al. (2024) investigated few-shot BioNER by transforming it into a machine reading comprehension task and created "grape" demonstrations to boost learning performance. By employing a demonstration-based learning approach, they achieved up to a 1.1% improvement in F1 scores over traditional sequence labeling, which proved their method's viability in situations with limited resources [78]. Wu et al. (2024) applied Bayesian optimization within ML to produce functionally graded ceramic scaffolds for bone regeneration while combining lithography-based ceramic manufacturing and micro-CT analysis to ensure structural integrity. Their method succeeded in producing scaffolds with enhanced biomechanical properties, which led to better bone growth in segmental defects [79]. The research by Islam et al. (2024) introduced an unsupervised ML technique to clean biomedical signals during cardiopulmonary resuscitation (CPR), which supports real-time medical decision-making in emergency situations. The research team's multi-modal framework improved signal fidelity while reducing noise without using labeled data and demonstrated better performance in signal-to-noise and peak signal-to-noise ratios compared to existing methods [80]. Slonopas et al. (2024) demonstrated the application of ML algorithms to biomedical imaging through a ML-based histogram equalization (ML-HE) technique that integrated reservoir computing to improve both image clarity and contrast. Their results showed substantial enhancements to image visibility, which supported better diagnostic accuracy and immediate medical decisions [81]. By exploiting both CNNs and KNN models Huan et al. (2024) created a biomedical knowledge graph for symptom phenotype analysis in coronary artery plaque. The researchers attained an AUC score of 92.5%, which helped them pinpoint essential symptom connections along with central genes and molecular pathways that relate to both inflammatory responses and lipid regulation [82]. He et al. (2024) used a U-Net CNN architecture to perform human tissue classification within biomedical inverse scattering studies by applying subspace optimization techniques for acquiring dielectric permittivity distributions. The team validated their findings using synthetic data, which demonstrated precise tissue classification and highlighted deep learning potential for medical imaging applications [83]. Lehmann et al. (2024) established a ML method to identify hypoglycemic events in diabetes patients during driving through analysis of driving behavior and gaze/head motion tracking. The model obtained an AUROC score of 0.80 ± 0.11, which confirmed that noninvasive detection techniques can improve driving safety and diabetes self-management [84]. Mercaldo et al. (2024) focused on Extreme Learning Machine (ELM) for biomedical image classification, showing its advantages in cost efficiency compared to deep learning networks. The research confirmed ELM managed to deliver similar predictive outcomes as other methods while offering substantial training cost savings, thus establishing it as a practical choice for biomedical image analysis [85].

1. Healthcare Industries

Table 3 below shows a quantitative distribution by publisher of the number of articles related to the advancements in ML applications for the Healthcare Industries.

Table 3. Number of articles on the advancements in ML applications for the Healthcare Industries by Publisher

Publisher	Number of Articles Reviewed
Springer	10
Elsevier	8
PLOS	6
IEEE	3
JMIR Publications	2
European Modern Studies Journal (EMSJ)	1
EWA Publishing	1
IOS Press	1
Oxford University Press	1
Total	33

Through the use of Hadoop-Spark for processing big data, the Naïve Bayes technique-based Big Data Predictive Analytics Model developed by Venkatesh et al. (2019) reached 97.12% accuracy in heart disease prediction. The model enabled early detection and improved health outcomes through the analysis of extensive datasets [86]. Ramkumar et al. (2019) conducted a validation study of a remote patient monitoring system for total knee arthroplasty (TKA) through the use of wearable devices together with mobile health applications. The research team attained complete continuous data collection throughout their study while patients exhibited enhanced mobility by 30% after three months of surgery and remained highly engaged with the technology, which demonstrated the importance of ML in post-surgical recovery and rehabilitation [87]. In 2019 Myers and colleagues used ML to analyze EHRs to create the FIND FH model, which identifies familial hypercholesterolemia (FH). The FIND FH model identified 1.3 million patients at high risk for FH with 85% precision, which facilitated targeted clinical evaluation and intervention processes [88]. Maarseveen et al. (2020) developed a ML workflow for identifying rheumatoid arthritis patients through EHRs. The researchers' model successfully combined two datasets to produce high-precision results with F1 scores of 0.83 and 0.82 and effectively worked across various languages and healthcare environments, which demonstrated ML's capability for efficient cohort studies [89]. A research team led by Du in 2020 created a coronary heart disease prediction model by applying the XGBoost algorithm to EHRs from a cohort of 42,000 hypertensive patients. The ML model surpassed traditional risk scales by achieving an AUC of 0.943, which highlighted big data's potential to enhance cardiovascular disease prediction accuracy [90]. El-Ganainy and colleagues (2020) presented a real-time clinical decision support system that uses Hierarchical Temporal Memory (HTM) and LSTM models for mean arterial pressure prediction in critically ill patients. Traditional models showed inferior performance compared to this system, which delivered a predictive accuracy improvement of 20% while significantly reducing decision-to-event time [91]. The 2020 study by Artzi and colleagues used ML methods to analyze EHRs from more than 588,000 pregnancies to predict gestational diabetes mellitus (GDM) with an AUROC of 0.85. This model demonstrated superior performance against traditional risk assessment tools while identifying new risk factors, including previous pregnancy glucose challenges, which improved early detection of GDM and enabled early-stage interventions [92]. Philpott-Morgan et al. (2021) directed their research toward forecasting missed outpatient appointments within the National Health Service (NHS) by applying gradient boosting machines (GBMs) to examine hospital episode statistics from 2016 through 2018. The model they developed pinpointed age and previous missed appointments as essential predictors while delivering 28.7% sensitivity and stressed the importance of targeted interventions such as personalized reminders to reduce missed appointments [93]. The study by Liu et al. (2021) showcased how ML improved bipolar disorder screening through the EarlyDetect tool, which achieved an 80.6% balanced accuracy rate and outperformed the traditional Mood Disorder Questionnaire in terms of sensitivity and specificity. The research findings demonstrated that ML methods could substantially improve mental health screening accuracy [94]. Guo et al. (2021) investigated how ML models can predict liver cirrhosis patient mortality through EHR data. Researchers used multiple ML techniques such as DNNs, RFs, and LR to evaluate mortality risks across different time periods. The DNN model surpassed traditional MELD-Na scores to reach a 0.88 AUC score for predicting 90-day mortality [95]. In 2021, Estiri and colleagues created the MLHO framework, which predicts COVID-19 patient adverse outcomes through analysis of their medical records. The analysis of 600+ features across 13,000 patient records resulted in high predictive accuracy for the model, which achieved an AUC score of 0.91 for mortality prediction. The research demonstrated the critical necessity of integrating demographic and clinical data while identifying age as a fundamental factor for determining severe outcomes [96]. Zeng and colleagues (2021) created an ensemble ML model designed to estimate hospital mortality rates among intensive care unit (ICU) patients suffering from sepsis. The method used nine distinct ML models, which they trained on the electronic ICU Collaborative Research Database (eICU-CRD) and tested on the Mart for Intensive Care III (MIMIC-III) database. The model displayed enhanced predictive performance beyond traditional severity scores like Simplified Acute Physiology Score II (SAPS II) and Sequential Organ Failure Assessment (SOFA), reaching an AUROC of 0.806 [97]. Shahbandegan and colleagues (2022) created a ML algorithm to forecast CT imaging needs in emergency departments based on triage data from 81,118 patient encounters. The ML model attained an AUROC score of 0.86, enabling better resource allocation and patient flow management while patient complaints and triage acuity determined CT scan decisions [98]. In a 2022 study by Xi et al., the researchers evaluated multiple ML algorithms performance versus LR for cardiovascular disease risk prediction in 143,043 hypertensive patients. RF paired with XGBoost and deep learning made up an ensemble model that achieved a 0.760 AUROC score, which surpassed LR and demonstrated the enhanced predictive potential of ML for cardiovascular disease risk assessment while offering preventive care opportunities [99]. Chen and Chen (2022) investigated the application of synthetic patient data within a Learning Health System (LHS) supported by ML for predicting risks of lung cancer and stroke. The study showed that recall in lung cancer risk prediction rose from 0.849 to 0.936 as the size of the dataset grew. The research highlighted that healthcare delivery can be enhanced through a progressive model improvement process that involves adding new patient data [100]. Lazzarini et al. (2022) developed a ML model to predict Acute Respiratory Distress Syndrome (ARDS) progression in COVID-19 patients by training it on data from 289,351 individuals. The LightGBM model demonstrated superior performance to clinical predictions with an AUC score of 0.695 while also identifying age, diabetes, and hypertension as significant risk factors [101]. Liao et al. (2022) used ML techniques to determine which chronic stroke patients would experience enhanced health-related quality of life (HRQOL) following rehabilitation interventions. Through their evaluation of various algorithms, they found RF reached an accuracy of 85% and KNNs reached 82.5%, using baseline HRQOL scores and muscle function as predictors [102]. Uddin and colleagues (2022) utilized ML techniques, including XGBoost, to forecast the simultaneous occurrence of major chronic diseases in their examination of chronic disease comorbidity and multimorbidity. Their model obtained an accuracy rate of 95.05% by identifying key indicators such as patient trajectory episode counts and patient network transitivity [103]. The Feature Engineering Automation Tool (FEAT) presented by La Cava et al. (2023) enables the creation of clinical prediction models that interpret EHR data while ensuring accuracy. The study showed that FEAT models analyzed data from 1,200 patients with different types of hypertension and proved to be more compact and precise than traditional approaches, which enhances their clinical application potential due to better interpretability and scalability [104]. The approach taken by Langenberger et al. (2023) involved using ML to identify patients with high healthcare costs through the analysis of healthcare claims data. When researchers evaluated multiple algorithms, among which RF and Gradient Boosting were included, they found that tree-based models had significantly better results compared to ANNs and LR by achieving high accuracy as reflected in their AUC values. The research demonstrated how ML techniques can enable healthcare organizations to predict costs and allocate resources more effectively [105]. The study by Pasieczna et al. (2023) examined frailty syndrome in heart failure patients through ML analysis of psychosocial and physical information collected from the Tilburg Frailty Indicator. The study results demonstrated that psychological aspects such as mood and irritability had greater significance than physical aspects when diagnosing frailty. The study indicated that non-physical elements play a significant role in patient treatment and recommended that medical practitioners take psychological health into account when diagnosing frailty [106]. Caratsch and colleagues (2023) introduced a complete ML solution for automated radiographic hand osteoarthritis detection that enables clinicians without programming skills to generate predictive models using no-coding platforms. The system achieved high diagnostic accuracy in rheumatological diseases by combining genetic data with AI algorithms and reached a 92% success rate for knee osteoarthritis severity prediction [107]. The study by Limketkai et al. (2023) implemented ML to classify inflammatory bowel disease (IBD) patients into three categories based on their healthcare usage patterns with an accuracy range of 81-85%. These models demonstrated superior performance compared to traditional approaches and provided important information for resource distribution and patient care management [108]. The research conducted by Kwak et al. (2023) examined sex-specific cardiovascular risk factors through a RF model that predicted a 10-year development of atherosclerotic cardiovascular disease (ASCVD). The research showed different risk factors for men and women, including a stronger link between total cholesterol and ASCVD risk in men as well as the importance of waist circumference in women. The model obtained an AUC score of 0.733 for male subjects and 0.769 for female subjects, which demonstrated its capability for individualized risk assessment [109]. Liu et al. (2023) used XGBoost integrated with Cox models to discover new post-menopausal breast cancer risk predictors through UK Biobank data analysis. ML methods, including XGBoost, demonstrated high efficiency in feature selection from extensive predictor sets, which improved risk prediction, yet novel feature augmentation showed no substantial improvement in model results [110]. Wang et al. (2024) employed a combined approach that utilized RF and Support Vector Regression alongside traditional statistical forecasting methods to study American healthcare expenditure trends. The study demonstrated the continuous escalation of healthcare costs while recommending policy measures to address the economic burden as a foundation for future research [111]. A ML model was used by Biswas et al. (2024) to examine how patient ethnicity, alongside socio-economic deprivation and existing health conditions, influences hospital stay durations after cervical decompression surgery. The results indicated that socio-economic elements play a critical role in healthcare outcomes in public health facilities and that ML applications could boost both resource management and patient care [112]. The 2024 study by Zhou investigated how ML algorithms such as KNN, recurrent neural network (RNN), CNN, and GAN serve various healthcare fields, including medical imaging and heart disease prediction as well as eye health management. The research indicated that these algorithms improve diagnostic precision and service quality but face significant challenges from data quality and patient privacy concerns [113]. While other studies explored specific ML applications in healthcare domains, Ali et al. (2024) examined multiple ML algorithms to determine their effectiveness in predicting health outcomes and improving healthcare services. The research demonstrated that RF and KNN algorithms stand out as highly effective tools that can optimize healthcare operations while enabling treatment personalization and administrative efficiency [114]. Moradpour et al. (2024) developed a multi-objective optimization framework called MOOF, which aims to improve clinical diagnostic ML model performance through the balance of accuracy, sensitivity, and specificity. The integration of Non-dominated Sorting Genetic Algorithm II (NSGA-II) with Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) optimization techniques in their framework delivered optimal solutions that enhanced ML model precision for healthcare decision-making beyond traditional methods [115]. Bhute et al. (2024) carried out research on disease detection in smart cities using multiple ML algorithms such as Gradient Boost, CatBoost, and KNN to identify heart disease and diabetes at early stages. The study results indicated that CatBoost and Gradient Boost delivered superior outcomes for heart disease detection with 89.1% and 88.6% accuracy ratings, respectively, and KNN performed less effectively for diabetes detection at 77.9% accuracy [116]. Ismukhamedova et al. (2024) combined ML technology with electronic health passports to enhance diabetes detection and optimize resources. The research established GBM as the top-performing model and found that deep learning through RNNs improved diagnostic precision [117]. Lastly, Sevukamoorthy and colleagues (2024) applied ML and GANs for cancer detection to automate risk assessment while identifying biomarkers through healthcare data analysis and imaging scans. The method they employed improved cancer detection accuracy and speed, which supported personalized treatment plans and timely medical interventions [118].

Challenges and Future Directions in the Application of ML in the Pharmaceutical, Biomedical, and Healthcare Industries
1. Challenges

Challenges in Pharmaceutical Applications: The pharmaceutical sector encounters multiple difficulties, including expensive research and development costs along with lengthy development periods and complicated regulations, but AI and ML technologies aid in analyzing voluminous datasets and enhancing clinical trial methodologies. The pharmaceutical industry faces major challenges through ethical concerns together with privacy issues and the demand for trained professionals [119]. The pharmaceutical sector faces significant challenges from data quality issues, regulatory requirements, and ethical considerations when implementing AI and ML technologies [5]. A lack of strategy combined with difficulty finding talent and functional silos, as well as insufficient management support and behavioral change, results in delays for ML projects in the pharmaceutical sector [120]. Patient safety requirements and strict industry regulations restrict AI implementation within the pharmaceutical field [119].
Challenges in Biomedical Applications: Smart manufacturing faces obstacles when implementing ML techniques because of data management and privacy issues, an inadequate skilled workforce, and the requirement for collaboration between manufacturers and technology providers [121]. The reproduction of data-based prejudices together with ethical and moral dilemmas stands as a major concern [122]. In nano-scale biomedical engineering fields remain under-explored with the application of ML despite efforts while research challenges continue to exist in structure and material design and simulations as well as communications and signal processing and biomedicine applications [123]. The main challenges in ML include choosing effective learning algorithms and feature representation techniques [124]. The inaccurate labeling of diseases within EHRs, the inclusion of multiple underlying endotypes in conditions, and the scarcity of data from healthy individuals create difficulties when applying standard ML models [125].
Challenges in Healthcare Applications: Security and privacy of sensitive patient data stand as the highest concern while processing information because it goes hand in hand with challenges concerning data quality and systems interoperability together with ethical issues about algorithm bias and transparency. The integration process becomes more difficult due to regulatory obstacles and healthcare professionals' reluctance to accept change. As healthcare providers utilize ML insights more frequently, ethical issues become evident, including patient data privacy, informed consent requirements, and the demand for open and unbiased algorithms [126]. The use of ML models could worsen current inequalities that result in fairness issues, including unequal resource distribution and diagnostic mistakes among different demographic groups [127].
1. Future Directions

Future Directions in Pharmaceutical Applications: The pharmaceutical industry will see ML applications in de novo drug discovery along with biometric data analysis from wearable devices plus enhancements in pharmacovigilance manufacturing and supply chains. The main aim focuses on minimizing both time and resources used during drug discovery and development by optimizing high-throughput screening methods while eliminating animal testing [2]. The integration of AI and ML technologies promises to accelerate the creation of improved medicines and drug development workflows, which will enhance the well-being of millions of patients [128]. Modern supercomputers enable ML to establish a steady and lasting flow of new medicines that arrive at the market faster and with lowered expenses. Human-machine cooperation defines future trends as clinical professionals learn to advance with technological innovations [129].
Future Directions in Biomedical Applications: The next wave of research needs to investigate structure and material design along with simulations, communications, signal processing, and nano-scale biomedical applications in biomedical engineering [123]. ML represents a transformative force for healthcare by combining human expertise and machine capabilities to improve precision and efficiency while driving innovation in biomedical device and pharmaceutical production. Human-machine interaction in biomedical manufacturing must address data security concerns along with workforce transition and regulatory compliance to achieve its full potential [130]. Researchers are developing privacy-preserving collaborative ML through alternative privacy protection strategies that utilize cryptographic methods such as homomorphic encryption and secure multiparty computation [131]. The combined use of ML with magnesium-based biomedical technologies holds great promise to transform personalized medicine through new research opportunities and clinical applications [132].
Future Directions in Healthcare Applications: ML adoption in healthcare has led to significant strides that have revolutionized medical diagnosis and treatment methods as well as patient care delivery. Predictive analytics will help detect health problems early, while wearable devices and remote monitoring will enhance continual patient care, and ML combined with genomics will create personalized medicine solutions [133]. The ongoing development of technology requires AI ethics and responsible AI principles to become essential in constructing healthcare's ethical framework [126]. To achieve effective and fair use in pediatric care, it is crucial to work on increasing data diversity along with creating standardized ethical guidelines and boosting model transparency [134]. We need additional studies to confirm whether these technologies meet regulatory standards while maintaining patient care quality and security privacy ethics [135].

CONCLUSIONS

The paper examines how ML has brought transformative changes to numerous industries with a special emphasis on healthcare and pharmaceuticals. The application of ML methods to biomarker identification has produced high classification rates for conditions like gastric cancer and diabetic kidney disease, with SVM models reaching up to 93% accuracy. ML integration into drug discovery processes enables optimized identification of potential drugs and enhances adverse reaction predictions. The integration of Vaxign-ML and deep learning frameworks has enhanced molecular properties, which shows their promising role in clinical achievements. BioAutoML and similar automated learning systems improve feature selection and model selection processes while delivering better efficiency and predictive accuracy than traditional methods. The study stresses the importance of continuous research to ensure ML technologies fulfill regulatory requirements and uphold ethical medical standards. Upcoming advancements suggest the combination of wearable technology with predictive analytics will enable ongoing patient monitoring alongside customized medical treatments. The ongoing advancements in ML highlight the crucial role of AI ethics and responsible AI principles, which demand standardized guidelines for fair application in healthcare environments. ML advancements are transforming healthcare by developing more effective diagnostic tools and treatment methods while improving patient care strategies in the pharmaceutical and biomedical sectors. Researchers outline the need for additional research and validation of these technologies to both maximize benefits and resolve ethical issues.

REFERENCES

Han, Y., & Tao, J. (2024). Revolutionizing Pharma: Unveiling the AI and LLM trends in the pharmaceutical industry. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2401.10273
Pargaien, A. V., Pargaien, S., Chilwal, B., Gupta, S., Nawaz, A., Adhikari, M., & Khan, F. (2023). A New Era of Machine Learning Augmenting Pharmaceutical Industry. 2023 2nd International Conference on Ambient Intelligence in Health Care (ICAIHC), 1–6. https://doi.org/10.1109/icaihc59020.2023.10430483
Likhitha, M., Surarchita, S., & Yeshamaina, S. (2024). Applications of newer technologies in enhancement of pharmaceutical industry. International Journal of Innovative Science and Research Technology (IJISRT), 9(9), 2697–2707. https://doi.org/10.38124/ijisrt/ijisrt24sep1485
Mottaghi-Dastjerdi, N., & Soltany-Rezaee-Rad, M. (2024). Advancements and Applications of Artificial Intelligence in Pharmaceutical Sciences: A Comprehensive review. Iranian Journal of Pharmaceutical Research (IJPR), 23(1), e150510. https://doi.org/10.5812/ijpr-150510
Al Rammadan, A. H. A., Almarhoon, M. W., Almarhoon, H. W. A., Alomran, A. R., Al Amer, H. M. T., Hirabah, J. A., Alkhamees, M. S. A., Almari, J. K., Al Jeadah, N. H., Almalki, A. M., Alquaymi, A. E., Alsaleem, A. A., Alsanawi, H. A., & Almajhad, H. A. E. (2022). Leveraging Artificial Intelligence And Machine Learning In Drug Development: Opportunities And Challenges. Scientific Journal for Research Publishing, 2. https://sjr-publishing.com/wp-content/uploads/2019/03/Leveraging-Artificial-Intelligence-And-Machine-Learning-In-Drug-Development-Opportunities-And-Challenges-1.pdf
Khalid, W., Khalid, M. Y., Hena, M., Sarwar, A., & Iqbal, S. (2023). Advancing Pharmaceuticals with Machine Learning: A Short Review of Research and Development Applications. Pharmaceutical Communications, 2(1), 63–69. https://doi.org/10.55627/pharma.002.01.0297
Nilima, S. I., Hossain, M. A., Sharmin, S., Rahman, R., Esa, H., Manik, M. M. T. G., & Hasan, R. (2024). Advancement of Drug Discovery Using Artificial Intelligence and Machine Learning. 2024 IEEE International Conference on Computing, Applications and Systems (COMPAS), 1–7. https://doi.org/10.1109/compas60761.2024.10796748
Nagy, B., Galata, D. L., Farkas, A., & Nagy, Z. K. (2022). Application of artificial neural networks in the Process Analytical Technology of pharmaceutical Manufacturing—A review. The AAPS Journal, 24, 74. https://doi.org/10.1208/s12248-022-00706-0
O’Mahony, N., Murphy, T., Panduru, K., Riordan, D., & Walsh, J. (2017). Real-time monitoring of powder blend composition using near infrared spectroscopy. 2017 Eleventh International Conference on Sensing Technology (ICST), 1–6. https://doi.org/10.1109/icsenst.2017.8304431
Sathyabhama, B., & Menon, S. (2018). CDISC SDTM - an Automated Approach. PhUSE EU Connect 2018, 1–6. https://www.lexjansen.com/phuse/2018/si/SI11.pdf
Jain, D., Chandra, P., Ali, Z., Fatma, N., & Khan, H. (2024). A Comprehensive Investigation: Developing the Pharmaceutical Industry through Artificial Intelligence. Current Drug Discovery Technologies, 21, e15701638313233. https://doi.org/10.2174/0115701638313233240830132804
Ogunye, R. O., Egwuatu, D., Anene, P. C., Azubuike, E. O., Asenuga, O. O., Sargwak, J. P., Ojobor, J.-F. C., Onoharigho, F. O., & Nwokafor, C. V. (2024). The Impact of Emerging Technologies on pharmaceutical process design and optimization in Africa: a review. Journal of Pharmaceutical Research International, 36(9), 46–60. https://doi.org/10.9734/jpri/2024/v36i97576
Markus, B., C, G. C., Andreas, K., Arkadij, K., Stefan, L., Gustav, O., Elina, S., & Radka, S. (2023). Accelerating Biocatalysis Discovery with Machine Learning: A Paradigm Shift in Enzyme Engineering, Discovery, and Design. ACS Catalysis, 13(21), 14454–14469. https://doi.org/10.1021/acscatal.3c03417
Smaldone, A. M., Shee, Y., Kyro, G. W., Xu, C., Vu, N. P., Dutta, R., Farag, M. H., Galda, A., Kumar, S., Kyoseva, E., & Batista, V. S. (2024). Quantum Machine Learning in Drug Discovery: Applications in academia and pharmaceutical industries. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2409.15645
Schiro, F., & Agaian, S. (2022). A machine-learning approach to predicting new pharmaceutical successes in clinical trials. Proceedings Volume 12100, Multimodal Image Exploitation and Learning 2022, 1210009. https://doi.org/10.1117/12.2619069
PhaseV. (2024). Seven Top Pharma Companies Adopt PhaseV’s Machine Learning Technology for Drug Development. PR Newswire. Retrieved March 18, 2025, from https://www.prnewswire.com/news-releases/seven-top-pharma-companies-adopt-phasevs-machine-learning-technology-for-drug-development-302315425.html
Chen, F. (2025). Top 10 Pharmaceutical Companies Using AI and Machine Learning in Drug Discovery (2024) [Online forum post]. LinkedIn. Retrieved March 18, 2025, from https://www.linkedin.com/posts/fengqian-chen_top-10-pharmaceutical-companies-using-ai-activity-7281107628208844800-0Y0H/
Shah-Neville, W. (2024). Five AI drug discovery companies you should know about. Labiotech.eu. Retrieved March 18, 2025, from https://www.labiotech.eu/best-biotech/ai-drug-discovery-companies/
Thomas, M. (2024). 24 Machine learning in healthcare examples. Built In. Retrieved March 18, 2025, from https://builtin.com/artificial-intelligence/machine-learning-healthcare
Zoffmann, S., Vercruysse, M., Benmansour, F., Maunz, A., Wolf, L., Marti, R. B., Heckel, T., Ding, H., Truong, H. H., Prummer, M., Schmucki, R., Mason, C. S., Bradley, K., Jacob, A. I., Lerner, C., Del Rosario, A. A., Burcin, M., Amrein, K. E., & Prunotto, M. (2019). Machine learning-powered antibiotics phenotypic drug discovery. Scientific Reports, 9, 5013. https://doi.org/10.1038/s41598-019-39387-9
Galata, D. L., Farkas, A., Könyves, Z., Mészáros, L. A., Szabó, E., Csontos, I., Pálos, A., Marosi, G., Nagy, Z. K., & Nagy, B. (2019). Fast, Spectroscopy-Based prediction of in vitro dissolution profile of extended release tablets using artificial neural networks. Pharmaceutics, 11(8), 400. https://doi.org/10.3390/pharmaceutics11080400
Ruano-Ordás, D., Yevseyeva, I., Fernandes, V. B., Méndez, J. R., & Emmerich, M. T. M. (2019). Improving the drug discovery process by using multiple classifier systems. Expert Systems With Applications, 121, 292–303. https://doi.org/10.1016/j.eswa.2018.12.032
Abbas, K., Afaq, M., Khan, T. A., & Song, W.-C. (2020). A Blockchain and Machine Learning-Based drug supply chain management and recommendation system for smart pharmaceutical industry. Electronics, 9(5), 852. https://doi.org/10.3390/electronics9050852
Sturm, N., Mayr, A., Van, T. L., Chupakhin, V., Ceulemans, H., Wegner, J., Golib-Dzib, J.-F., Jeliazkova, N., Vandriessche, Y., Böhm, S., Cima, V., Martinovic, J., Greene, N., Aa, T. V., Ashby, T. J., Hochreiter, S., Engkvist, O., Klambauer, G., & Chen, H. (2020). Industry-scale application and evaluation of deep learning for drug target prediction. Journal of Cheminformatics, 12, 26. https://doi.org/10.1186/s13321-020-00428-5
Park, S., Ko, Y. H., Lee, B., Shin, B., & Beck, B. R. (2020). Abstract 35: Molecular optimization of phase III trial failed anticancer drugs using target affinity and toxicity-centered multiple properties reinforcement learning. Clinical Cancer Research, 26(12_Supplement_1), 35. https://doi.org/10.1158/1557-3265.advprecmed20-35
Ong, E., Wang, H., Wong, M. U., Seetharaman, M., Valdez, N., & He, Y. (2020). Vaxign-ML: supervised machine learning reverse vaccinology model for improved prediction of bacterial protective antigens. Bioinformatics, 36(10), 3185–3191. https://doi.org/10.1093/bioinformatics/btaa119
Mohsen, A., Tripathi, L. P., & Mizuguchi, K. (2021). Deep learning prediction of adverse drug reactions in drug discovery using open TG–GATEs and FAERS databases. Frontiers in Drug Discovery, 1, 768792. https://doi.org/10.3389/fddsv.2021.768792
Masumshah, R., Aghdam, R., & Eslahchi, C. (2021). A neural network-based method for polypharmacy side effects prediction. BMC Bioinformatics, 22, 385. https://doi.org/10.1186/s12859-021-04298-y
Narayanan, H., Dingfelder, F., Morales, I. C., Patel, B., Heding, K. E., Bjelke, J. R., Egebjerg, T., Butté, A., Sokolov, M., Lorenzen, N., & Arosio, P. (2021). Design of biopharmaceutical formulations accelerated by machine learning. Molecular Pharmaceutics, 18(10), 3843–3853. https://doi.org/10.1021/acs.molpharmaceut.1c00469
Pandi, M.-T., Koromina, M., Tsafaridis, I., Patsilinakos, S., Christoforou, E., Van Der Spek, P. J., & Patrinos, G. P. (2021). A novel machine learning-based approach for the computational functional assessment of pharmacogenomic variants. Human Genomics, 15, 51. https://doi.org/10.1186/s40246-021-00352-1
Wang, Y., Yang, Y., Chen, S., & Wang, J. (2021). DeepDRK: a deep learning framework for drug repurposing through kernel-based multi-omics integration. Briefings in Bioinformatics, 22(5), bbab048. https://doi.org/10.1093/bib/bbab048
Wang, N.-N., Wang, X.-G., Xiong, G.-L., Yang, Z.-Y., Lu, A.-P., Chen, X., Liu, S., Hou, T.-J., & Cao, D.-S. (2022). Machine learning to predict metabolic drug interactions related to cytochrome P450 isozymes. Journal of Cheminformatics, 14, 23. https://doi.org/10.1186/s13321-022-00602-x
Zhu, E. Y., & Dupuy, A. J. (2022). Machine learning approach informs biology of cancer drug response. BMC Bioinformatics, 23, 184. https://doi.org/10.1186/s12859-022-04720-z
Han, Y., Klinger, K., Rajpal, D. K., Zhu, C., & Teeple, E. (2022). Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform. BMC Bioinformatics, 23, 232. https://doi.org/10.1186/s12859-022-04753-4
Qureshi, R., Basit, S. A., Shamsi, J. A., Fan, X., Nawaz, M., Yan, H., & Alam, T. (2022). Machine learning based personalized drug response prediction for lung cancer patients. Scientific Reports, 12, 18935. https://doi.org/10.1038/s41598-022-23649-0
Goldwaser, E., Laurent, C., Lagarde, N., Fabrega, S., Nay, L., Villoutreix, B. O., Jelsch, C., Nicot, A. B., Loriot, M.-A., & Miteva, M. A. (2022). Machine learning-driven identification of drugs inhibiting cytochrome P450 2C9. PLoS Computational Biology, 18(1), e1009820. https://doi.org/10.1371/journal.pcbi.1009820
Rahman, A. S. M. Z., Liu, C., Sturm, H., Hogan, A. M., Davis, R., Hu, P., & Cardona, S. T. (2022). A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery. PLoS Computational Biology, 18(10), e1010613. https://doi.org/10.1371/journal.pcbi.1010613
Badwan, B. A., Liaropoulos, G., Kyrodimos, E., Skaltsas, D., Tsirigos, A., & Gorgoulis, V. G. (2023). Machine learning approaches to predict drug efficacy and toxicity in oncology. Cell Reports Methods, 3(2), 100413. https://doi.org/10.1016/j.crmeth.2023.100413
Bannigan, P., Bao, Z., Hickman, R. J., Aldeghi, M., Häse, F., Aspuru-Guzik, A., & Allen, C. (2023). Machine learning models to accelerate the design of polymeric long-acting injectables. Nature Communications, 14, 35. https://doi.org/10.1038/s41467-022-35343-w
Hou, R., Xie, C., Gui, Y., Li, G., & Li, X. (2023). Machine-Learning-Based data analysis method for Cell-Based selection of DNA-Encoded libraries. ACS Omega, 8(21), 19057–19071. https://doi.org/10.1021/acsomega.3c02152
Vojjala, S. K., Barron, J., Kumar, A., Grabner, M., Eshete, B., Tan, H., & Willey, V. (2023). P21 Machine learning for imputing missing pharmacy costs in claims data. Value in Health, 26(6), S5. https://doi.org/10.1016/j.jval.2023.03.034
Patel, S., Patel, M., Kulkarni, M., & Patel, M. S. (2023). DE-INTERACT: A machine-learning-based predictive tool for the drug-excipient interaction study during product development—Validation through paracetamol and vanillin as a case study. International Journal of Pharmaceutics, 637, 122839. https://doi.org/10.1016/j.ijpharm.2023.122839
Pirzada, R. H., Ahmad, B., Qayyum, N., & Choi, S. (2023). Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics. Frontiers in Endocrinology, 14, 1084327. https://doi.org/10.3389/fendo.2023.1084327
Shin, S. H., Hur, G., Kim, N. R., Park, J. H. Y., Lee, K. W., & Yang, H. (2023). A machine learning-integrated stepwise method to discover novel anti-obesity phytochemicals that antagonize the glucocorticoid receptor. Food & Function, 14(4), 1869–1883. https://doi.org/10.1039/d2fo03466b
Asha, V., Anjimoon, S., Verma, M. R., Singla, A., Khan, I., & Alkhafaji, M. A. (2024). Machine Learning-Driven Blockchain for Enhanced Drug Discovery and Development in Pharmaceutical Research. 2024 OPJU International Technology Conference (OTCON) on Smart Computing for Innovation and Advancement in Industry 4.0, 1–6. https://doi.org/10.1109/otcon60325.2024.10688287
Arunkumar, M., & Baskaran, T. S. (2024). Use of machine learning for intelligence detection for pharmaceutical Drug-Drug interactions. Journal of Advanced Zoology, 45(S4), 478–484. https://doi.org/10.53555/jaz.v45is4.4309
Singh, V., & Kaewprapha, P. (2024). Machine Learning Application for Precise Identification of Defective and QC-Approved Tablets in Pharmaceutical Manufacturing. 2024 12th International Electrical Engineering Congress (iEECON), 01–05. https://doi.org/10.1109/ieecon60677.2024.10537855
Bello, V., Coghe, L., Gerbasi, A., Figus, E., Dagliati, A., & Merlo, S. (2024). Machine Learning-Based Approach towards Identification of Pharmaceutical Suspensions Exploiting Speckle Pattern Images. Sensors, 24(20), 6635. https://doi.org/10.3390/s24206635
Cysewski, P., Jeli?ski, T., & Przyby?ek, M. (2024). Exploration of the solubility hyperspace of selected active pharmaceutical ingredients in choline- and Betaine-Based deep eutectic solvents: machine learning modeling and experimental validation. Molecules, 29(20), 4894. https://doi.org/10.3390/molecules29204894
Mustapa, D. R., & Tjahyanto, A. (2024). Comparative analysis: Machine learning algorithms for TOC prediction in pharmaceutical water treatment systems. Jurnal Sisfokom (Sistem Informasi Dan Komputer), 13(2), 253–260. https://doi.org/10.32736/sisfokom.v13i2.2148
Nhlapho, S., Nyathi, M. H. L., Ngwenya, B. L., Dube, T., Telukdarie, A., Munien, I., Vermeulen, A., & Chude-Okonkwo, U. A. K. (2024). Druggability of Pharmaceutical Compounds Using Lipinski Rules with Machine Learning. Sciences of Pharmacy, 3(4), 177–192. https://doi.org/10.58920/sciphar0304264
Kalaichelvan, K., Ramalingam, S., Dhandapani, P. B., Leiva, V., & Castro, C. (2024). Optimizing the economic order quantity using fuzzy theory and machine learning applied to a pharmaceutical framework. Mathematics, 12(6), 819. https://doi.org/10.3390/math12060819
Kim, J., Ko, Y., & Seo, J. (2019). A bootstrapping approach with CRF and deep learning models for improving the biomedical named entity recognition in Multi-Domains. IEEE Access, 7, 70308–70318. https://doi.org/10.1109/access.2019.2914168
Zhang, Y., Chen, Q., Yang, Z., Lin, H., & Lu, Z. (2019). BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific Data, 6, 52. https://doi.org/10.1038/s41597-019-0055-0
Hathaway, Q. A., Roth, S. M., Pinti, M. V., Sprando, D. C., Kunovac, A., Durr, A. J., Cook, C. C., Fink, G. K., Cheuvront, T. B., Grossman, J. H., Aljahli, G. A., Taylor, A. D., Giromini, A. P., Allen, J. L., & Hollander, J. M. (2019). Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics. Cardiovascular Diabetology, 18, 78. https://doi.org/10.1186/s12933-019-0879-0
Richens, J. G., Lee, C. M., & Johri, S. (2020). Improving the accuracy of medical diagnosis with causal machine learning. Nature Communications, 11, 3923. https://doi.org/10.1038/s41467-020-17419-7
Martino, F., Varricchio, S., Russo, D., Merolla, F., Ilardi, G., Mascolo, M., Dell’Aversana, G. O., Califano, L., Toscano, G., De Pietro, G., Frucci, M., Brancati, N., Fraggetta, F., & Staibano, S. (2020). A machine-learning approach for the assessment of the proliferative compartment of solid tumors on Hematoxylin-Eosin-Stained sections. Cancers, 12(5), 1344. https://doi.org/10.3390/cancers12051344
Hazra, D., & Byun, Y.-C. (2020). SynSigGAN: Generative Adversarial Networks for Synthetic Biomedical Signal Generation. Biology, 9(12), 441. https://doi.org/10.3390/biology9120441
Marcinkiewicz-Siemion, M., Kaminski, M., Ciborowski, M., Ptaszynska-Kopczynska, K., Szpakowicz, A., Lisowska, A., Jasiewicz, M., Tarasiuk, E., Kretowski, A., Sobkowicz, B., & Kaminski, K. A. (2020). Machine-learning facilitates selection of a novel diagnostic panel of metabolites for the detection of heart failure. Scientific Reports, 10, 130. https://doi.org/10.1038/s41598-019-56889-8
Gandouz, M., Holzmann, H., & Heider, D. (2021). Machine learning with asymmetric abstention for biomedical decision-making. BMC Medical Informatics and Decision Making, 21, 294. https://doi.org/10.1186/s12911-021-01655-y
Gu, Q., Kumar, A., Bray, S., Creason, A., Khanteymoori, A., Jalili, V., Grüning, B., & Goecks, J. (2021). Galaxy-ML: An accessible, reproducible, and scalable machine learning toolkit for biomedicine. PLoS Computational Biology, 17(6), e1009014. https://doi.org/10.1371/journal.pcbi.1009014
Du, T., Xie, L., Zhang, H., Liu, X., Wang, X., Chen, D., Xu, Y., Sun, Z., Zhou, W., Song, L., Guan, C., Lansky, A. J., & Xu, B. (2021). Training and validation of a deep learning architecture for the automatic analysis of coronary angiography. EuroIntervention, 17(1), 32–40. https://doi.org/10.4244/eij-d-20-00570
Akazawa, M., Hashimoto, K., Katsuhiko, N., & Kaname, Y. (2021). Machine learning approach for the prediction of postpartum hemorrhage in vaginal birth. Scientific Reports, 11, 22620. https://doi.org/10.1038/s41598-021-02198-y
Feng, G., Zheng, K. I., Li, Y.-Y., Rios, R. S., Zhu, P.-W., Pan, X.-Y., Li, G., Ma, H.-L., Tang, L.-J., Byrne, C. D., Targher, G., He, N., Mi, M., Chen, Y.-P., & Zheng, M.-H. (2021). Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy?confirmed NAFLD. Journal of Hepato-Biliary-Pancreatic Sciences, 28(7), 593–603. https://doi.org/10.1002/jhbp.972
Kim, Y. J. (2022). Machine Learning Model Based on Radiomic Features for Differentiation between COVID-19 and Pneumonia on Chest X-ray. Sensors, 22(17), 6709. https://doi.org/10.3390/s22176709
Bonidia, R. P., Santos, A. P. A., De Almeida, B. L. S., Stadler, P. F., Da Rocha, U. N., Sanches, D. S., & De Carvalho, A. C. P. L. F. (2022). BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria. Briefings in Bioinformatics, 23(4), bbac218. https://doi.org/10.1093/bib/bbac218
Fu, S., Cheng, Y., Wang, X., Huang, J., Su, S., Wu, H., Yu, J., & Xu, Z. (2022). Identification of diagnostic gene biomarkers and immune infiltration in patients with diabetic kidney disease using machine learning strategies and bioinformatic analysis. Frontiers in Medicine, 9, 918657. https://doi.org/10.3389/fmed.2022.918657
Zeng, Z., Yao, Y., Liu, Z., & Sun, M. (2022). A deep-learning system bridging molecule structure and biomedical text with comprehension comparable to human professionals. Nature Communications, 13, 862. https://doi.org/10.1038/s41467-022-28494-3
Zhang, H., Hu, J., Zhu, J., Li, Q., & Fang, L. (2022). Machine learning-based metabolism-related genes signature and immune infiltration landscape in diabetic nephropathy. Frontiers in Endocrinology, 13, 1026938. https://doi.org/10.3389/fendo.2022.1026938
Akatsuka, J., Numata, Y., Morikawa, H., Sekine, T., Kayama, S., Mikami, H., Yanagi, M., Endo, Y., Takeda, H., Toyama, Y., Yamaguchi, R., Kimura, G., Kondo, Y., & Yamamoto, Y. (2022). A data-driven ultrasound approach discriminates pathological high grade prostate cancer. Scientific Reports, 12, 860. https://doi.org/10.1038/s41598-022-04951-3
Jan, Y.-T., Tsai, P.-S., Huang, W.-H., Chou, L.-Y., Huang, S.-C., Wang, J.-Z., Lu, P.-H., Lin, D.-C., Yen, C.-S., Teng, J.-P., Mok, G. S. P., Shih, C.-T., & Wu, T.-H. (2023). Machine learning combined with radiomics and deep learning features extracted from CT images: a novel AI model to distinguish benign from malignant ovarian tumors. Insights Into Imaging, 14, 68. https://doi.org/10.1186/s13244-023-01412-x
Rabaglino, M. B., Salilew?Wondim, D., Zolini, A., Tesfaye, D., Hoelker, M., Lonergan, P., & Hansen, P. J. (2023). Machine?learning methods applied to integrated transcriptomic data from bovine blastocysts and elongating conceptuses to identify genes predictive of embryonic competence. The FASEB Journal, 37(3), e22809. https://doi.org/10.1096/fj.202201977r
Rana, M., & Bhushan, M. (2023). Machine learning and deep learning approach for medical image analysis: diagnosis to detection. Multimedia Tools and Applications, 82, 26731–26769. https://doi.org/10.1007/s11042-022-14305-w
Jungo, P., & Hewer, E. (2023). Code-free machine learning for classification of central nervous system histopathology images. Journal of Neuropathology & Experimental Neurology, 82(3), 221–230. https://doi.org/10.1093/jnen/nlac131
Shuryak, I., Nemzow, L., Bacon, B. A., Taveras, M., Wu, X., Deoli, N., Ponnaiya, B., Garty, G., Brenner, D. J., & Turner, H. C. (2023). Machine learning approach for quantitative biodosimetry of partial-body or total-body radiation exposures by combining radiation-responsive biomarkers. Scientific Reports, 13, 949. https://doi.org/10.1038/s41598-023-28130-0
Sun, Z., Lin, J., Zhang, T., Sun, X., Wang, T., Duan, J., & Yao, K. (2023). Combining bioinformatics and machine learning to identify common mechanisms and biomarkers of chronic obstructive pulmonary disease and atrial fibrillation. Frontiers in Cardiovascular Medicine, 10, 1121102. https://doi.org/10.3389/fcvm.2023.1121102
Azari, H., Nazari, E., Mohit, R., Asadnia, A., Maftooh, M., Nassiri, M., Hassanian, S. M., Ghayour-Mobarhan, M., Shahidsales, S., Khazaei, M., Ferns, G. A., & Avan, A. (2023). Machine learning algorithms reveal potential miRNAs biomarkers in gastric cancer. Scientific Reports, 13, 6147. https://doi.org/10.1038/s41598-023-32332-x
Su, L., Chen, J., Peng, Y., & Sun, C. (2024). Demonstration-based learning for few-shot biomedical named entity recognition under machine reading comprehension. Journal of Biomedical Informatics, 159, 104739. https://doi.org/10.1016/j.jbi.2024.104739
Wu, C., Wan, B., Entezari, A., Fang, J., Xu, Y., & Li, Q. (2024). Machine learning-based design for additive manufacturing in biomedical engineering. International Journal of Mechanical Sciences, 266, 108828. https://doi.org/10.1016/j.ijmecsci.2023.108828
Islam, S., Bentahar, J., Cohen, R., & Rjoub, G. (2024). A Multi-Modal unsupervised machine learning approach for biomedical signal processing in CPR. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2411.11869
Slonopas, A., Beatty, A., & Djajalaksana, Y. (2024). Applying Reservoir Computing and Machine Learning Techniques for Image Enhancement in Biomedical Imaging. 2024 International Conference on Smart Applications, Communications and Networking (SmartNets), 1–7. https://doi.org/10.1109/smartnets61466.2024.10577705
Huan, J.-M., Wang, X.-J., Li, Y., Zhang, S.-J., Hu, Y.-L., & Li, Y.-L. (2024). The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data. BioData Mining, 17, 13. https://doi.org/10.1186/s13040-024-00365-1
He, Z., Du, N., Wang, J., & Ye, X. (2024). A Machine Learning Classification Approach for Solving Biomedical Inverse Scattering Problem. 2024 IEEE International Conference on Computational Electromagnetics (ICCEM), 1–3. https://doi.org/10.1109/iccem60619.2024.10558921
Lehmann, V., Zueger, T., Maritsch, M., Notter, M., Schallmoser, S., Bérubé, C., Albrecht, C., Kraus, M., Feuerriegel, S., Fleisch, E., Kowatsch, T., Lagger, S., Laimer, M., Wortmann, F., & Stettler, C. (2024). Machine Learning to Infer a Health State Using Biomedical Signals — Detection of Hypoglycemia in People with Diabetes while Driving Real Cars. NEJM AI, 1(3), AIoa2300013. https://doi.org/10.1056/aioa2300013
Mercaldo, F., Brunese, L., Santone, A., Martinelli, F., & Cesarelli, M. (2024). Extreme Learning Machine for Biomedical Image Classification: A Multi-Case Study. EAI Endorsed Transactions on Pervasive Health and Technology, 10, 1–8. https://doi.org/10.4108/eetpht.10.5542
Venkatesh, R., Balasubramanian, C., & Kaliappan, M. (2019). Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique. Journal of Medical Systems, 43, 272. https://doi.org/10.1007/s10916-019-1398-y
Ramkumar, P. N., Haeberle, H. S., Ramanathan, D., Cantrell, W. A., Navarro, S. M., Mont, M. A., Bloomfield, M., & Patterson, B. M. (2019). Remote patient monitoring using mobile health for total knee arthroplasty: validation of a wearable and Machine Learning–Based surveillance platform. The Journal of Arthroplasty, 34(10), 2253–2259. https://doi.org/10.1016/j.arth.2019.05.021
Myers, K. D., Knowles, J. W., Staszak, D., Shapiro, M. D., Howard, W., Yadava, M., Zuzick, D., Williamson, L., Shah, N. H., Banda, J. M., Leader, J., Cromwell, W. C., Trautman, E., Murray, M. F., Baum, S. J., Myers, S., Gidding, S. S., Wilemon, K., & Rader, D. J. (2019). Precision screening for familial hypercholesterolaemia: a machine learning study applied to electronic health encounter data. The Lancet Digital Health, 1(8), e393–e402. https://doi.org/10.1016/s2589-7500(19)30150-5
Maarseveen, T. D., Meinderink, T., Reinders, M. J. T., Knitza, J., Huizinga, T. W. J., Kleyer, A., Simon, D., Van Den Akker, E. B., & Knevel, R. (2020). Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study. JMIR Medical Informatics, 8(11), e23930. https://doi.org/10.2196/23930
Du, Z., Yang, Y., Zheng, J., Li, Q., Lin, D., Li, Y., Fan, J., Cheng, W., Chen, X., & Cai, Y. (2020). Accurate prediction of coronary heart disease for patients with hypertension from electronic health records with big data and Machine-Learning methods: model development and performance evaluation. JMIR Medical Informatics, 8(7), e17257. https://doi.org/10.2196/17257
El-Ganainy, N. O., Balasingham, I., Halvorsen, P. S., & Rosseland, L. A. (2020). A new real time clinical decision support system using machine learning for critical care units. IEEE Access, 8, 185676–185687. https://doi.org/10.1109/access.2020.3030031
Artzi, N. S., Shilo, S., Hadar, E., Rossman, H., Barbash-Hazan, S., Ben-Haroush, A., Balicer, R. D., Feldman, B., Wiznitzer, A., & Segal, E. (2020). Prediction of gestational diabetes based on nationwide electronic health records. Nature Medicine, 26, 71–76. https://doi.org/10.1038/s41591-019-0724-8
Philpott-Morgan, S., Thakrar, D. B., Symons, J., Ray, D., Ashrafian, H., & Darzi, A. (2021). Characterising the nationwide burden and predictors of unkept outpatient appointments in the National Health Service in England: A cohort study using a machine learning approach. PLoS Medicine, 18(10), e1003783. https://doi.org/10.1371/journal.pmed.1003783
Liu, Y. S., Chokka, S., Cao, B., & Chokka, P. R. (2021). Screening for bipolar disorder in a tertiary mental health centre using EarlyDetect: A machine learning-based pilot study. Journal of Affective Disorders Reports, 6, 100215. https://doi.org/10.1016/j.jadr.2021.100215
Guo, A., Mazumder, N. R., Ladner, D. P., & Foraker, R. E. (2021). Predicting mortality among patients with liver cirrhosis in electronic health records with machine learning. PLoS ONE, 16(8), e0256428. https://doi.org/10.1371/journal.pone.0256428
Estiri, H., Strasser, Z. H., & Murphy, S. N. (2021). Individualized prediction of COVID-19 adverse outcomes with MLHO. Scientific Reports, 11, 5322. https://doi.org/10.1038/s41598-021-84781-x
Zeng, Z., Yao, S., Zheng, J., & Gong, X. (2021). Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis. BioData Mining, 14, 40. https://doi.org/10.1186/s13040-021-00276-5
Shahbandegan, A., Mago, V., Alaref, A., Van Der Pol, C. B., & Savage, D. W. (2022). Developing a machine learning model to predict patient need for computed tomography imaging in the emergency department. PLoS ONE, 17(12), e0278229. https://doi.org/10.1371/journal.pone.0278229
Xi, Y., Wang, H., & Sun, N. (2022). Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension. Frontiers in Cardiovascular Medicine, 9, 1025705. https://doi.org/10.3389/fcvm.2022.1025705
Chen, A., & Chen, D. O. (2022). Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data. Scientific Reports, 12, 17917. https://doi.org/10.1038/s41598-022-23011-4
Lazzarini, N., Filippoupolitis, A., Manzione, P., & Eleftherohorinou, H. (2022). A machine learning model on Real World Data for predicting progression to Acute Respiratory Distress Syndrome (ARDS) among COVID-19 patients. PLoS ONE, 17(7), e0271227. https://doi.org/10.1371/journal.pone.0271227
Liao, W.-W., Hsieh, Y.-W., Lee, T.-H., Chen, C.-L., & Wu, C.-Y. (2022). Machine learning predicts clinically significant health related quality of life improvement after sensorimotor rehabilitation interventions in chronic stroke. Scientific Reports, 12, 11235. https://doi.org/10.1038/s41598-022-14986-1
Uddin, S., Wang, S., Lu, H., Khan, A., Hajati, F., & Khushi, M. (2022). Comorbidity and multimorbidity prediction of major chronic diseases using machine learning and network analytics. Expert Systems With Applications, 205, 117761. https://doi.org/10.1016/j.eswa.2022.117761
La Cava, W. G., Lee, P. C., Ajmal, I., Ding, X., Solanki, P., Cohen, J. B., Moore, J. H., & Herman, D. S. (2023). A flexible symbolic regression method for constructing interpretable clinical prediction models. Npj Digital Medicine, 6, 107. https://doi.org/10.1038/s41746-023-00833-8
Langenberger, B., Schulte, T., & Groene, O. (2023). The application of machine learning to predict high-cost patients: A performance-comparison of different models using healthcare claims data. PLoS ONE, 18(1), e0279540. https://doi.org/10.1371/journal.pone.0279540
Pasieczna, A. H., Szczepanowski, R., Sobecki, J., Katarzyniak, R., Uchmanowicz, I., Gobbens, R. J. J., Kahsin, A., & Dixit, A. (2023). Importance analysis of psychosociological variables in frailty syndrome in heart failure patients using machine learning approach. Scientific Reports, 13, 7782. https://doi.org/10.1038/s41598-023-35037-3
Caratsch, L., Lechtenboehmer, C., Caorsi, M., Oung, K., Zanchi, F., Aleman, Y., Omoumi, P., & Hügle, T. (2023). POS0892 AN END-TO-END MACHINE LEARNING PIPELINE FOR THE AUTOMATED DETECTION OF RADIOGRAPHIC HAND OSTEOARTHRITIS: a NO-CODING PLATFORM EXPERIENCE. Annals of the Rheumatic Diseases, 82(1), 753–754. https://doi.org/10.1136/annrheumdis-2023-eular.3422
Limketkai, B. N., Maas, L., Krishna, M., Dua, A., DeDecker, L., Sauk, J. S., & Parian, A. M. (2023). Machine learning-based characterization of longitudinal health care utilization among patients with inflammatory bowel diseases. Inflammatory Bowel Diseases, 30(5), 697–703. https://doi.org/10.1093/ibd/izad127
Kwak, S., Lee, H.-J., Kim, S., Park, J.-B., Lee, S.-P., Kim, H.-K., & Kim, Y.-J. (2023). Machine learning reveals sex-specific associations between cardiovascular risk factors and incident atherosclerotic cardiovascular disease. Scientific Reports, 13, 9364. https://doi.org/10.1038/s41598-023-36450-4
Liu, X., Morelli, D., Littlejohns, T. J., Clifton, D. A., & Clifton, L. (2023). Combining machine learning with Cox models to identify predictors for incident post-menopausal breast cancer in the UK Biobank. Scientific Reports, 13, 9221. https://doi.org/10.1038/s41598-023-36214-0
Wang, J., Qin, Z., Hsu, J., & Zhou, B. (2024). A fusion of machine learning algorithms and traditional statistical forecasting models for analyzing American healthcare expenditure. Healthcare Analytics, 5, 100312. https://doi.org/10.1016/j.health.2024.100312
Biswas, S., Aizan, L. N. B., Mathieson, K., Neupane, P., Snowdon, E., MacArthur, J., Sarkar, V., Tetlow, C., & George, K. J. (2024). Clinicosocial determinants of hospital stay following cervical decompression: A public healthcare perspective and machine learning model. Journal of Clinical Neuroscience, 126, 1–11. https://doi.org/10.1016/j.jocn.2024.05.032
Zhou, X. (2024). A study of machine learning applications in healthcare. Applied and Computational Engineering, 102, 128–133. https://doi.org/10.54254/2755-2721/102/20241057
Ali, L., Gun, T. C., & Alhasan, W. (2024). Comparative analysis of machine learning algorithms in enhancing healthcare outcomes. European Modern Studies Journal, 8(3), 606–618. https://doi.org/10.59573/emsj.8(3).2024.38
Moradpour, M., Ritter, Z., & Haushild, A.-C. (2024). Multi-Objective performance optimization of machine learning models in healthcare. In Digital Health and Informatics Innovations for Sustainable Health Care Systems (pp. 822–826). IOS Press. https://doi.org/10.3233/shti240538
Bhute, H., Wani, R., Patil, N., & Naik, V. (2024). Smart Healthcare in Smart Cities: Leveraging Machine Learning for Disease Detection. 2024 4th International Conference on Intelligent Technologies (CONIT), 1–7. https://doi.org/10.1109/conit61985.2024.10627086
Ismukhamedova, A., Uvaliyeva, I., & Belginova, S. (2024). Integrating machine learning in electronic health passport based on WHO study and healthcare resources. Informatics in Medicine Unlocked, 44, 101428. https://doi.org/10.1016/j.imu.2023.101428
Sevukamoorthy, L., Chintapalli, G. S., & Pander, V. K. (2024). Machine learning and generative adversarial networks for accurate and timely cancer detection in smart healthcare systems. 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), 1–5. https://doi.org/10.1109/icccnt61001.2024.10724781
Thakare, R. M., Gangurde, P., & Sawant, G. S. (2024). Machine learning and artificial intelligence in pharmaceutical industry and development. International Journal for Multidisciplinary Research, 6(6), 31522. https://doi.org/10.36948/ijfmr.2024.v06i06.31522
Pazhayattil, A. B., & Konyu-Fogel, G. (2023). An empirical study to accelerate machine learning and artificial intelligence adoption in pharmaceutical manufacturing organizations. Journal of Generic Medicines: The Business Journal for the Generic Medicines Sector, 19(2), 81–91. https://doi.org/10.1177/17411343221151109
Kolhe, K., Somatkar, A. A., Bhandarkar, M. S., Kotangale, K. B., Ayane, S. S., & Shirke, S. I. (2023). Applications and challenges of machine learning techniques for smart manufacturing in Industry 4.0. 2023 7th International Conference on Computing, Communication, Control and Automation (ICCUBEA), 1–6. https://doi.org/10.1109/iccubea58933.2023.10392071
Susanty, M., Puspasari, I., Fitriah, N., Mahayana, D., Rajab, T. E. L., Zakaria, H., Setiawan, A. W., & Hertadi, R. (2023). Avoiding machine learning becoming pseudoscience in biomedical research. Jurnal Informatika, 10(1), 1–12. https://doi.org/10.31294/inf.v10i1.12787
Boulogeorgos, A.-A. A., Trevlakis, S. E., Tegos, S. A., Papanikolaou, V. K., & Karagiannidis, G. K. (2021). Machine learning in Nano-Scale biomedical Engineering. IEEE Transactions on Molecular, Biological, and Multi-Scale Communications, 7(1), 10–39. https://doi.org/10.1109/tmbmc.2020.3035383
Remya, K. R., & Ramya, J. S. (2014). A survey of machine learning approaches for relation classification from biomedical texts. IJETAE International Journal of Emerging Technology and Advanced Engineering, 4(3).
Ghassemi, M., Naumann, T., Schulam, P., Beam, A. L., Chen, I. Y., & Ranganath, R. (2020). A Review of Challenges and Opportunities in Machine Learning for Health. AMIA Summits on Translational Science Proceedings, 2020, 191. https://arxiv.org/pdf/1806.00388.pdf
Yadav, K. K., & Gaurav, A. (2023). Application and Challenges of machine learning in healthcare. International Journal for Research in Applied Science and Engineering Technology, 11(9), 458–466. https://doi.org/10.22214/ijraset.2023.55678
Feng, Q., Du, M., Zou, N., & Hu, X. (2025). Fair Machine Learning in Healthcare: a survey. IEEE Transactions on Artificial Intelligence, 6(3), 493–507. https://doi.org/10.1109/tai.2024.3361836
Chhina, A., Trehan, K., Saini, M., Thakur, S., Kaur, M., Shahtaghi, N. R., Shivgotra, R., Soni, B., Modi, A., Bakrey, H., & Jain, S. K. (2023). Revolutionizing pharmaceutical industry: The radical impact of artificial intelligence and machine learning. Current Pharmaceutical Design, 29(21), 1645–1658. https://doi.org/10.2174/1381612829666230807161421
Wanjul, P. B., Parshuram, M. S., & Laxman, G. V. (2023). Future Directions of AI in Pharma: Innovation in Pharmaceutical Industry. International Journal for Multidisciplinary Research (IJFMR), 5(3), 3098. https://www.ijfmr.com/papers/2023/3/3098.pdf
Leong, W. Y., Leong, Y. Z., & Leong, W. S. (2023). Human-Machine Interaction in Biomedical Manufacturing. 2023 IEEE 5th Eurasia Conference on IOT, Communication and Engineering (ECICE), 939–944. https://doi.org/10.1109/ecice59523.2023.10383070
Kim, W., & Seok, J. (2022). Privacy-preserving collaborative machine learning in biomedical applications. 2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 179–183. https://doi.org/10.1109/icaiic54071.2022.9722703
Jnyanadeep, B., Sahana, S., Suresh, R., S, R. T., & Angadi, G. (2024). Machine Learning for Predictive Modeling and Personalized Treatment in Magnesium-based Biomedical Applications. 2024 8th International Conference on Computational System and Information Technology for Sustainable Solutions (CSITSS), 1–7. https://doi.org/10.1109/csitss64042.2024.10816742
Ramírez, J. G. C., Islam, M. M., & Even, A. I. H. (2024). Machine learning applications in healthcare: Current trends and future prospects. Journal of Artificial Intelligence General Science (JAIGS), 1(1). https://doi.org/10.60087/jaigs.v1i1.33
Ganatra, H. A. (2025). Machine learning in Pediatric healthcare: current trends, challenges, and future directions. Journal of Clinical Medicine, 14(3), 807. https://doi.org/10.3390/jcm14030807
Nadakuditi, S., Kumar, B., & Kumar, T. (2024). AI and Machine Learning in Healthcare - Applications, Challenges and Ethics. International Journal of Health Sciences, 7(4), 36–43. https://doi.org/10.47941/ijhs.1949.

Reference

Han, Y., & Tao, J. (2024). Revolutionizing Pharma: Unveiling the AI and LLM trends in the pharmaceutical industry. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2401.10273
Pargaien, A. V., Pargaien, S., Chilwal, B., Gupta, S., Nawaz, A., Adhikari, M., & Khan, F. (2023). A New Era of Machine Learning Augmenting Pharmaceutical Industry. 2023 2nd International Conference on Ambient Intelligence in Health Care (ICAIHC), 1–6. https://doi.org/10.1109/icaihc59020.2023.10430483
Likhitha, M., Surarchita, S., & Yeshamaina, S. (2024). Applications of newer technologies in enhancement of pharmaceutical industry. International Journal of Innovative Science and Research Technology (IJISRT), 9(9), 2697–2707. https://doi.org/10.38124/ijisrt/ijisrt24sep1485
Mottaghi-Dastjerdi, N., & Soltany-Rezaee-Rad, M. (2024). Advancements and Applications of Artificial Intelligence in Pharmaceutical Sciences: A Comprehensive review. Iranian Journal of Pharmaceutical Research (IJPR), 23(1), e150510. https://doi.org/10.5812/ijpr-150510
Al Rammadan, A. H. A., Almarhoon, M. W., Almarhoon, H. W. A., Alomran, A. R., Al Amer, H. M. T., Hirabah, J. A., Alkhamees, M. S. A., Almari, J. K., Al Jeadah, N. H., Almalki, A. M., Alquaymi, A. E., Alsaleem, A. A., Alsanawi, H. A., & Almajhad, H. A. E. (2022). Leveraging Artificial Intelligence And Machine Learning In Drug Development: Opportunities And Challenges. Scientific Journal for Research Publishing, 2. https://sjr-publishing.com/wp-content/uploads/2019/03/Leveraging-Artificial-Intelligence-And-Machine-Learning-In-Drug-Development-Opportunities-And-Challenges-1.pdf
Khalid, W., Khalid, M. Y., Hena, M., Sarwar, A., & Iqbal, S. (2023). Advancing Pharmaceuticals with Machine Learning: A Short Review of Research and Development Applications. Pharmaceutical Communications, 2(1), 63–69. https://doi.org/10.55627/pharma.002.01.0297
Nilima, S. I., Hossain, M. A., Sharmin, S., Rahman, R., Esa, H., Manik, M. M. T. G., & Hasan, R. (2024). Advancement of Drug Discovery Using Artificial Intelligence and Machine Learning. 2024 IEEE International Conference on Computing, Applications and Systems (COMPAS), 1–7. https://doi.org/10.1109/compas60761.2024.10796748
Nagy, B., Galata, D. L., Farkas, A., & Nagy, Z. K. (2022). Application of artificial neural networks in the Process Analytical Technology of pharmaceutical Manufacturing—A review. The AAPS Journal, 24, 74. https://doi.org/10.1208/s12248-022-00706-0
O’Mahony, N., Murphy, T., Panduru, K., Riordan, D., & Walsh, J. (2017). Real-time monitoring of powder blend composition using near infrared spectroscopy. 2017 Eleventh International Conference on Sensing Technology (ICST), 1–6. https://doi.org/10.1109/icsenst.2017.8304431
Sathyabhama, B., & Menon, S. (2018). CDISC SDTM - an Automated Approach. PhUSE EU Connect 2018, 1–6. https://www.lexjansen.com/phuse/2018/si/SI11.pdf
Jain, D., Chandra, P., Ali, Z., Fatma, N., & Khan, H. (2024). A Comprehensive Investigation: Developing the Pharmaceutical Industry through Artificial Intelligence. Current Drug Discovery Technologies, 21, e15701638313233. https://doi.org/10.2174/0115701638313233240830132804
Ogunye, R. O., Egwuatu, D., Anene, P. C., Azubuike, E. O., Asenuga, O. O., Sargwak, J. P., Ojobor, J.-F. C., Onoharigho, F. O., & Nwokafor, C. V. (2024). The Impact of Emerging Technologies on pharmaceutical process design and optimization in Africa: a review. Journal of Pharmaceutical Research International, 36(9), 46–60. https://doi.org/10.9734/jpri/2024/v36i97576
Markus, B., C, G. C., Andreas, K., Arkadij, K., Stefan, L., Gustav, O., Elina, S., & Radka, S. (2023). Accelerating Biocatalysis Discovery with Machine Learning: A Paradigm Shift in Enzyme Engineering, Discovery, and Design. ACS Catalysis, 13(21), 14454–14469. https://doi.org/10.1021/acscatal.3c03417
Smaldone, A. M., Shee, Y., Kyro, G. W., Xu, C., Vu, N. P., Dutta, R., Farag, M. H., Galda, A., Kumar, S., Kyoseva, E., & Batista, V. S. (2024). Quantum Machine Learning in Drug Discovery: Applications in academia and pharmaceutical industries. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2409.15645
Schiro, F., & Agaian, S. (2022). A machine-learning approach to predicting new pharmaceutical successes in clinical trials. Proceedings Volume 12100, Multimodal Image Exploitation and Learning 2022, 1210009. https://doi.org/10.1117/12.2619069
PhaseV. (2024). Seven Top Pharma Companies Adopt PhaseV’s Machine Learning Technology for Drug Development. PR Newswire. Retrieved March 18, 2025, from https://www.prnewswire.com/news-releases/seven-top-pharma-companies-adopt-phasevs-machine-learning-technology-for-drug-development-302315425.html
Chen, F. (2025). Top 10 Pharmaceutical Companies Using AI and Machine Learning in Drug Discovery (2024) [Online forum post]. LinkedIn. Retrieved March 18, 2025, from https://www.linkedin.com/posts/fengqian-chen_top-10-pharmaceutical-companies-using-ai-activity-7281107628208844800-0Y0H/
Shah-Neville, W. (2024). Five AI drug discovery companies you should know about. Labiotech.eu. Retrieved March 18, 2025, from https://www.labiotech.eu/best-biotech/ai-drug-discovery-companies/
Thomas, M. (2024). 24 Machine learning in healthcare examples. Built In. Retrieved March 18, 2025, from https://builtin.com/artificial-intelligence/machine-learning-healthcare
Zoffmann, S., Vercruysse, M., Benmansour, F., Maunz, A., Wolf, L., Marti, R. B., Heckel, T., Ding, H., Truong, H. H., Prummer, M., Schmucki, R., Mason, C. S., Bradley, K., Jacob, A. I., Lerner, C., Del Rosario, A. A., Burcin, M., Amrein, K. E., & Prunotto, M. (2019). Machine learning-powered antibiotics phenotypic drug discovery. Scientific Reports, 9, 5013. https://doi.org/10.1038/s41598-019-39387-9
Galata, D. L., Farkas, A., Könyves, Z., Mészáros, L. A., Szabó, E., Csontos, I., Pálos, A., Marosi, G., Nagy, Z. K., & Nagy, B. (2019). Fast, Spectroscopy-Based prediction of in vitro dissolution profile of extended release tablets using artificial neural networks. Pharmaceutics, 11(8), 400. https://doi.org/10.3390/pharmaceutics11080400
Ruano-Ordás, D., Yevseyeva, I., Fernandes, V. B., Méndez, J. R., & Emmerich, M. T. M. (2019). Improving the drug discovery process by using multiple classifier systems. Expert Systems With Applications, 121, 292–303. https://doi.org/10.1016/j.eswa.2018.12.032
Abbas, K., Afaq, M., Khan, T. A., & Song, W.-C. (2020). A Blockchain and Machine Learning-Based drug supply chain management and recommendation system for smart pharmaceutical industry. Electronics, 9(5), 852. https://doi.org/10.3390/electronics9050852
Sturm, N., Mayr, A., Van, T. L., Chupakhin, V., Ceulemans, H., Wegner, J., Golib-Dzib, J.-F., Jeliazkova, N., Vandriessche, Y., Böhm, S., Cima, V., Martinovic, J., Greene, N., Aa, T. V., Ashby, T. J., Hochreiter, S., Engkvist, O., Klambauer, G., & Chen, H. (2020). Industry-scale application and evaluation of deep learning for drug target prediction. Journal of Cheminformatics, 12, 26. https://doi.org/10.1186/s13321-020-00428-5
Park, S., Ko, Y. H., Lee, B., Shin, B., & Beck, B. R. (2020). Abstract 35: Molecular optimization of phase III trial failed anticancer drugs using target affinity and toxicity-centered multiple properties reinforcement learning. Clinical Cancer Research, 26(12_Supplement_1), 35. https://doi.org/10.1158/1557-3265.advprecmed20-35
Ong, E., Wang, H., Wong, M. U., Seetharaman, M., Valdez, N., & He, Y. (2020). Vaxign-ML: supervised machine learning reverse vaccinology model for improved prediction of bacterial protective antigens. Bioinformatics, 36(10), 3185–3191. https://doi.org/10.1093/bioinformatics/btaa119
Mohsen, A., Tripathi, L. P., & Mizuguchi, K. (2021). Deep learning prediction of adverse drug reactions in drug discovery using open TG–GATEs and FAERS databases. Frontiers in Drug Discovery, 1, 768792. https://doi.org/10.3389/fddsv.2021.768792
Masumshah, R., Aghdam, R., & Eslahchi, C. (2021). A neural network-based method for polypharmacy side effects prediction. BMC Bioinformatics, 22, 385. https://doi.org/10.1186/s12859-021-04298-y
Narayanan, H., Dingfelder, F., Morales, I. C., Patel, B., Heding, K. E., Bjelke, J. R., Egebjerg, T., Butté, A., Sokolov, M., Lorenzen, N., & Arosio, P. (2021). Design of biopharmaceutical formulations accelerated by machine learning. Molecular Pharmaceutics, 18(10), 3843–3853. https://doi.org/10.1021/acs.molpharmaceut.1c00469
Pandi, M.-T., Koromina, M., Tsafaridis, I., Patsilinakos, S., Christoforou, E., Van Der Spek, P. J., & Patrinos, G. P. (2021). A novel machine learning-based approach for the computational functional assessment of pharmacogenomic variants. Human Genomics, 15, 51. https://doi.org/10.1186/s40246-021-00352-1
Wang, Y., Yang, Y., Chen, S., & Wang, J. (2021). DeepDRK: a deep learning framework for drug repurposing through kernel-based multi-omics integration. Briefings in Bioinformatics, 22(5), bbab048. https://doi.org/10.1093/bib/bbab048
Wang, N.-N., Wang, X.-G., Xiong, G.-L., Yang, Z.-Y., Lu, A.-P., Chen, X., Liu, S., Hou, T.-J., & Cao, D.-S. (2022). Machine learning to predict metabolic drug interactions related to cytochrome P450 isozymes. Journal of Cheminformatics, 14, 23. https://doi.org/10.1186/s13321-022-00602-x
Zhu, E. Y., & Dupuy, A. J. (2022). Machine learning approach informs biology of cancer drug response. BMC Bioinformatics, 23, 184. https://doi.org/10.1186/s12859-022-04720-z
Han, Y., Klinger, K., Rajpal, D. K., Zhu, C., & Teeple, E. (2022). Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform. BMC Bioinformatics, 23, 232. https://doi.org/10.1186/s12859-022-04753-4
Qureshi, R., Basit, S. A., Shamsi, J. A., Fan, X., Nawaz, M., Yan, H., & Alam, T. (2022). Machine learning based personalized drug response prediction for lung cancer patients. Scientific Reports, 12, 18935. https://doi.org/10.1038/s41598-022-23649-0
Goldwaser, E., Laurent, C., Lagarde, N., Fabrega, S., Nay, L., Villoutreix, B. O., Jelsch, C., Nicot, A. B., Loriot, M.-A., & Miteva, M. A. (2022). Machine learning-driven identification of drugs inhibiting cytochrome P450 2C9. PLoS Computational Biology, 18(1), e1009820. https://doi.org/10.1371/journal.pcbi.1009820
Rahman, A. S. M. Z., Liu, C., Sturm, H., Hogan, A. M., Davis, R., Hu, P., & Cardona, S. T. (2022). A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery. PLoS Computational Biology, 18(10), e1010613. https://doi.org/10.1371/journal.pcbi.1010613
Badwan, B. A., Liaropoulos, G., Kyrodimos, E., Skaltsas, D., Tsirigos, A., & Gorgoulis, V. G. (2023). Machine learning approaches to predict drug efficacy and toxicity in oncology. Cell Reports Methods, 3(2), 100413. https://doi.org/10.1016/j.crmeth.2023.100413
Bannigan, P., Bao, Z., Hickman, R. J., Aldeghi, M., Häse, F., Aspuru-Guzik, A., & Allen, C. (2023). Machine learning models to accelerate the design of polymeric long-acting injectables. Nature Communications, 14, 35. https://doi.org/10.1038/s41467-022-35343-w
Hou, R., Xie, C., Gui, Y., Li, G., & Li, X. (2023). Machine-Learning-Based data analysis method for Cell-Based selection of DNA-Encoded libraries. ACS Omega, 8(21), 19057–19071. https://doi.org/10.1021/acsomega.3c02152
Vojjala, S. K., Barron, J., Kumar, A., Grabner, M., Eshete, B., Tan, H., & Willey, V. (2023). P21 Machine learning for imputing missing pharmacy costs in claims data. Value in Health, 26(6), S5. https://doi.org/10.1016/j.jval.2023.03.034
Patel, S., Patel, M., Kulkarni, M., & Patel, M. S. (2023). DE-INTERACT: A machine-learning-based predictive tool for the drug-excipient interaction study during product development—Validation through paracetamol and vanillin as a case study. International Journal of Pharmaceutics, 637, 122839. https://doi.org/10.1016/j.ijpharm.2023.122839
Pirzada, R. H., Ahmad, B., Qayyum, N., & Choi, S. (2023). Modeling structure–activity relationships with machine learning to identify GSK3-targeted small molecules as potential COVID-19 therapeutics. Frontiers in Endocrinology, 14, 1084327. https://doi.org/10.3389/fendo.2023.1084327
Shin, S. H., Hur, G., Kim, N. R., Park, J. H. Y., Lee, K. W., & Yang, H. (2023). A machine learning-integrated stepwise method to discover novel anti-obesity phytochemicals that antagonize the glucocorticoid receptor. Food & Function, 14(4), 1869–1883. https://doi.org/10.1039/d2fo03466b
Asha, V., Anjimoon, S., Verma, M. R., Singla, A., Khan, I., & Alkhafaji, M. A. (2024). Machine Learning-Driven Blockchain for Enhanced Drug Discovery and Development in Pharmaceutical Research. 2024 OPJU International Technology Conference (OTCON) on Smart Computing for Innovation and Advancement in Industry 4.0, 1–6. https://doi.org/10.1109/otcon60325.2024.10688287
Arunkumar, M., & Baskaran, T. S. (2024). Use of machine learning for intelligence detection for pharmaceutical Drug-Drug interactions. Journal of Advanced Zoology, 45(S4), 478–484. https://doi.org/10.53555/jaz.v45is4.4309
Singh, V., & Kaewprapha, P. (2024). Machine Learning Application for Precise Identification of Defective and QC-Approved Tablets in Pharmaceutical Manufacturing. 2024 12th International Electrical Engineering Congress (iEECON), 01–05. https://doi.org/10.1109/ieecon60677.2024.10537855
Bello, V., Coghe, L., Gerbasi, A., Figus, E., Dagliati, A., & Merlo, S. (2024). Machine Learning-Based Approach towards Identification of Pharmaceutical Suspensions Exploiting Speckle Pattern Images. Sensors, 24(20), 6635. https://doi.org/10.3390/s24206635
Cysewski, P., Jeli?ski, T., & Przyby?ek, M. (2024). Exploration of the solubility hyperspace of selected active pharmaceutical ingredients in choline- and Betaine-Based deep eutectic solvents: machine learning modeling and experimental validation. Molecules, 29(20), 4894. https://doi.org/10.3390/molecules29204894
Mustapa, D. R., & Tjahyanto, A. (2024). Comparative analysis: Machine learning algorithms for TOC prediction in pharmaceutical water treatment systems. Jurnal Sisfokom (Sistem Informasi Dan Komputer), 13(2), 253–260. https://doi.org/10.32736/sisfokom.v13i2.2148
Nhlapho, S., Nyathi, M. H. L., Ngwenya, B. L., Dube, T., Telukdarie, A., Munien, I., Vermeulen, A., & Chude-Okonkwo, U. A. K. (2024). Druggability of Pharmaceutical Compounds Using Lipinski Rules with Machine Learning. Sciences of Pharmacy, 3(4), 177–192. https://doi.org/10.58920/sciphar0304264
Kalaichelvan, K., Ramalingam, S., Dhandapani, P. B., Leiva, V., & Castro, C. (2024). Optimizing the economic order quantity using fuzzy theory and machine learning applied to a pharmaceutical framework. Mathematics, 12(6), 819. https://doi.org/10.3390/math12060819
Kim, J., Ko, Y., & Seo, J. (2019). A bootstrapping approach with CRF and deep learning models for improving the biomedical named entity recognition in Multi-Domains. IEEE Access, 7, 70308–70318. https://doi.org/10.1109/access.2019.2914168
Zhang, Y., Chen, Q., Yang, Z., Lin, H., & Lu, Z. (2019). BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific Data, 6, 52. https://doi.org/10.1038/s41597-019-0055-0
Hathaway, Q. A., Roth, S. M., Pinti, M. V., Sprando, D. C., Kunovac, A., Durr, A. J., Cook, C. C., Fink, G. K., Cheuvront, T. B., Grossman, J. H., Aljahli, G. A., Taylor, A. D., Giromini, A. P., Allen, J. L., & Hollander, J. M. (2019). Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics. Cardiovascular Diabetology, 18, 78. https://doi.org/10.1186/s12933-019-0879-0
Richens, J. G., Lee, C. M., & Johri, S. (2020). Improving the accuracy of medical diagnosis with causal machine learning. Nature Communications, 11, 3923. https://doi.org/10.1038/s41467-020-17419-7
Martino, F., Varricchio, S., Russo, D., Merolla, F., Ilardi, G., Mascolo, M., Dell’Aversana, G. O., Califano, L., Toscano, G., De Pietro, G., Frucci, M., Brancati, N., Fraggetta, F., & Staibano, S. (2020). A machine-learning approach for the assessment of the proliferative compartment of solid tumors on Hematoxylin-Eosin-Stained sections. Cancers, 12(5), 1344. https://doi.org/10.3390/cancers12051344
Hazra, D., & Byun, Y.-C. (2020). SynSigGAN: Generative Adversarial Networks for Synthetic Biomedical Signal Generation. Biology, 9(12), 441. https://doi.org/10.3390/biology9120441
Marcinkiewicz-Siemion, M., Kaminski, M., Ciborowski, M., Ptaszynska-Kopczynska, K., Szpakowicz, A., Lisowska, A., Jasiewicz, M., Tarasiuk, E., Kretowski, A., Sobkowicz, B., & Kaminski, K. A. (2020). Machine-learning facilitates selection of a novel diagnostic panel of metabolites for the detection of heart failure. Scientific Reports, 10, 130. https://doi.org/10.1038/s41598-019-56889-8
Gandouz, M., Holzmann, H., & Heider, D. (2021). Machine learning with asymmetric abstention for biomedical decision-making. BMC Medical Informatics and Decision Making, 21, 294. https://doi.org/10.1186/s12911-021-01655-y
Gu, Q., Kumar, A., Bray, S., Creason, A., Khanteymoori, A., Jalili, V., Grüning, B., & Goecks, J. (2021). Galaxy-ML: An accessible, reproducible, and scalable machine learning toolkit for biomedicine. PLoS Computational Biology, 17(6), e1009014. https://doi.org/10.1371/journal.pcbi.1009014
Du, T., Xie, L., Zhang, H., Liu, X., Wang, X., Chen, D., Xu, Y., Sun, Z., Zhou, W., Song, L., Guan, C., Lansky, A. J., & Xu, B. (2021). Training and validation of a deep learning architecture for the automatic analysis of coronary angiography. EuroIntervention, 17(1), 32–40. https://doi.org/10.4244/eij-d-20-00570
Akazawa, M., Hashimoto, K., Katsuhiko, N., & Kaname, Y. (2021). Machine learning approach for the prediction of postpartum hemorrhage in vaginal birth. Scientific Reports, 11, 22620. https://doi.org/10.1038/s41598-021-02198-y
Feng, G., Zheng, K. I., Li, Y.-Y., Rios, R. S., Zhu, P.-W., Pan, X.-Y., Li, G., Ma, H.-L., Tang, L.-J., Byrne, C. D., Targher, G., He, N., Mi, M., Chen, Y.-P., & Zheng, M.-H. (2021). Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy?confirmed NAFLD. Journal of Hepato-Biliary-Pancreatic Sciences, 28(7), 593–603. https://doi.org/10.1002/jhbp.972
Kim, Y. J. (2022). Machine Learning Model Based on Radiomic Features for Differentiation between COVID-19 and Pneumonia on Chest X-ray. Sensors, 22(17), 6709. https://doi.org/10.3390/s22176709
Bonidia, R. P., Santos, A. P. A., De Almeida, B. L. S., Stadler, P. F., Da Rocha, U. N., Sanches, D. S., & De Carvalho, A. C. P. L. F. (2022). BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria. Briefings in Bioinformatics, 23(4), bbac218. https://doi.org/10.1093/bib/bbac218
Fu, S., Cheng, Y., Wang, X., Huang, J., Su, S., Wu, H., Yu, J., & Xu, Z. (2022). Identification of diagnostic gene biomarkers and immune infiltration in patients with diabetic kidney disease using machine learning strategies and bioinformatic analysis. Frontiers in Medicine, 9, 918657. https://doi.org/10.3389/fmed.2022.918657
Zeng, Z., Yao, Y., Liu, Z., & Sun, M. (2022). A deep-learning system bridging molecule structure and biomedical text with comprehension comparable to human professionals. Nature Communications, 13, 862. https://doi.org/10.1038/s41467-022-28494-3
Zhang, H., Hu, J., Zhu, J., Li, Q., & Fang, L. (2022). Machine learning-based metabolism-related genes signature and immune infiltration landscape in diabetic nephropathy. Frontiers in Endocrinology, 13, 1026938. https://doi.org/10.3389/fendo.2022.1026938
Akatsuka, J., Numata, Y., Morikawa, H., Sekine, T., Kayama, S., Mikami, H., Yanagi, M., Endo, Y., Takeda, H., Toyama, Y., Yamaguchi, R., Kimura, G., Kondo, Y., & Yamamoto, Y. (2022). A data-driven ultrasound approach discriminates pathological high grade prostate cancer. Scientific Reports, 12, 860. https://doi.org/10.1038/s41598-022-04951-3
Jan, Y.-T., Tsai, P.-S., Huang, W.-H., Chou, L.-Y., Huang, S.-C., Wang, J.-Z., Lu, P.-H., Lin, D.-C., Yen, C.-S., Teng, J.-P., Mok, G. S. P., Shih, C.-T., & Wu, T.-H. (2023). Machine learning combined with radiomics and deep learning features extracted from CT images: a novel AI model to distinguish benign from malignant ovarian tumors. Insights Into Imaging, 14, 68. https://doi.org/10.1186/s13244-023-01412-x
Rabaglino, M. B., Salilew?Wondim, D., Zolini, A., Tesfaye, D., Hoelker, M., Lonergan, P., & Hansen, P. J. (2023). Machine?learning methods applied to integrated transcriptomic data from bovine blastocysts and elongating conceptuses to identify genes predictive of embryonic competence. The FASEB Journal, 37(3), e22809. https://doi.org/10.1096/fj.202201977r
Rana, M., & Bhushan, M. (2023). Machine learning and deep learning approach for medical image analysis: diagnosis to detection. Multimedia Tools and Applications, 82, 26731–26769. https://doi.org/10.1007/s11042-022-14305-w
Jungo, P., & Hewer, E. (2023). Code-free machine learning for classification of central nervous system histopathology images. Journal of Neuropathology & Experimental Neurology, 82(3), 221–230. https://doi.org/10.1093/jnen/nlac131
Shuryak, I., Nemzow, L., Bacon, B. A., Taveras, M., Wu, X., Deoli, N., Ponnaiya, B., Garty, G., Brenner, D. J., & Turner, H. C. (2023). Machine learning approach for quantitative biodosimetry of partial-body or total-body radiation exposures by combining radiation-responsive biomarkers. Scientific Reports, 13, 949. https://doi.org/10.1038/s41598-023-28130-0
Sun, Z., Lin, J., Zhang, T., Sun, X., Wang, T., Duan, J., & Yao, K. (2023). Combining bioinformatics and machine learning to identify common mechanisms and biomarkers of chronic obstructive pulmonary disease and atrial fibrillation. Frontiers in Cardiovascular Medicine, 10, 1121102. https://doi.org/10.3389/fcvm.2023.1121102
Azari, H., Nazari, E., Mohit, R., Asadnia, A., Maftooh, M., Nassiri, M., Hassanian, S. M., Ghayour-Mobarhan, M., Shahidsales, S., Khazaei, M., Ferns, G. A., & Avan, A. (2023). Machine learning algorithms reveal potential miRNAs biomarkers in gastric cancer. Scientific Reports, 13, 6147. https://doi.org/10.1038/s41598-023-32332-x
Su, L., Chen, J., Peng, Y., & Sun, C. (2024). Demonstration-based learning for few-shot biomedical named entity recognition under machine reading comprehension. Journal of Biomedical Informatics, 159, 104739. https://doi.org/10.1016/j.jbi.2024.104739
Wu, C., Wan, B., Entezari, A., Fang, J., Xu, Y., & Li, Q. (2024). Machine learning-based design for additive manufacturing in biomedical engineering. International Journal of Mechanical Sciences, 266, 108828. https://doi.org/10.1016/j.ijmecsci.2023.108828
Islam, S., Bentahar, J., Cohen, R., & Rjoub, G. (2024). A Multi-Modal unsupervised machine learning approach for biomedical signal processing in CPR. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2411.11869
Slonopas, A., Beatty, A., & Djajalaksana, Y. (2024). Applying Reservoir Computing and Machine Learning Techniques for Image Enhancement in Biomedical Imaging. 2024 International Conference on Smart Applications, Communications and Networking (SmartNets), 1–7. https://doi.org/10.1109/smartnets61466.2024.10577705
Huan, J.-M., Wang, X.-J., Li, Y., Zhang, S.-J., Hu, Y.-L., & Li, Y.-L. (2024). The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data. BioData Mining, 17, 13. https://doi.org/10.1186/s13040-024-00365-1
He, Z., Du, N., Wang, J., & Ye, X. (2024). A Machine Learning Classification Approach for Solving Biomedical Inverse Scattering Problem. 2024 IEEE International Conference on Computational Electromagnetics (ICCEM), 1–3. https://doi.org/10.1109/iccem60619.2024.10558921
Lehmann, V., Zueger, T., Maritsch, M., Notter, M., Schallmoser, S., Bérubé, C., Albrecht, C., Kraus, M., Feuerriegel, S., Fleisch, E., Kowatsch, T., Lagger, S., Laimer, M., Wortmann, F., & Stettler, C. (2024). Machine Learning to Infer a Health State Using Biomedical Signals — Detection of Hypoglycemia in People with Diabetes while Driving Real Cars. NEJM AI, 1(3), AIoa2300013. https://doi.org/10.1056/aioa2300013
Mercaldo, F., Brunese, L., Santone, A., Martinelli, F., & Cesarelli, M. (2024). Extreme Learning Machine for Biomedical Image Classification: A Multi-Case Study. EAI Endorsed Transactions on Pervasive Health and Technology, 10, 1–8. https://doi.org/10.4108/eetpht.10.5542
Venkatesh, R., Balasubramanian, C., & Kaliappan, M. (2019). Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique. Journal of Medical Systems, 43, 272. https://doi.org/10.1007/s10916-019-1398-y
Ramkumar, P. N., Haeberle, H. S., Ramanathan, D., Cantrell, W. A., Navarro, S. M., Mont, M. A., Bloomfield, M., & Patterson, B. M. (2019). Remote patient monitoring using mobile health for total knee arthroplasty: validation of a wearable and Machine Learning–Based surveillance platform. The Journal of Arthroplasty, 34(10), 2253–2259. https://doi.org/10.1016/j.arth.2019.05.021
Myers, K. D., Knowles, J. W., Staszak, D., Shapiro, M. D., Howard, W., Yadava, M., Zuzick, D., Williamson, L., Shah, N. H., Banda, J. M., Leader, J., Cromwell, W. C., Trautman, E., Murray, M. F., Baum, S. J., Myers, S., Gidding, S. S., Wilemon, K., & Rader, D. J. (2019). Precision screening for familial hypercholesterolaemia: a machine learning study applied to electronic health encounter data. The Lancet Digital Health, 1(8), e393–e402. https://doi.org/10.1016/s2589-7500(19)30150-5
Maarseveen, T. D., Meinderink, T., Reinders, M. J. T., Knitza, J., Huizinga, T. W. J., Kleyer, A., Simon, D., Van Den Akker, E. B., & Knevel, R. (2020). Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study. JMIR Medical Informatics, 8(11), e23930. https://doi.org/10.2196/23930
Du, Z., Yang, Y., Zheng, J., Li, Q., Lin, D., Li, Y., Fan, J., Cheng, W., Chen, X., & Cai, Y. (2020). Accurate prediction of coronary heart disease for patients with hypertension from electronic health records with big data and Machine-Learning methods: model development and performance evaluation. JMIR Medical Informatics, 8(7), e17257. https://doi.org/10.2196/17257
El-Ganainy, N. O., Balasingham, I., Halvorsen, P. S., & Rosseland, L. A. (2020). A new real time clinical decision support system using machine learning for critical care units. IEEE Access, 8, 185676–185687. https://doi.org/10.1109/access.2020.3030031
Artzi, N. S., Shilo, S., Hadar, E., Rossman, H., Barbash-Hazan, S., Ben-Haroush, A., Balicer, R. D., Feldman, B., Wiznitzer, A., & Segal, E. (2020). Prediction of gestational diabetes based on nationwide electronic health records. Nature Medicine, 26, 71–76. https://doi.org/10.1038/s41591-019-0724-8
Philpott-Morgan, S., Thakrar, D. B., Symons, J., Ray, D., Ashrafian, H., & Darzi, A. (2021). Characterising the nationwide burden and predictors of unkept outpatient appointments in the National Health Service in England: A cohort study using a machine learning approach. PLoS Medicine, 18(10), e1003783. https://doi.org/10.1371/journal.pmed.1003783
Liu, Y. S., Chokka, S., Cao, B., & Chokka, P. R. (2021). Screening for bipolar disorder in a tertiary mental health centre using EarlyDetect: A machine learning-based pilot study. Journal of Affective Disorders Reports, 6, 100215. https://doi.org/10.1016/j.jadr.2021.100215
Guo, A., Mazumder, N. R., Ladner, D. P., & Foraker, R. E. (2021). Predicting mortality among patients with liver cirrhosis in electronic health records with machine learning. PLoS ONE, 16(8), e0256428. https://doi.org/10.1371/journal.pone.0256428
Estiri, H., Strasser, Z. H., & Murphy, S. N. (2021). Individualized prediction of COVID-19 adverse outcomes with MLHO. Scientific Reports, 11, 5322. https://doi.org/10.1038/s41598-021-84781-x
Zeng, Z., Yao, S., Zheng, J., & Gong, X. (2021). Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis. BioData Mining, 14, 40. https://doi.org/10.1186/s13040-021-00276-5
Shahbandegan, A., Mago, V., Alaref, A., Van Der Pol, C. B., & Savage, D. W. (2022). Developing a machine learning model to predict patient need for computed tomography imaging in the emergency department. PLoS ONE, 17(12), e0278229. https://doi.org/10.1371/journal.pone.0278229
Xi, Y., Wang, H., & Sun, N. (2022). Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension. Frontiers in Cardiovascular Medicine, 9, 1025705. https://doi.org/10.3389/fcvm.2022.1025705
Chen, A., & Chen, D. O. (2022). Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data. Scientific Reports, 12, 17917. https://doi.org/10.1038/s41598-022-23011-4
Lazzarini, N., Filippoupolitis, A., Manzione, P., & Eleftherohorinou, H. (2022). A machine learning model on Real World Data for predicting progression to Acute Respiratory Distress Syndrome (ARDS) among COVID-19 patients. PLoS ONE, 17(7), e0271227. https://doi.org/10.1371/journal.pone.0271227
Liao, W.-W., Hsieh, Y.-W., Lee, T.-H., Chen, C.-L., & Wu, C.-Y. (2022). Machine learning predicts clinically significant health related quality of life improvement after sensorimotor rehabilitation interventions in chronic stroke. Scientific Reports, 12, 11235. https://doi.org/10.1038/s41598-022-14986-1
Uddin, S., Wang, S., Lu, H., Khan, A., Hajati, F., & Khushi, M. (2022). Comorbidity and multimorbidity prediction of major chronic diseases using machine learning and network analytics. Expert Systems With Applications, 205, 117761. https://doi.org/10.1016/j.eswa.2022.117761
La Cava, W. G., Lee, P. C., Ajmal, I., Ding, X., Solanki, P., Cohen, J. B., Moore, J. H., & Herman, D. S. (2023). A flexible symbolic regression method for constructing interpretable clinical prediction models. Npj Digital Medicine, 6, 107. https://doi.org/10.1038/s41746-023-00833-8
Langenberger, B., Schulte, T., & Groene, O. (2023). The application of machine learning to predict high-cost patients: A performance-comparison of different models using healthcare claims data. PLoS ONE, 18(1), e0279540. https://doi.org/10.1371/journal.pone.0279540
Pasieczna, A. H., Szczepanowski, R., Sobecki, J., Katarzyniak, R., Uchmanowicz, I., Gobbens, R. J. J., Kahsin, A., & Dixit, A. (2023). Importance analysis of psychosociological variables in frailty syndrome in heart failure patients using machine learning approach. Scientific Reports, 13, 7782. https://doi.org/10.1038/s41598-023-35037-3
Caratsch, L., Lechtenboehmer, C., Caorsi, M., Oung, K., Zanchi, F., Aleman, Y., Omoumi, P., & Hügle, T. (2023). POS0892 AN END-TO-END MACHINE LEARNING PIPELINE FOR THE AUTOMATED DETECTION OF RADIOGRAPHIC HAND OSTEOARTHRITIS: a NO-CODING PLATFORM EXPERIENCE. Annals of the Rheumatic Diseases, 82(1), 753–754. https://doi.org/10.1136/annrheumdis-2023-eular.3422
Limketkai, B. N., Maas, L., Krishna, M., Dua, A., DeDecker, L., Sauk, J. S., & Parian, A. M. (2023). Machine learning-based characterization of longitudinal health care utilization among patients with inflammatory bowel diseases. Inflammatory Bowel Diseases, 30(5), 697–703. https://doi.org/10.1093/ibd/izad127
Kwak, S., Lee, H.-J., Kim, S., Park, J.-B., Lee, S.-P., Kim, H.-K., & Kim, Y.-J. (2023). Machine learning reveals sex-specific associations between cardiovascular risk factors and incident atherosclerotic cardiovascular disease. Scientific Reports, 13, 9364. https://doi.org/10.1038/s41598-023-36450-4
Liu, X., Morelli, D., Littlejohns, T. J., Clifton, D. A., & Clifton, L. (2023). Combining machine learning with Cox models to identify predictors for incident post-menopausal breast cancer in the UK Biobank. Scientific Reports, 13, 9221. https://doi.org/10.1038/s41598-023-36214-0
Wang, J., Qin, Z., Hsu, J., & Zhou, B. (2024). A fusion of machine learning algorithms and traditional statistical forecasting models for analyzing American healthcare expenditure. Healthcare Analytics, 5, 100312. https://doi.org/10.1016/j.health.2024.100312
Biswas, S., Aizan, L. N. B., Mathieson, K., Neupane, P., Snowdon, E., MacArthur, J., Sarkar, V., Tetlow, C., & George, K. J. (2024). Clinicosocial determinants of hospital stay following cervical decompression: A public healthcare perspective and machine learning model. Journal of Clinical Neuroscience, 126, 1–11. https://doi.org/10.1016/j.jocn.2024.05.032
Zhou, X. (2024). A study of machine learning applications in healthcare. Applied and Computational Engineering, 102, 128–133. https://doi.org/10.54254/2755-2721/102/20241057
Ali, L., Gun, T. C., & Alhasan, W. (2024). Comparative analysis of machine learning algorithms in enhancing healthcare outcomes. European Modern Studies Journal, 8(3), 606–618. https://doi.org/10.59573/emsj.8(3).2024.38
Moradpour, M., Ritter, Z., & Haushild, A.-C. (2024). Multi-Objective performance optimization of machine learning models in healthcare. In Digital Health and Informatics Innovations for Sustainable Health Care Systems (pp. 822–826). IOS Press. https://doi.org/10.3233/shti240538
Bhute, H., Wani, R., Patil, N., & Naik, V. (2024). Smart Healthcare in Smart Cities: Leveraging Machine Learning for Disease Detection. 2024 4th International Conference on Intelligent Technologies (CONIT), 1–7. https://doi.org/10.1109/conit61985.2024.10627086
Ismukhamedova, A., Uvaliyeva, I., & Belginova, S. (2024). Integrating machine learning in electronic health passport based on WHO study and healthcare resources. Informatics in Medicine Unlocked, 44, 101428. https://doi.org/10.1016/j.imu.2023.101428
Sevukamoorthy, L., Chintapalli, G. S., & Pander, V. K. (2024). Machine learning and generative adversarial networks for accurate and timely cancer detection in smart healthcare systems. 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), 1–5. https://doi.org/10.1109/icccnt61001.2024.10724781
Thakare, R. M., Gangurde, P., & Sawant, G. S. (2024). Machine learning and artificial intelligence in pharmaceutical industry and development. International Journal for Multidisciplinary Research, 6(6), 31522. https://doi.org/10.36948/ijfmr.2024.v06i06.31522
Pazhayattil, A. B., & Konyu-Fogel, G. (2023). An empirical study to accelerate machine learning and artificial intelligence adoption in pharmaceutical manufacturing organizations. Journal of Generic Medicines: The Business Journal for the Generic Medicines Sector, 19(2), 81–91. https://doi.org/10.1177/17411343221151109
Kolhe, K., Somatkar, A. A., Bhandarkar, M. S., Kotangale, K. B., Ayane, S. S., & Shirke, S. I. (2023). Applications and challenges of machine learning techniques for smart manufacturing in Industry 4.0. 2023 7th International Conference on Computing, Communication, Control and Automation (ICCUBEA), 1–6. https://doi.org/10.1109/iccubea58933.2023.10392071
Susanty, M., Puspasari, I., Fitriah, N., Mahayana, D., Rajab, T. E. L., Zakaria, H., Setiawan, A. W., & Hertadi, R. (2023). Avoiding machine learning becoming pseudoscience in biomedical research. Jurnal Informatika, 10(1), 1–12. https://doi.org/10.31294/inf.v10i1.12787
Boulogeorgos, A.-A. A., Trevlakis, S. E., Tegos, S. A., Papanikolaou, V. K., & Karagiannidis, G. K. (2021). Machine learning in Nano-Scale biomedical Engineering. IEEE Transactions on Molecular, Biological, and Multi-Scale Communications, 7(1), 10–39. https://doi.org/10.1109/tmbmc.2020.3035383
Remya, K. R., & Ramya, J. S. (2014). A survey of machine learning approaches for relation classification from biomedical texts. IJETAE International Journal of Emerging Technology and Advanced Engineering, 4(3).
Ghassemi, M., Naumann, T., Schulam, P., Beam, A. L., Chen, I. Y., & Ranganath, R. (2020). A Review of Challenges and Opportunities in Machine Learning for Health. AMIA Summits on Translational Science Proceedings, 2020, 191. https://arxiv.org/pdf/1806.00388.pdf
Yadav, K. K., & Gaurav, A. (2023). Application and Challenges of machine learning in healthcare. International Journal for Research in Applied Science and Engineering Technology, 11(9), 458–466. https://doi.org/10.22214/ijraset.2023.55678
Feng, Q., Du, M., Zou, N., & Hu, X. (2025). Fair Machine Learning in Healthcare: a survey. IEEE Transactions on Artificial Intelligence, 6(3), 493–507. https://doi.org/10.1109/tai.2024.3361836
Chhina, A., Trehan, K., Saini, M., Thakur, S., Kaur, M., Shahtaghi, N. R., Shivgotra, R., Soni, B., Modi, A., Bakrey, H., & Jain, S. K. (2023). Revolutionizing pharmaceutical industry: The radical impact of artificial intelligence and machine learning. Current Pharmaceutical Design, 29(21), 1645–1658. https://doi.org/10.2174/1381612829666230807161421
Wanjul, P. B., Parshuram, M. S., & Laxman, G. V. (2023). Future Directions of AI in Pharma: Innovation in Pharmaceutical Industry. International Journal for Multidisciplinary Research (IJFMR), 5(3), 3098. https://www.ijfmr.com/papers/2023/3/3098.pdf
Leong, W. Y., Leong, Y. Z., & Leong, W. S. (2023). Human-Machine Interaction in Biomedical Manufacturing. 2023 IEEE 5th Eurasia Conference on IOT, Communication and Engineering (ECICE), 939–944. https://doi.org/10.1109/ecice59523.2023.10383070
Kim, W., & Seok, J. (2022). Privacy-preserving collaborative machine learning in biomedical applications. 2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 179–183. https://doi.org/10.1109/icaiic54071.2022.9722703
Jnyanadeep, B., Sahana, S., Suresh, R., S, R. T., & Angadi, G. (2024). Machine Learning for Predictive Modeling and Personalized Treatment in Magnesium-based Biomedical Applications. 2024 8th International Conference on Computational System and Information Technology for Sustainable Solutions (CSITSS), 1–7. https://doi.org/10.1109/csitss64042.2024.10816742
Ramírez, J. G. C., Islam, M. M., & Even, A. I. H. (2024). Machine learning applications in healthcare: Current trends and future prospects. Journal of Artificial Intelligence General Science (JAIGS), 1(1). https://doi.org/10.60087/jaigs.v1i1.33
Ganatra, H. A. (2025). Machine learning in Pediatric healthcare: current trends, challenges, and future directions. Journal of Clinical Medicine, 14(3), 807. https://doi.org/10.3390/jcm14030807
Nadakuditi, S., Kumar, B., & Kumar, T. (2024). AI and Machine Learning in Healthcare - Applications, Challenges and Ethics. International Journal of Health Sciences, 7(4), 36–43. https://doi.org/10.47941/ijhs.1949.

Parankush Koul

Corresponding author

Department of Mechanical and Aerospace Engineering, Illinois Institute of Technology, 3201 South State Street, Chicago, 60616, Illinois, United States of America

Dr. Indu B. Koul

Co-author

Department of Biochemistry, Postgraduate Institute of Medical Education and Research, Sector-12, Chandigarh, 160012, India

Parankush Koul*, Dr. Indu B. Koul, Advancements in Machine Learning Applications for The Pharmaceutical, Biomedical, And Healthcare Industries, Int. J. of Pharm. Sci., 2025, Vol 3, Issue 4, 1548-1580. https://doi.org/10.5281/zenodo.15204262

View Article

Advancements in Machine Learning Applications for The Pharmaceutical, Biomedical, And Healthcare Industries

Abstract

Keywords

Introduction

Reference

Parankush Koul

Dr. Indu B. Koul

More related articles

Plant Derived Antioxidant: Significance In Skin He...

An Interventional Study To Assess Knowledge On Ins...

Supersaturated Drug Delivery System: An Approach T...

View more

Formulation and Evaluation of Alum Toner Spray for Anti-Acne Activity ...

Clay Catalyst in Organic Synthesis...

A Comprehensive Review of Methods for Testing in-vitro Anthelmintic Activity...

View more

Related Articles

Development And Formulation Of Sustained Release Capsules Bearing Anti-Hypertens...

Advances In Diabetes Management: A Comprehensive Review of Novel Formulations fo...

Formulation and Evaluation of Herbal Shampoo ...

In Silico Design And ADME Study Of Novel Benzimidazole Containing Derivatives As...

Plant Derived Antioxidant: Significance In Skin Health and Ageing Process ...

More related articles

Plant Derived Antioxidant: Significance In Skin Health and Ageing Process ...

An Interventional Study To Assess Knowledge On Insulin Self Administration Among...

Supersaturated Drug Delivery System: An Approach To Enhance The Bioavailability...

View more

Plant Derived Antioxidant: Significance In Skin Health and Ageing Process ...

An Interventional Study To Assess Knowledge On Insulin Self Administration Among...

Supersaturated Drug Delivery System: An Approach To Enhance The Bioavailability...

View more