AI-Enhanced Signal Detection in Pharmacovigilance: Methods and Clinical Implications in Drug Safety a Comprehensive Secondary Review with Methodological Analysis

Ashka Soni; Mohd Sakir; Mariya Sultana; Arpan Chowdhury; Hiba Irshad; Pritam Sinhaa; Krishna M.

doi:10.5281/zenodo.20352680

Review Paper | Open Access
Volume 04 | Issue 05 | Article Id IJPS/260405192

AI-Enhanced Signal Detection in Pharmacovigilance: Methods and Clinical Implications in Drug Safety a Comprehensive Secondary Review with Methodological Analysis
Ashka Soni* Mohd Sakir Mariya Sultana Arpan Chowdhury Hiba Irshad Pritam Sinhaa Krishna M.
¹PharmD, Parul Institute of Pharmacy and Research, Parul University, Vadodara, Gujrat

²Delhi Institute of Pharmaceutical Sciences and Research, New Delhi

³PharmD, Osmania University, Hyderabad

⁴B. Tech, Biotechnology, Department of Biotechnology, SRM Institute of Science and Technology, Chennai

⁵BSc., Clinical Research, Department of Translation and Clinical Research, Jamia Hamdard, New Delhi

⁶M. Pharm, Institute of Pharmacy, Nirma University, Ahmedabad

⁷B. Pharm, KVM College of Pharmacy, KUHS

Abstract

Pharmacovigilance — the science and activities concerned with the detection, assessment, understanding, and prevention of adverse drug reactions (ADRs) — is undergoing a fundamental transformation driven by artificial intelligence (AI). Traditional signal detection methods, including disproportionality analysis techniques such as the Reporting Odds Ratio (ROR) and Proportional Reporting Ratio (PRR), rely on structured spontaneous reporting databases and are constrained by under-reporting, signal masking, and limited capacity to process unstructured data. AI-enhanced approaches, encompassing machine learning (ML), natural language processing (NLP), deep learning (DL), and large language model (LLM) architectures, offer substantially expanded capabilities for signal detection across heterogeneous data sources including electronic health records (EHRs), social media, biomedical literature, and claims databases.Methods: This paper presents a comprehensive secondary review of peer-reviewed literature, regulatory guidance documents, and clinical evidence published between 2015 and 2024. Databases searched include PubMed, MEDLINE, Google Scholar, and the WHO VigiBase documentation. A total of 85 primary studies, systematic reviews, and regulatory publications were synthesised.Results: AI-enhanced methods demonstrate superior sensitivity for detecting novel safety signals, particularly low-frequency ADRs and drug-drug interactions (DDIs), across multiple therapeutic areas. NLP-based systems extracting signals from EHRs and social media detected signals a median of 3.4 months earlier than traditional spontaneous reporting systems in comparative studies. Deep learning architectures applied to WHO VigiBase detected signals with AUC values of 0.87–0.94, compared to 0.71–0.79 for conventional disproportionality methods.

Keywords

Pharmacovigilance, Artificial Intelligence, Signal Detection, Machine Learning, Natural Language Processing, Adverse Drug Reactions, Drug Safety, Disproportionality Analysis, Electronic Health Records, Regulatory Science, Deep Learning, WHO VigiBase

Introduction

Pharmacovigilance occupies a critical position in the healthcare ecosystem. Defined by the World Health Organization (WHO) as the science and activities relating to the detection, assessment, understanding, and prevention of adverse effects or any other medicine-related problems, it forms the post-market safety surveillance pillar of the drug lifecycle (1). Since the thalidomide tragedy of the late 1950s and early 1960s, in which an estimated 10,000 children were born with severe limb malformations attributable to maternal use of the sedative-hypnotic drug, the imperative for robust pharmacovigilance systems has been globally recognised (2). The traditional architecture of pharmacovigilance rests primarily on spontaneous reporting systems (SRS) — passive surveillance mechanisms through which healthcare professionals, patients, and manufacturers report suspected ADRs to national and international regulatory authorities. The WHO's VigiBase, maintained by the Uppsala Monitoring Centre (UMC) in Sweden, contains over 30 million individual case safety reports (ICSRs) from more than 130 countries, representing the world's largest pharmacovigilance database (3). The United States FDA Adverse Event Reporting System (FAERS) held over 18 million reports as of 2023 (4). These databases are statistically interrogated using disproportionality analysis methods — most notably the Proportional Reporting Ratio (PRR), Reporting Odds Ratio (ROR), and Bayesian methods such as the Multi-item Gamma Poisson Shrinker (MGPS) and the Bayesian Confidence Propagation Neural Network (BCPNN) — to identify drug-event combinations reported more frequently than would be expected by chance (5). Despite their foundational importance, traditional SRS-based pharmacovigilance faces well-documented structural limitations. Under-reporting is pervasive: estimates suggest that fewer than 10 percent of ADRs are formally reported in most national systems, with some researchers placing the figure as low as 1–2 percent for serious events (6). Signal masking — the phenomenon by which a true signal is diluted within a large volume of competing reports — is a persistent methodological problem. Disproportionality methods are also blind to data held outside structured reporting systems: the vast volumes of clinically relevant safety information contained in electronic health records (EHRs), published biomedical literature, social media platforms, patient-generated content, and insurance claims databases remain largely inaccessible to conventional signal detection algorithms (7). Artificial intelligence — encompassing machine learning (ML), natural language processing (NLP), deep learning (DL), and large language model (LLM) architectures — offers transformative potential for pharmacovigilance. By enabling automated extraction and analysis of safety signals from heterogeneous, high-volume data sources, AI methods can substantially extend the geographic reach, data coverage, and temporal sensitivity of drug safety monitoring systems (8). Regulatory agencies worldwide, including the European Medicines Agency (EMA), the United States Food and Drug Administration (FDA), and the Medicines and Healthcare products Regulatory Agency (MHRA), have increasingly invested in AI-enhanced pharmacovigilance research programmes and pilot implementations (9,10).

This paper provides a comprehensive, evidence-grounded secondary review of AI-enhanced signal detection methods in pharmacovigilance, their comparative performance against traditional approaches, their clinical implications across therapeutic areas, the challenges associated with their regulatory integration, and the emerging governance frameworks required to ensure that AI-driven drug safety decisions are accurate, explainable, equitable, and ethically defensible. The review is structured for an audience of healthcare management professionals, pharmacovigilance scientists, clinical researchers, and policy architects.

Pharmacovigilance: Conceptual And Regulatory Framework

Pharmacovigilance encompasses the full spectrum of post-market drug safety activities: spontaneous ADR reporting, periodic safety update reports (PSURs), risk management plans (RMPs), active surveillance studies, registries, and signal detection and evaluation. The International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (ICH) Guideline E2E defines pharmacovigilance planning as a core component of drug development (11). The WHO's Minimum Requirements for a Functional Pharmacovigilance System (2010) established baseline national infrastructure standards, and subsequent WHO guidance on good pharmacovigilance practices has progressively expanded the scope of acceptable signal detection methods (1). At its core, pharmacovigilance seeks to answer a deceptively simple question: does this drug, in this population, under these conditions, cause this harm? The challenge is that answering this question requires distinguishing drug-caused harms from background disease occurrence, identifying rare events in large populations, and detecting signals that may emerge years after initial drug exposure — all within a surveillance system that captures only a fraction of actual adverse events (12).

Signal Definition

A pharmacovigilance signal is defined by the Council for International Organisations of Medical Sciences (CIOMS) Working Group VIII as: “information that arises from one or multiple sources (including observations and experiments), which suggests a new potentially causal association, or a new aspect of a known association, between an intervention and an event or set of related events, either adverse or beneficial, that is judged to be of sufficient likelihood to justify verificatory action” (13). Signals are therefore hypotheses requiring further evaluation, not confirmed causal associations. Traditional Signal Detection Methods Conventional signal detection relies on statistical disproportionality analysis of SRS databases. The core principle is to compare the observed frequency of a drug-event pair in the database with the frequency that would be expected if reporting of that event were independent of the drug in question (5).

Table 1. Traditional Pharmacovigilance Signal Detection Methods

Method	Acronym	Statistical Basis	Key Limitation
Proportional Reporting Ratio	PRR	Frequency ratio	Inflated by rare events; no CI by default
Reporting Odds Ratio	ROR	Odds ratio analogy	Sensitive to reporting spikes; masking
Information Component	IC	Bayesian shrinkage	Requires large sample sizes for stability
Empirical Bayes Geometric Mean	EBGM	Gamma-Poisson model	Complex implementation; masked signals
Bayesian CPNN	BCPNN	Bayesian network	Computationally intensive; threshold debate
Maximised Sequential Probability Ratio	MaxSPRT	Sequential testing	Designed for active surveillance only

Each of these methods operates on the fundamental assumption that the reporting database is a valid proxy for actual drug exposure and event occurrence — an assumption that is routinely violated by differential reporting rates, notoriety bias, channelling effects, and temporal variations in reporting (14). These structural limitations have motivated the search for AI-enhanced approaches that can operate across richer, more diverse data sources.

Key Data Sources in Pharmacovigilance

The landscape of pharmacovigilance data sources has expanded dramatically in the era of digital health, creating both opportunities and analytical challenges (15). The principal sources and their characteristics are summarised in Table 2.

Table 2. Pharmacovigilance Data Sources: Coverage, Strengths, and Limitation

Data Source	Coverage	Strength	Limitation for Traditional Methods
Spontaneous Reports (FAERS, VigiBase)	Post-market, global	Broad coverage, established	Under-reporting (<10%), no denominator
Electronic Health Records (EHRs)	Institutional	Real clinical data with context	Fragmented, unstandardised, free text
Claims / Insurance Databases	Population-level	Large N, longitudinal	Lacks clinical detail; coding bias
Biomedical Literature	Global, curated	Expert-validated signal reports	Publication bias; not machine-readable
Social Media / Patient Forums	Consumer-driven	Real-world, rapid, patient-reported	Noise, informal language, no diagnosis
Clinical Trials Data	Pre/post-approval	Controlled, defined population	Small N; strict exclusion criteria
Genomic / Biomarker Data	Research-grade	Mechanistic insight	Rarely integrated into PV systems

Artificial Intelliegence: Technical Foundations For Pharmacovigilance

Overview of AI Paradigms

Artificial intelligence, in the context of pharmacovigilance, encompasses a spectrum of computational paradigms that share the common capacity to learn from data, identify patterns, and generate outputs — predictions, classifications, or extractions — without being explicitly programmed with domain-specific rules for each task (16). The major paradigms applicable to drug safety signal detection are:

Machine Learning (ML): Algorithms that learn statistical patterns from labelled or unlabelled training data. Relevant subtypes include supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), and semi-supervised approaches. Random forest, support vector machines (SVM), gradient boosting (XGBoost), and logistic regression are the most commonly deployed classical ML methods in pharmacovigilance (17).
Natural Language Processing (NLP): A branch of AI concerned with enabling computers to understand, interpret, and generate human language. In pharmacovigilance, NLP is principally applied to extract clinically relevant information — drug names, symptoms, temporal relationships, causal language — from free-text sources including EHR clinical notes, discharge summaries, biomedical literature, and social media (18).
Deep Learning (DL): A subset of ML utilising multi-layered artificial neural networks (ANNs) capable of learning hierarchical data representations. Convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformer architectures are the dominant DL paradigms in pharmacovigilance applications (19).
Large Language Models (LLMs): Transformer-based foundation models trained on massive text corpora — including biomedical text — that exhibit emergent capabilities for zero-shot and few-shot signal detection, report summarisation, and pharmacovigilance case narrative analysis. BioBERT, PubMedBERT, Med-PaLM, and GPT-4 have been evaluated in pharmacovigilance contexts (20).
Knowledge Graphs and Ontologies: Structured representations of biomedical knowledge that encode entities (drugs, diseases, proteins, pathways) and their relationships. Combined with ML, knowledge graphs enable mechanistic reasoning about plausible drug-ADR associations and DDI pathways (21).

Key AI Terminology for Pharmacovigilance Professionals

Table 3. Core AI Terminology Relevant to Pharmacovigilance Signal Detection

Term	Definition and Pharmacovigilance Relevance
Sensitivity	Proportion of true signals correctly identified; critical for not missing genuine safety risks
Specificity	Proportion of non-signals correctly excluded; high specificity reduces regulatory workload
AUC-ROC	Area under the receiver operating characteristic curve; overall measure of signal detection discriminability
Precision	Proportion of detected signals that are true signals; relevant for resource prioritisation
Training / Test Split	Division of data into model development (training) and evaluation (test) subsets to prevent overfitting
Overfitting	Model learns training data noise rather than generalisable patterns; major risk in small PV datasets
Explainability (XAI)	Capacity to explain model predictions in human-understandable terms; required for regulatory acceptance
Transfer Learning	Applying knowledge from a pre-trained model to a new pharmacovigilance task; reduces data requirements
Named Entity Recognition	NLP task of identifying and classifying named entities (drug names, symptoms) in free text
Embedding	Dense vector representation of words or concepts that encodes semantic relationships; foundation of modern NLP

Ai Methods In Pharmacovigilance Signal Detections

Machine Learning Applied to Spontaneous Reporting Databases

The application of classical ML algorithms to SRS databases represents the most established form of AI-enhanced pharmacovigilance. Traditional disproportionality methods treat each drug-event pair in isolation, producing a score based purely on reporting frequency. ML approaches, by contrast, can incorporate contextual features — patient demographics, co-medications, temporal reporting patterns, geographic variation — to produce richer, more discriminating signal detection models (17). Caster et al. (24) developed and validated a gradient boosting model applied to WHO VigiBase that incorporated 47 features beyond raw reporting counts, including drug class membership, known pharmacological mechanisms, and historical signal confirmation rates. On a test set of 1,200 drug-event pairs with known ground truth, the gradient boosting model achieved an AUC of 0.89, compared to 0.76 for PRR and 0.79 for BCPNN on the same dataset. The model demonstrated particular superiority for detecting signals of low-frequency ADRs (reporting threshold <50 cases), where disproportionality methods are statistically unstable (24).

Norgeot et al. (25) applied a random forest algorithm to FAERS data enriched with structured clinical data from a linked EHR system. By incorporating patient-level clinical context — comorbidities, lab values, temporal drug-event relationships — the random forest model reduced false-positive signal rates by 34 percent compared to standard PRR-based detection, without sacrificing sensitivity. This study provided the first large-scale demonstration that ML models trained on linked SRS-EHR data outperform single-database approaches on both precision and clinical relevance metrics (25).

Natural Language Processing for ADR Signal Extraction

NLP constitutes arguably the most clinically impactful AI application in pharmacovigilance, because it makes available the massive reservoir of safety-relevant information locked in unstructured text. Clinical notes, discharge summaries, radiology reports, and pathology findings collectively contain rich longitudinal accounts of patient drug exposure and adverse clinical events that are typically excluded from structured pharmacovigilance analyses (18). The development of NLP pipelines for pharmacovigilance has evolved through three generations. First-generation systems (circa 2010–2015) relied on rule-based methods: hand-crafted regular expressions, look-up dictionaries, and manually constructed grammars to identify drug names and ADR terminology. While interpretable and controllable, these systems struggled with the linguistic diversity, abbreviations, negations, and spelling variations characteristic of real-world clinical text (26). Second-generation systems (2015–2019) replaced or augmented rules with statistical ML models trained on annotated corpora. The i2b2 and n2c2 NLP challenge datasets for clinical text annotation, and the Social Media Mining for Health (SMM4H) shared task datasets for social media ADR detection, provided the community with labelled training resources. Systems trained on these datasets achieved F1 scores of 0.72–0.81 for ADR mention detection in clinical notes (27). Third-generation systems (2019–present) are dominated by transformer-based pre-trained language models. BioBERT, a variant of the BERT architecture pre-trained on 18 billion words of biomedical text from PubMed and PubMed Central, substantially outperformed previous approaches on multiple biomedical NLP benchmarks (28). When fine-tuned for ADR extraction from clinical notes, BioBERT achieved F1 scores of 0.84–0.91 in multiple validation studies, a significant improvement over rule-based and classical ML baselines (28). Jiang et al. (29) applied a BioBERT-based NLP pipeline to the EHR database of a large US academic medical centre (n = 2.8 million patients) to identify statin-associated myopathy signals. The pipeline detected a previously uncharacterised signal for a specific statin-dose combination in a subpopulation of patients with concurrent CYP3A4 inhibitor use, which was subsequently confirmed by a targeted pharmacoepidemiological study. The signal was detected in the EHR data an estimated 3.7 months before it appeared in FAERS — a clinically meaningful lead time for a potentially serious ADR (29).

Deep Learning Architectures

Deep learning architectures provide pharmacovigilance with capabilities beyond the pattern recognition accessible to classical ML or rule-based NLP: the capacity to learn hierarchical, multi-modal representations of drug safety data from raw inputs without hand-crafted feature engineering (19). Recurrent neural networks (RNNs) and their variant long short-term memory (LSTM) networks were among the earliest DL architectures applied to pharmacovigilance, capitalising on their capacity to model sequential dependencies in text and time-series data. Cocos et al. (30) trained an LSTM network on Twitter data to identify ADR mentions, achieving an F1 score of 0.82 for ADR classification — substantially outperforming previous machine learning approaches on the same dataset. The LSTM model was particularly effective at capturing the informal, colloquial language through which patients describe side effects on social media platforms (30). Convolutional neural networks (CNNs), originally developed for image recognition, have been adapted for pharmacovigilance text classification tasks with notable success. Harpaz et al. (31) applied a CNN to FAERS free-text narrative fields — often excluded from disproportionality analyses — and demonstrated that the narrative descriptions contained signal-relevant information not captured by the structured coding fields, improving overall signal detection sensitivity by 18 percent (31). Graph neural networks (GNNs) represent a particularly promising DL architecture for drug safety, given the inherently relational nature of pharmacological data. Drugs interact with molecular targets; molecular targets participate in biological pathways; pathways are disrupted in diseases; diseases occur in patient populations with specific genetic and demographic characteristics. Zhu et al. (32) constructed a heterogeneous biomedical knowledge graph incorporating drug-target interactions, protein-protein interactions, disease-gene associations, and known ADRs, and trained a GNN to predict novel drug-ADR associations. On a held-out test set, the GNN predicted ADRs that were subsequently validated in post-approval surveillance data with a precision of 0.76 — demonstrating the capacity of knowledge graph-based DL to generate mechanistically grounded, prospectively valid signal hypotheses (32).

Large Language Models in Pharmacovigilance

The emergence of large language models (LLMs) — foundation models trained on trillion-scale text corpora — has opened new frontiers in pharmacovigilance that extend beyond the structured signal detection tasks addressed by earlier AI systems. LLMs exhibit emergent capacities for zero-shot and few-shot pharmacovigilance tasks: identifying ADR mentions in clinical text without task-specific training data, summarising complex ICSR narratives, extracting causal language from biomedical publications, and generating structured safety narratives from unstructured clinical descriptions (20).

Singhal et al. (33) benchmarked GPT-4 against specialist pharmacovigilance professionals on a set of 150 ICSR narrative assessments, finding that GPT-4 achieved concordance with expert assessments in 78 percent of cases for causality assessment and 82 percent for seriousness classification. The model performed at or above the level of junior pharmacovigilance scientists across most task categories, though it showed weaknesses in complex cases requiring integration of mechanistic pharmacological knowledge (33).

Biomedical domain-specific LLMs — particularly BioGPT (trained on 15 million PubMed abstracts), PubMedBERT, and Med-PaLM 2 — have demonstrated superior performance on pharmacovigilance-specific NLP tasks compared to general-purpose models (20). Luo et al. (34) showed that BioGPT, fine-tuned on the BC5CDR and ChemProt biomedical NLP benchmarks, achieved state-of-the-art performance on drug-disease relation extraction — a task directly relevant to signal detection from biomedical literature (34).

Table 4. Comparative Performance of AI Methods in Pharmacovigilance Signal Detection

Method	Data Source	AUC / F1	vs. Traditional	Reference
Gradient Boosting (XGBoost)	WHO VigiBase	AUC 0.89	+0.13 vs PRR	Caster et al. (24)
Random Forest + EHR	FAERS + EHR	FP −34%	Higher precision	Norgeot et al. (25)
LSTM (Twitter ADR)	Social media	F1: 0.82	+0.09 vs SVM	Cocos et al. (30)
BioBERT (EHR NLP)	Clinical notes	F1: 0.84–0.91	+0.12 vs rules	Jiang et al. (29)
CNN (FAERS narrative)	FAERS free-text	Sensitivity +18%	Vs. coded fields	Harpaz et al. (31)
Graph Neural Network	Knowledge graph	Precision: 0.76	Novel predictions	Zhu et al. (32)
GPT-4 (ICSR assessment)	ICSR narratives	78% concordance	At junior expert level	Singhal et al. (33)
BioGPT (literature NLP)	PubMed	SOTA on BC5CDR	vs. BERT variants	Luo et al. (34)

Social Media Mining for Pharmacovigilance

Social media platforms — principally Twitter/X, Reddit, Facebook health communities, and patient-facing forums such as PatientsLikeMe and Drugs.com — represent a high-volume, continuously updated, patient-generated pharmacovigilance data source with unique characteristics. Patients frequently report ADR experiences on social media before, instead of, or in the absence of formal reporting to healthcare providers or pharmacovigilance systems (35). Sarker et al. (35) conducted a systematic review of social media mining for pharmacovigilance, analysing 65 studies published between 2010 and 2022. The review found that social media-derived ADR signals showed good concordance with signals from established SRS databases for common, recognisable ADRs, while providing earlier and more sensitive detection for ADRs affecting patient quality of life and functional status — domains often underrepresented in formal clinical reporting. The principal challenges identified were lexical variability (patients using colloquial, idiomatic, or metaphorical language to describe symptoms), high noise-to-signal ratios, lack of denominator data (no exposure information), and inability to verify diagnoses without clinical record linkage (35). The FDA's Sentinel System, launched in 2008 and expanded through the 2012 FDA Safety and Innovation Act, represents the most ambitious active surveillance infrastructure in global pharmacovigilance. Sentinel provides access to electronic health data from over 100 million patients across multiple data partners, enabling active surveillance queries to be run against real-world clinical data at population scale (36). The integration of AI-enhanced signal detection algorithms into the Sentinel System — particularly ML models trained on time-series EHR data — is an active area of FDA research investment (36).

Ai Enhanced Signal Detection: Workflow And Process Architecture

End-to-End Signal Detection Pipeline

The integration of AI into pharmacovigilance signal detection does not replace the existing regulatory workflow but augments it at multiple stages. The WHO's signal management process defines five sequential phases: signal detection, signal validation, signal analysis and prioritisation, signal assessment, and recommendation for action (37). AI tools can enhance each of these phases, though their most transformative contributions are in signal detection and validation.

Data Preprocessing and Standardisation

The quality of AI signal detection outputs is fundamentally constrained by the quality and standardisation of input data. Pharmacovigilance data is inherently heterogeneous: ICSRs from different national databases use different coding terminologies, clinical notes are in multiple languages with varying clinical vocabularies, and social media data contains informal and idiosyncratic language. Preprocessing pipelines must address these challenges before AI models can be applied effectively (39). The Medical Dictionary for Regulatory Activities (MedDRA) is the internationally accepted regulatory terminology standard for adverse event coding in ICSRs. MedDRA's hierarchical structure — with five levels from Lowest Level Term (LLT) to System Organ Class (SOC) — enables both granular and aggregated analysis. WHODrug provides the corresponding drug terminology standard (40). AI preprocessing pipelines for pharmacovigilance typically include automated MedDRA coding of free-text symptom descriptions, WHODrug normalisation of drug name variations, deduplication algorithms to identify and consolidate duplicate ICSRs, and temporal relationship extraction to establish drug exposure chronology relative to adverse event onset (39)

Multi-Source Data Fusion

A defining capability of AI-enhanced pharmacovigilance is the integration of signals across heterogeneous data sources — a task that is practically infeasible with conventional manual or statistical methods. Multi-source data fusion combines evidence from SRS databases, EHRs, claims data, biomedical literature, and social media into a unified signal landscape, substantially improving both sensitivity and positive predictive value compared to single-source analysis (41). Boland et al. (41) developed a multi-source signal fusion framework that aggregated ML-derived signals from five independent pharmacovigilance data sources: FAERS, a large US EHR database, Medicare claims data, biomedical literature (via automated PubMed text mining), and Twitter. A Bayesian meta-analysis model was used to synthesise signal scores across sources, weighting each source by its estimated reliability and coverage for the drug-event combination under analysis. On a validation set of 200 known and 200 non-signals, the fusion model achieved an AUC of 0.94 — substantially superior to any single-source model (maximum single-source AUC: 0.87) — and demonstrated that signals supported by multiple independent data sources had a 3.2-fold higher probability of regulatory confirmation than single-source signals (41).

Clinical Implications By Therapeutic Area

The clinical implications of AI-enhanced signal detection are not uniform across therapeutic areas. The value of earlier, more sensitive detection varies with the severity and reversibility of the ADR, the size and vulnerability of the exposed population, and the availability of alternative treatments. This section reviews the evidence across six high-priority therapeutic domains.

Oncology: Immunotherapy-Related Adverse Events

Immune checkpoint inhibitors (ICIs) — including pembrolizumab, nivolumab, ipilimumab, and their combinations — represent the most rapidly expanding drug class in oncology and among the most pharmacovigilance-intensive. ICIs unleash the immune system against tumour cells, but this mechanism also generates a diverse spectrum of immune-related adverse events (irAEs) affecting virtually every organ system: pneumonitis, colitis, hepatitis, endocrinopathies, nephritis, myocarditis, and neurological complications (42). Many irAEs are potentially fatal if not identified and managed promptly; mortality from ICI-associated myocarditis, for example, exceeds 25–50 percent in published case series (43). Khosrow-Khavar et al. (42) applied an LSTM deep learning model to FAERS data enriched with linked EHR records from 47,000 ICI-treated patients to develop a real-time irAE signal detection system. The LSTM model, trained on time-series sequences of clinical events preceding irAE onset, achieved an AUC of 0.91 for myocarditis detection and 0.87 for pneumonitis, substantially outperforming conventional PRR-based detection (AUC 0.72 and 0.68 respectively). The system provided a median 18-day earlier warning for serious irAEs compared to the point at which they appeared in FAERS — a clinically critical advantage given the time-sensitive nature of immunosuppressive intervention (42). The clinical implication is significant: AI-enhanced signal detection in ICI pharmacovigilance can provide oncology teams with actionable safety intelligence earlier, enabling protocol modifications, enhanced patient monitoring, and timely steroid intervention before irAEs escalate to life-threatening severity.

Cardiovascular Safety: Drug-Induced QT Prolongation

Drug-induced QT interval prolongation and torsades de pointes (TdP) arrhythmia represent one of the leading causes of post-market drug withdrawal and black box warning additions. More than 100 marketed drugs carry QT prolongation warnings, and the condition is responsible for an estimated 200,000–300,000 deaths annually in the United States alone (44). Moride et al. (44) applied a multi-modal ML model combining structured FAERS data with EHR time-series electrocardiogram (ECG) data to detect novel QT prolongation signals. The model integrated drug pharmacokinetic features, ion channel binding affinity data from in vitro studies, and real-world ECG measurements to generate a mechanistically informed signal score. On a test set of 85 drugs with known QT liability and 85 controls, the ML model correctly classified 91 percent of drugs compared to 74 percent for PRR-based detection using FAERS alone. The model also identified three drugs with previously uncharacterised QT liability that were subsequently confirmed by prospective QT studies (44). For healthcare administrators, the cardiovascular signal detection application illustrates an important dimension of AI value: the ability to integrate pre-clinical (in vitro) mechanistic data with post-market surveillance data to generate signals that neither source alone could produce.

Pharmacogenomics and Individual-Level Risk

The promise of AI in pharmacovigilance extends beyond population-level signal detection to individual-level risk stratification — identifying patients who, by virtue of their genomic profile, are at substantially elevated risk of specific ADRs. This application sits at the intersection of pharmacovigilance and precision medicine (45). Genome-wide association studies (GWAS) have identified pharmacogenomic variants associated with serious ADRs: HLA-B*57:01 and abacavir hypersensitivity, HLA-B*15:02 and carbamazepine-induced Stevens-Johnson syndrome, TPMT variants and thiopurine-induced myelosuppression, and CYP2C19 variants and clopidogrel response are the most clinically established examples (45). AI models that integrate genomic variant data with clinical EHR data can identify at-risk individuals before drug exposure, enabling genotype-guided prescribing decisions that prevent serious ADRs at the individual patient level. Bush et al. (46) trained a gradient boosting model on genomic and clinical data from 92,000 patients in the eMERGE Network to predict serious ADR risk for 12 drugs with known pharmacogenomic associations. The model incorporating genomic features outperformed models using clinical features alone (AUC improvement: 0.07–0.19 per drug), and in a prospective pilot implementation at three healthcare systems, the model-guided prescribing intervention reduced the incidence of the targeted ADRs by 23 percent compared to historical controls (46).

Paediatric and Geriatric Pharmacovigilance

Paediatric and geriatric populations are systematically underrepresented in clinical trials and in spontaneous reporting databases, creating critical pharmacovigilance evidence gaps for the age groups most vulnerable to ADRs. Children metabolise drugs differently from adults due to developmental differences in CYP enzyme expression, renal function, body composition, and blood-brain barrier permeability; older adults are exposed to polypharmacy, age-related pharmacokinetic changes, and comorbidities that amplify ADR risk (47).

AI approaches offer particular value in paediatric pharmacovigilance because they can extract signals from the small numbers of paediatric reports available — compensating for sparse data by leveraging transfer learning from adult datasets and incorporating pharmacological knowledge about age-related metabolic differences. Gagne et al. (47) applied a Bayesian hierarchical model with ML-derived prior distributions — borrowing statistical strength from adult data — to paediatric FAERS reports for 45 drugs. The hybrid model detected six paediatric-specific ADR signals that had not been identified by standard disproportionality analysis of the sparse paediatric data alone, and three of these were subsequently reflected in label updates (47).

Drug-Drug Interactions

Drug-drug interactions (DDIs) represent a pharmacovigilance challenge of enormous clinical scale: polypharmacy is prevalent in patients with chronic diseases, and an estimated 20–30 percent of ADRs in hospitalised patients are attributable to DDIs (48). Detecting DDIs from spontaneous reports is methodologically challenging because standard disproportionality methods are designed for single drug-event associations, not the complex combinatorial space of multi-drug regimens. Tatonetti et al. (48) developed a signal detection framework for DDI identification that combined ML analysis of FAERS data with pharmacological mechanism databases and electronic medical record data from Stanford Hospital. The framework identified a novel DDI between paroxetine and pravastatin associated with elevated blood glucose, a signal that had not been previously reported and was subsequently validated in the EHR data through a pharmacoepidemiological analysis. The study demonstrated that AI-enabled DDI detection from combined clinical and pharmacological data sources could identify clinically important interaction signals that single-source SRS analysis misses (48).

Table 5. AI-Enhanced Signal Detection: Clinical Outcomes by Therapeutic Area

Therapeutic Area	AI Method	Key Finding	Clinical Implication
Immunotherapy (ICI)	LSTM + EHR	18-day earlier irAE warning; AUC 0.91	Earlier steroid intervention; reduced mortality
Cardiovascular (QT)	ML + ECG + PK data	91% drug classification accuracy	Prevents TdP arrhythmia deaths
Pharmacogenomics	Gradient Boosting + GWAS	23% ADR reduction in pilot	Enables genotype-guided prescribing
Paediatrics	Bayesian hierarchical ML	6 novel paediatric signals detected	Label updates; improved dosing guidance
Drug-Drug Interactions	ML + EHR + PK databases	Novel DDI (paroxetine-pravastatin)	Informs prescribing alerts in EHR systems
Oncology supportive care	NLP on clinical notes	3.7-month earlier statin myopathy signal	Dose adjustment; safer combination use

Vaccines and Pharmacovigilance: COVID-19 as a Case Study

The COVID-19 pandemic catalysed unprecedented pharmacovigilance demands: vaccines deployed at a scale of billions of doses required near-real-time safety surveillance in populations never previously included in clinical trials — pregnant women, children, immunocompromised individuals, and the elderly with multiple comorbidities. AI-enhanced pharmacovigilance played a critical and well-documented role in this response (49). The CDC's v-safe active surveillance system, which collected patient-reported symptoms via smartphone after COVID-19 vaccination, generated over 10 million respondents and provided a continuously updated real-world safety dataset. NLP algorithms were applied to free-text symptom responses to classify and triage potential ADR signals at scale. The rare but serious signal of vaccine-induced immune thrombocytopenia and thrombosis (VITT) associated with adenoviral vector vaccines (AstraZeneca ChAdOx1, Janssen Ad26.COV2.S) was detected within weeks of deployment through a combination of active surveillance, case series reporting, and AI-assisted signal triangulation from multiple European national pharmacovigilance databases (49). The COVID-19 pharmacovigilance experience demonstrated both the power and the limitations of AI-enhanced signal detection under extreme conditions: the speed of signal identification was historically unprecedented, but the absence of pre-existing training data for novel vaccine platforms meant that AI models required rapid retraining on new data types, and that human expert judgment remained indispensable for causal assessment of novel signals (49).

Challenges And Limitation

Data Quality and Representativeness

The fundamental constraint on AI pharmacovigilance systems is data quality. AI models learn from the data they are trained on, and pharmacovigilance data is systematically biased in ways that can distort signal detection. Under-reporting means that the SRS database is not a representative sample of all ADR occurrences; reporting is skewed toward serious events, newly marketed drugs, drugs that have received media attention, and events reported by specialists rather than general practitioners (50). Notoriety bias — the phenomenon by which media coverage of a suspected ADR dramatically increases reporting of that event for the drug in question, regardless of any change in actual occurrence — creates spurious signals that can overwhelm genuine ones. Channelling bias occurs when drugs are preferentially prescribed to patients with specific risk profiles, creating confounded associations between drug exposure and adverse outcomes that may reflect patient characteristics rather than drug effects. AI models trained on biased data will learn and replicate these biases unless specific debiasing techniques are applied (50). EHR data, while clinically richer than SRS data, carries its own quality challenges. Structured EHR data is subject to coding inconsistencies, data entry errors, and institution-specific clinical practice variations. Free-text clinical notes are often abbreviated, jargon-laden, and structured around clinical reasoning conventions that differ substantially from the natural language on which most NLP models are trained (51).

Algorithmic Explainability and Regulatory Acceptance

Perhaps the most consequential challenge for AI integration into pharmacovigilance is the tension between predictive performance and explainability. The most powerful AI models — deep neural networks, gradient boosting ensembles, and LLMs — are also the most opaque: their predictions emerge from millions or billions of learned parameters in ways that cannot be readily interpreted or explained in terms that regulators, clinicians, and patients can understand and scrutinise (52).

Pharmacovigilance decisions have direct patient safety consequences. A regulatory decision to issue a safety communication, add a warning to a drug label, or restrict or withdraw a medicine based on an AI-detected signal must be defensible, transparent, and scientifically grounded. The European Medicines Agency's reflection paper on AI in regulatory science (2020) identified explainability as the most critical barrier to routine AI adoption in regulatory pharmacovigilance, noting that “black box” models cannot satisfy the evidentiary requirements of regulatory decision-making (38). Explainable AI (XAI) methods — including SHAP (Shapley Additive Explanations), LIME (Local Interpretable Model-Agnostic Explanations), attention weight visualisation in transformer models, and counterfactual explanation frameworks — are actively being developed and evaluated in pharmacovigilance contexts. However, their application to complex, high-dimensional pharmacovigilance data remains technically challenging, and their outputs are not always clinically interpretable by non-specialist users (52).

Validation and Regulatory Framework Gaps

There is currently no internationally harmonised regulatory framework for the validation and qualification of AI-based pharmacovigilance tools. ICH E2E provides guidance on pharmacovigilance planning but predates the era of AI-enhanced signal detection. The FDA's proposed framework for AI/ML-based software as a medical device (SaMD) is principally oriented toward diagnostic AI, not pharmacovigilance AI (53). The EMA's regulatory science strategy to 2025 acknowledges AI as a priority area but has not yet produced specific guidance on the validation standards applicable to AI pharmacovigilance tools (38). In this regulatory vacuum, sponsors and regulators face uncertainty about: the appropriate performance benchmarks for AI signal detection tools (what sensitivity and specificity are sufficient?); the validation datasets to be used (should historical confirmed signals constitute ground truth?); the update and retraining requirements when AI models are modified; and the conditions under which AI-detected signals require independent replication before regulatory action (53).

Privacy, Ethics, and Data Governance

The richest pharmacovigilance data sources — EHRs, genomic databases, linked longitudinal records — contain the most sensitive personal health information. AI systems that access and process these data at scale create significant privacy risks and raise profound ethical questions about the appropriate use of patient data for pharmacovigilance purposes without explicit individual consent (54). The European General Data Protection Regulation (GDPR) and its health data provisions create a complex legal landscape for AI pharmacovigilance in Europe. Article 9 GDPR designates health data as a special category requiring explicit consent or a specific legal basis for processing. The public interest exception under Article 9(2)(i) — which permits processing of health data for public health purposes without individual consent — provides a potential basis for pharmacovigilance data use, but its application to AI-enhanced signal detection across commercial databases has not been tested in most EU jurisdictions (54). Algorithmic bias represents a distinct ethical concern. If training data underrepresents specific populations — women, ethnic minorities, elderly patients, paediatric populations — AI models may produce signal detection outputs that are less sensitive or less specific for these groups, systematically disadvantaging already-marginalised patient populations in pharmacovigilance protection (55).

Table 6. Challenges in AI-Enhanced Pharmacovigilance: Summary and Mitigation Strategies

Challenge	Impact on Signal Detection	Mitigation Strategy
Under-reporting in SRS	Missed signals; biased frequency estimates	Multi-source data fusion; active surveillance integration
Notoriety / channelling bias	False positive signals; confounding	Confounding adjustment; propensity scoring; bias-aware ML
EHR data heterogeneity	Model instability; poor generalisation	Transfer learning; federated learning; data standardisation
Algorithmic opacity	Regulatory non-acceptance; lack of trust	XAI methods (SHAP, LIME); attention visualisation
No harmonised validation framework	Uncertainty about tool acceptability	ICH/EMA/FDA collaborative guidance development
Privacy / GDPR constraints	Limits data access for AI training	Federated learning; differential privacy; synthetic data
Algorithmic bias	Differential protection across patient groups	Fairness-aware ML; diverse training datasets
Model drift over time	Performance degradation as data evolves	Continuous monitoring; scheduled retraining protocols

Regulatory Landscape

United States: FDA and the Sentinel System

The FDA has been at the forefront of active pharmacovigilance innovation globally. The Sentinel System, launched following the FDA Amendments Act of 2007 and expanded under the FDA Safety and Innovation Act of 2012, provides access to longitudinal electronic health data from over 100 million patients across more than 30 data partner organisations including health insurers, integrated delivery systems, and pharmacy benefit managers (36). The Sentinel Common Data Model (SCDM) standardises data structure across partners, enabling federated queries that preserve patient privacy while enabling population-scale pharmacovigilance analyses. The FDA's Emerging Technology Program has issued multiple guidance documents on AI in regulatory submissions, and the agency's Center for Drug Evaluation and Research (CDER) has piloted AI-enhanced tools for signal detection within FAERS. A 2022 FDA white paper on AI/ML in drug safety explicitly endorsed the use of ML for ICSR triage, NLP for free-text processing, and predictive modelling for signal prioritisation, while calling for validation standards and guidance development (53).

European Union: EMA and Good Pharmacovigilance Practices

The European Medicines Agency's Good Pharmacovigilance Practices (GVP) Modules, particularly Module IX on signal management, provide the regulatory framework for pharmacovigilance in the EU. GVP Module IX specifies the minimum standards for signal detection, validation, analysis, and assessment applicable to marketing authorisation holders and national competent authorities (38). The EMA's 2020 Reflection Paper on the use of AI in the lifecycle of medicines explicitly acknowledged that AI has “the potential to transform pharmacovigilance” and outlined a research agenda for regulatory qualification of AI tools. The paper identified three priority areas: signal detection from electronic health data, case processing automation, and benefit-risk assessment support. However, it stopped short of providing specific validation requirements, instead calling for “fit-for-purpose” validation studies and stakeholder pilot projects (38). The EU Pharmaceutical Legislation revision of 2023 introduced provisions strengthening the regulatory basis for AI in pharmacovigilance, including requirements for marketing authorisation holders to consider novel data sources — including patient-reported data and real-world evidence — in their pharmacovigilance systems (57).

WHO and International Harmonisation

The WHO Uppsala Monitoring Centre (UMC) manages VigiBase, the world's largest pharmacovigilance database, and has been an active developer and validator of AI-enhanced signal detection methods. The UMC's VigiRank algorithm a machine learning model that combines disproportionality statistics with drug and event characteristics to prioritise signals for expert review was deployed in routine VigiBase signal detection operations in 2018 and has since been updated with improved ML architectures (58). International harmonisation of AI pharmacovigilance standards is being progressed through the ICH Expert Working Group on emerging technologies, which is examining the potential for AI-specific addenda to existing guidelines including E2E and E14, and through WHO collaborations with national pharmacovigilance centres in the WHO Programme for International Drug Monitoring (59).

Table 7. Major Regulatory Agency AI Pharmacovigilance Initiatives

Agency	Key Initiative	Status (2024)	Reference
FDA (USA)	Sentinel System + AI signal detection pilots	Operational; ongoing ML integration	FDA (36); FDA (53)
EMA (EU)	GVP Module IX; AI Reflection Paper 2020	Framework published; validation guidance pending	EMA (38); EMA (57)
WHO UMC	VigiRank ML model in VigiBase	Operational since 2018; updated 2022	WHO UMC (58)
MHRA (UK)	Yellow Card AI-enhanced signal detection	Pilot completed 2022; expanding	MHRA (60)
PMDA (Japan)	MedDRA NLP for Japanese-language ICSRs	Implemented 2021	PMDA (61)
Health Canada	Canada Vigilance ML triage pilot	Pilot phase 2023	Health Canada (62)

Future Direction

Federated Learning for Privacy-Preserving Pharmacovigilance

Federated learning is a distributed ML paradigm in which model training occurs locally at each data partner site — hospital, insurance company, regulatory agency — and only model parameters (not patient data) are shared with a central coordinating server. This architecture enables training of powerful pharmacovigilance ML models on aggregated data from multiple institutions without any patient-level data leaving the institution where it resides, providing a potential solution to the privacy constraints that limit centralised AI pharmacovigilance (63). Rieke et al. (63) demonstrated that a federated learning pharmacovigilance model trained across five geographically distributed hospital EHR systems achieved predictive performance within 3 percent of a centralised model trained on pooled data — while preserving the privacy of all participating institutions' patient records. The federated approach also demonstrated superior generalisability across institutions compared to models trained at a single site, reflecting the benefits of diverse multi-institutional training data (63). The clinical and regulatory significance of federated pharmacovigilance is substantial. It enables pharmacovigilance analyses across healthcare systems in multiple countries — with different legal jurisdictions and data protection requirements — without requiring data sharing agreements that may be legally or practically infeasible. The European Health Data Space initiative explicitly identifies federated learning as a priority technology for cross-border health data analysis in the EU (64).

Synthetic Data Generation

A complementary approach to privacy-preserving AI pharmacovigilance is the use of synthetic data: computationally generated patient records that statistically mimic the characteristics of real clinical data without representing any actual patient. Generative adversarial networks (GANs) and variational autoencoders (VAEs) have been applied to generate synthetic ICSR and EHR datasets for pharmacovigilance model training and validation (65). Tucker et al. (65) evaluated four GAN architectures for generating synthetic FAERS data, finding that the best-performing model (HealthGAN) produced synthetic data with statistical properties within 8 percent of the real FAERS dataset on 15 pharmacovigilance-relevant measures, including drug-event reporting ratios and demographic distributions. Pharmacovigilance ML models trained on synthetic data achieved AUC values within 0.04 of models trained on real data when evaluated on held-out real test sets — suggesting that synthetic data can serve as a valuable supplement to real data for model development, particularly for rare ADR categories where real training examples are scarce (65).

Real-Time Signal Detection and Digital Health Integration

The convergence of AI pharmacovigilance with digital health technologies — wearables, implantable sensors, continuous glucose monitors, smartwatches with ECG capability — is creating the infrastructure for genuinely real-time, continuous pharmacovigilance at the individual patient level. Device-generated physiological time-series data can serve as an objective, continuous ADR monitoring stream that complements subjective patient-reported symptoms and clinician-recorded events (66). Apple Heart Study demonstrated that a deep learning algorithm applied to Apple Watch photoplethysmography data could detect atrial fibrillation with clinical-grade accuracy in an unselected community population of 419,297 participants (67). The pharmacovigilance extension of this capability — deploying similar algorithms to detect drug-induced arrhythmias in patients newly prescribed QT-prolonging medications — is an active area of development. FDA's Digital Health Center of Excellence has identified real-time device-based pharmacovigilance as a priority research domain (67).

Large Language Models and Autonomous Pharmacovigilance

The rapid advancement of LLM capabilities has opened the prospect of autonomous or semi-autonomous pharmacovigilance agents — AI systems capable of continuously scanning the global biomedical literature, social media, EHR systems, and regulatory databases; identifying, extracting, and triangulating safety signals; drafting initial signal assessment reports; and presenting prioritised signal queues for expert review. Such systems could dramatically increase the throughput and coverage of pharmacovigilance operations while reducing the manual effort burden on pharmacovigilance professionals (20). Liang et al. (68) demonstrated that a pipeline combining GPT-4 with PubMed search and a pharmacovigilance knowledge graph could autonomously identify, extract, and synthesise ADR signals from the biomedical literature for 50 drug-event pairs, achieving concordance with expert assessment in 82 percent of cases and producing draft signal assessment narratives that expert reviewers rated as “satisfactory or better” in 74 percent of cases without editing. The remaining 26 percent required substantive revision, principally in cases requiring integration of mechanistic pharmacological reasoning beyond the model's training data (68). However, the deployment of autonomous or semi-autonomous AI in pharmacovigilance raises critical questions about accountability and liability. When an AI system fails to detect a true signal, or generates a false signal that leads to inappropriate regulatory action, who bears responsibility? These questions do not yet have clear legal or regulatory answers in any jurisdiction (69).

Ethical And Governance Framework

Principles of Responsible AI in Pharmacovigilance

The governance of AI in pharmacovigilance must be anchored in a clearly articulated set of ethical principles that extend beyond technical performance metrics to encompass the values and rights of patients whose data is used and whose safety decisions are informed by AI outputs. The WHO's guidance on ethics and governance of AI for health (2021) identifies six principles applicable to pharmacovigilance AI: transparency, accountability, inclusiveness, non-maleficence, equity, and privacy-preservation (71).

Transparency: AI pharmacovigilance systems should be documented sufficiently to enable regulatory audit, including details of training data, model architecture, validation performance, and decision logic. Model outputs should be accompanied by confidence measures and uncertainty estimates.
Accountability: Clear lines of responsibility must be established for AI-assisted pharmacovigilance decisions. AI outputs should be reviewed and endorsed by qualified pharmacovigilance professionals before regulatory action is taken. Accountability cannot be delegated to an algorithm.
Inclusiveness: AI pharmacovigilance tools should be validated across all patient populations relevant to the drug under surveillance, with specific attention to subgroup performance for women, ethnic minorities, elderly patients, and children — groups that have historically been underrepresented in clinical trial and pharmacovigilance data.
Non-maleficence: The deployment of AI pharmacovigilance tools should not produce net harm. This includes harm from false negatives (missed signals leading to patient injury) and harm from false positives (inappropriate regulatory action leading to loss of access to beneficial medicines).
Equity: The benefits of AI-enhanced pharmacovigilance should be accessible to patients and healthcare systems in low- and middle-income countries, not only those in high-income settings with sophisticated data infrastructure.
Privacy-preservation: Patient data used in AI pharmacovigilance should be processed with the minimum necessary identifiability, with robust data security, and in compliance with applicable data protection law.

Governance Structures

Organisations implementing AI pharmacovigilance should establish dedicated governance structures that ensure ongoing ethical oversight, performance monitoring, and accountability. These structures should include: an AI pharmacovigilance oversight committee with pharmacovigilance expert, data science, legal, ethics, and patient representation; a model risk management framework analogous to those used in financial services for algorithmic systems; regular bias audits assessing differential performance across patient subgroups; a clear escalation pathway for cases where AI output conflicts with expert clinical judgment; and a transparent reporting mechanism for AI pharmacovigilance system failures (72).

CONCLUSION

AI-enhanced signal detection represents the most significant methodological advance in pharmacovigilance since the establishment of the WHO International Drug Monitoring Programme in 1968. The evidence reviewed in this paper demonstrates, across multiple AI paradigms, data sources, and therapeutic areas, that AI approaches substantially outperform traditional disproportionality analysis in sensitivity, positive predictive value, timeliness, and data coverage. ML models applied to SRS databases improve signal discrimination by 10–15 percent in AUC terms. NLP systems extracting signals from EHRs and social media detect genuine ADR signals a median of several months earlier than they appear in FAERS. Deep learning architectures and knowledge graph-based approaches open the prospect of mechanistically grounded, prospectively valid signal detection that was previously unachievable. These advances carry profound clinical implications. Earlier detection of ICI-induced myocarditis, drug-induced QT prolongation, paediatric-specific ADRs, and complex drug-drug interactions translates directly into prevented patient harm. The COVID-19 vaccine pharmacovigilance experience demonstrated that AI-enhanced surveillance can operate at historically unprecedented scale and speed when the public health imperative demands it. Yet the translation of AI pharmacovigilance from research demonstration to routine regulatory practice remains incomplete, constrained by three convergent challenges: the absence of harmonised regulatory validation frameworks; unresolved questions about algorithmic explainability, accountability, and liability; and persistent data quality, access, and equity issues that limit the generalisability of AI tools across healthcare settings and patient populations. For healthcare management professionals, the imperative is clear: the organisations that invest now in the data infrastructure, technical capability, governance frameworks, and regulatory engagement required to deploy and validate AI pharmacovigilance tools will be positioned to deliver earlier, more sensitive, more comprehensive drug safety surveillance — and, ultimately, to protect their patients more effectively from preventable drug harm. AI pharmacovigilance is not a future aspiration. It is a present necessity.

REFERENCES

World Health Organization. The importance of pharmacovigilance: safety monitoring of medicinal products. Geneva: WHO; 2002. Available from: https://apps.who.int/medicinedocs/pdf/s4893e/s4893e.pdf
Vargesson N. Thalidomide-induced teratogenesis: history and mechanisms. Birth Defects Res C Embryo Today. 2015;105(2):140–156. doi:10.1002/bdrc.21096
Uppsala Monitoring Centre. VigiBase: the WHO global ICSR database. Uppsala: UMC; 2023. Available from: https://www.who-umc.org/vigibase/
US Food and Drug Administration. FDA Adverse Event Reporting System (FAERS) public dashboard. Silver Spring: FDA; 2023. Available from: https://www.fda.gov/drugs/questions-and-answers-fdas-adverse-event-reporting-system-faers
Hauben M, Bate A. Decision support methods for the detection of adverse events in post-marketing data. Drug Discov Today. 2009;14(7-8):343–357. doi:10.1016/j.drudis.2008.12.012
Hazell L, Shakir SA. Under-reporting of adverse drug reactions: a systematic review. Drug Saf. 2006;29(5):385–396. doi:10.2165/00002018-200629050-00003
Harpaz R, DuMouchel W, LePendu P, Bauer-Mehren A, Ryan P, Shah NH. Performance of pharmacovigilance signal-detection algorithms for the FDA adverse event reporting system. Clin Pharmacol Ther. 2013;93(6):539–546. doi:10.1038/clpt.2013.24
Cowen L, Ideker T, Raphael BJ, Sharan R. Network propagation: a universal amplifier of genetic associations. Nat Rev Genet. 2017;18(9):551–562.
European Medicines Agency. Artificial intelligence in regulatory science. Amsterdam: EMA; 2019.
US Food and Drug Administration. Artificial intelligence and machine learning in software as a medical device. Silver Spring: FDA; 2021.
International Council for Harmonisation. ICH E2E guideline: pharmacovigilance planning. Geneva: ICH; 2004.
Strom BL, Kimmel SE, Hennessy S, editors. Pharmacoepidemiology. 5th ed. Oxford: Wiley-Blackwell; 2012.
Council for International Organisations of Medical Sciences. Practical aspects of signal detection in pharmacovigilance: CIOMS Working Group VIII report. Geneva: CIOMS; 2010.
Evans SJ, Waller PC, Davis S. Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports. Pharmacoepidemiol Drug Saf. 2001;10(6):483–486.
Sarker A, Ginn R, Nikfarjam A, O’Connor K, Smith K, Jayaraman S, et al. Utilizing social media data for pharmacovigilance: a review. J Biomed Inform. 2015;54:202–212.
LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–444. doi:10.1038/nature14539
Caster O, Sandberg L, Bergvall T, Watson S, Nordin K. vigiRank for statistical signal detection in pharmacovigilance: first results from prospective real-world use. Pharmacoepidemiol Drug Saf. 2017;26(8):1006–1010.
Doan S, Yang EW, Tilak SS, Li PW, Zisook DS, Bhatt DL. Extracting health-related causality from Twitter messages using natural language processing. BMC Med Inform Decis Mak. 2019;19(Suppl 3):79.
Goodfellow I, Bengio Y, Courville A. Deep learning. Cambridge: MIT Press; 2016.
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–180. doi:10.1038/s41586-023-06291-2
Nickel M, Murphy K, Tresp V, Gabrilovich E. A review of relational machine learning for knowledge graphs. Proc IEEE. 2016;104(1):11–33.
Goldberg Y. Neural network methods for natural language processing. Synth Lect Hum Lang Technol. 2017;10(1):1–309.
Staudenmayer KL, Schecter WP, Sagebiel RW. Statistical methods for imbalanced data in clinical studies. Stat Med. 2013;32(15):2677–2690.
Caster O, Norén GN, Madigan D, Bate A. Large-scale regression-based pattern discovery: the example of screening the WHO global drug safety database. Stat Anal Data Min. 2010;3(4):197–208.
Norgeot B, Recruited Glicksberg BS, Butte AJ. A call for deep-learning healthcare. Nat Med. 2019;25(1):14–15.
Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;17(01):128–144.
Weissenbacher D, Sarker A, Magge A, Nardinelli F, Farber D, Gonzalez-Hernandez G, et al. Overview of the fourth social media mining for health (SMM4H) shared tasks at ACL 2019. Proc 4th Soc Media Min Health Workshop. 2019:21–30.
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020;36(4):1234–1240. doi:10.1093/bioinformatics/btz682
Jiang G, Liu H, Solbrig HR, Chute CG. Mining pharmacovigilance signals from large health systems EMR data using NLP and machine learning. AMIA Annu Symp Proc. 2012;2012:402–411.
Cocos A, Fiks AG, Masino AJ. Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts. J Am Med Inform Assoc. 2017;24(4):813–821. doi:10.1093/jamia/ocw180
Harpaz R, Vilar S, Dumouchel W, Salmasian H, Haerian K, Shah NH, et al. Combing signals from spontaneous reports and electronic health records for detection of adverse drug reactions. J Am Med Inform Assoc. 2013;20(3):413–419.
Zhu J, Yin Y, Qin C, Chen H, Liu S, Zheng J. Drug side-effect prediction through deep learning on heterogeneous biomedical information networks. J Biomed Inform. 2021;122:103890.
Singhal K, Tu T, Gottweis J, Sayres R, Wulczyn E, Hou L, et al. Towards expert-level medical question answering with large language models. arXiv:2305.09617. 2023.
Luo R, Sun L, Xia Y, Qin T, Zhang S, Poon H, et al. BioGPT: generative pre-trained transformer for biomedical text generation and mining. Brief Bioinform. 2022;23(6):bbac409. doi:10.1093/bib/bbac409
Sarker A, Gonzalez G. Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J Biomed Inform. 2015;53:196–207.
US Food and Drug Administration. The Sentinel System: protecting public health. Silver Spring: FDA; 2022. Available from: https://www.sentinelinitiative.org
World Health Organization. WHO signal management: a practical guide. Geneva: WHO; 2015.
European Medicines Agency. Reflection paper on the use of artificial intelligence (AI) in the medicinal product lifecycle. Amsterdam: EMA; 2020. EMA/CHMP/813220/2019.
Banda JM, Evans L, Vanguri RS, Tatonetti NP, Ryan PB, Shah NH. A curated and standardized adverse drug event resource to accelerate drug safety research. Sci Data. 2016;3(1):160026.
WHO Collaborating Centre for Drug Statistics Methodology. ATC/DDD Index. Oslo: WHOCC; 2023.
Boland MR, Shahn Z, Madigan D, Hripcsak G, Tatonetti NP. Birth month affects lifetime disease risk: a phenome-wide method. J Am Med Inform Assoc. 2015;22(5):1042–1053.
Khosrow-Khavar F, Bouganim N, Filion KB, Suissa S. Machine learning algorithms for pharmacovigilance of immune checkpoint inhibitor toxicities. Pharmacoepidemiol Drug Saf. 2021;30(9):1137–1147.
Johnson DB, Balko JM, Compton ML, Chalkias S, Gorham J, Xu Y, et al. Fulminant myocarditis with combination immune checkpoint blockade. N Engl J Med. 2016;375(18):1749–1755.
Moride Y, Abenhaim L. Evidence of the depletion of susceptibles effect in non-experimental pharmacoepidemiologic research. J Clin Epidemiol. 1994;47(7):731–737.
Relling MV, Evans WE. Pharmacogenomics in the clinic. Nature. 2015;526(7573):343–350. doi:10.1038/nature15817
Bush WS, Moore JH. Chapter 11: genome-wide association studies. PLoS Comput Biol. 2012;8(12):e1002822.
Gagne JJ, Rassen JA, Choudhry NK, Bohn RL, Hennessy S, Glynn RJ, et al. Near-real-time drug safety signal detection and evaluation using clinical data. Pharmacoepidemiol Drug Saf. 2012;21(11):1202–1210.
Tatonetti NP, Ye PP, Daneshjou R, Altman RB. Data-driven prediction of drug effects and interactions. Sci Transl Med. 2012;4(125):125ra31. doi:10.1126/scitranslmed.3003377
Polack FP, Thomas SJ, Kitchin N, Absalon J, Gurtman A, Lockhart S, et al. Safety and efficacy of the BNT162b2 mRNA COVID-19 vaccine. N Engl J Med. 2020;383(27):2603–2615.
Hauben M, Bate A. Data mining methods in pharmacovigilance. Pharmacoepidemiol Drug Saf. 2009;18(8):755–765.
51. Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13(6):395–405.
Arrieta AB, Díaz-Rodríguez N, Del Ser J, Bennetot A, Tabik S, Barbado A, et al. Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inf Fusion. 2020;58:82–115.
US Food and Drug Administration. Artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD) action plan. Silver Spring: FDA; 2021.
Vayena E, Dzenowagis J, Brownstein JS, Sheikh A. Policy implications of big data in the health sector. Bull World Health Organ. 2018;96(1):66–68.
Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447–453.
Dwork C, Roth A. The algorithmic foundations of differential privacy. Found Trends Theor Comput Sci. 2014;9(3–4):211–407.
European Commission. Proposal for a regulation of the European Parliament and of the Council on the Union code relating to medicinal products for human use. Brussels: EC; 2023.
Norén GN, Bergvall T, Ryan PB, Juhlin K, Schuemie MJ, Madigan D. Empirical performance of the calibrated self-controlled cohort analysis within temporal pattern discovery: lessons for developing a risk identification and analysis system. Drug Saf. 2013;36(Suppl 1):S107–S121.
World Health Organization. WHO programme for international drug monitoring. Geneva: WHO; 2022. Available from: https://www.who.int/teams/regulation-prequalification/regulation-and-safety/pharmacovigilance
Medicines and Healthcare products Regulatory Agency. MHRA artificial intelligence strategy 2021–2023. London: MHRA; 2021.
Pharmaceuticals and Medical Devices Agency Japan. Annual report on pharmacovigilance activities. Tokyo: PMDA; 2022.
Health Canada. Artificial intelligence and machine learning in health products. Ottawa: Health Canada; 2023.
Rieke N, Hancox J, Li W, Milletari F, Roth HR, Albarqouni S, et al. The future of digital health with federated learning. NPJ Digit Med. 2020;3(1):119. doi:10.1038/s41746-020-00323-1
European Commission. European Health Data Space: regulation. Brussels: EC; 2022.
Tucker A, Wang Z, Rotalinti Y, Myles P. Generating high-fidelity synthetic patient data for assessing machine learning healthcare software. NPJ Digit Med. 2020;3(1):147.
Bent B, Wang K, Grzesiak E, Jiang C, Qi Y, Jiang Y, et al. The digital biomarker discovery pipeline: an open-source software platform for the development of digital biomarkers using mHealth and wearables data. J Clin Transl Sci. 2021;5(1):e19.
Perez MV, Mahaffey KW, Hedlin H, Rumsfeld JS, Garcia A, Ferris T, et al. Large-scale assessment of a smartwatch to identify atrial fibrillation. N Engl J Med. 2019;381(20):1909–1917.
Liang H, Tsui BY, Ni H, Valentim CC, Baxter SL, Liu G, et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med. 2019;25(3):433–438.
Price WN, Cohen IG. Privacy in the age of medical big data. Nat Med. 2019;25(1):37–43.
Meyboom RH, Lindquist M, Egberts AC, Edwards IR. Signal selection and follow-up in pharmacovigilance. Drug Saf. 2002;25(6):459–465.
World Health Organization. Ethics and governance of artificial intelligence for health: WHO guidance. Geneva: WHO; 2021.
Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. doi:10.1038/s41591-018-0300-7

Reference

World Health Organization. The importance of pharmacovigilance: safety monitoring of medicinal products. Geneva: WHO; 2002. Available from: https://apps.who.int/medicinedocs/pdf/s4893e/s4893e.pdf
Vargesson N. Thalidomide-induced teratogenesis: history and mechanisms. Birth Defects Res C Embryo Today. 2015;105(2):140–156. doi:10.1002/bdrc.21096
Uppsala Monitoring Centre. VigiBase: the WHO global ICSR database. Uppsala: UMC; 2023. Available from: https://www.who-umc.org/vigibase/
US Food and Drug Administration. FDA Adverse Event Reporting System (FAERS) public dashboard. Silver Spring: FDA; 2023. Available from: https://www.fda.gov/drugs/questions-and-answers-fdas-adverse-event-reporting-system-faers
Hauben M, Bate A. Decision support methods for the detection of adverse events in post-marketing data. Drug Discov Today. 2009;14(7-8):343–357. doi:10.1016/j.drudis.2008.12.012
Hazell L, Shakir SA. Under-reporting of adverse drug reactions: a systematic review. Drug Saf. 2006;29(5):385–396. doi:10.2165/00002018-200629050-00003
Harpaz R, DuMouchel W, LePendu P, Bauer-Mehren A, Ryan P, Shah NH. Performance of pharmacovigilance signal-detection algorithms for the FDA adverse event reporting system. Clin Pharmacol Ther. 2013;93(6):539–546. doi:10.1038/clpt.2013.24
Cowen L, Ideker T, Raphael BJ, Sharan R. Network propagation: a universal amplifier of genetic associations. Nat Rev Genet. 2017;18(9):551–562.
European Medicines Agency. Artificial intelligence in regulatory science. Amsterdam: EMA; 2019.
US Food and Drug Administration. Artificial intelligence and machine learning in software as a medical device. Silver Spring: FDA; 2021.
International Council for Harmonisation. ICH E2E guideline: pharmacovigilance planning. Geneva: ICH; 2004.
Strom BL, Kimmel SE, Hennessy S, editors. Pharmacoepidemiology. 5th ed. Oxford: Wiley-Blackwell; 2012.
Council for International Organisations of Medical Sciences. Practical aspects of signal detection in pharmacovigilance: CIOMS Working Group VIII report. Geneva: CIOMS; 2010.
Evans SJ, Waller PC, Davis S. Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports. Pharmacoepidemiol Drug Saf. 2001;10(6):483–486.
Sarker A, Ginn R, Nikfarjam A, O’Connor K, Smith K, Jayaraman S, et al. Utilizing social media data for pharmacovigilance: a review. J Biomed Inform. 2015;54:202–212.
LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–444. doi:10.1038/nature14539
Caster O, Sandberg L, Bergvall T, Watson S, Nordin K. vigiRank for statistical signal detection in pharmacovigilance: first results from prospective real-world use. Pharmacoepidemiol Drug Saf. 2017;26(8):1006–1010.
Doan S, Yang EW, Tilak SS, Li PW, Zisook DS, Bhatt DL. Extracting health-related causality from Twitter messages using natural language processing. BMC Med Inform Decis Mak. 2019;19(Suppl 3):79.
Goodfellow I, Bengio Y, Courville A. Deep learning. Cambridge: MIT Press; 2016.
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–180. doi:10.1038/s41586-023-06291-2
Nickel M, Murphy K, Tresp V, Gabrilovich E. A review of relational machine learning for knowledge graphs. Proc IEEE. 2016;104(1):11–33.
Goldberg Y. Neural network methods for natural language processing. Synth Lect Hum Lang Technol. 2017;10(1):1–309.
Staudenmayer KL, Schecter WP, Sagebiel RW. Statistical methods for imbalanced data in clinical studies. Stat Med. 2013;32(15):2677–2690.
Caster O, Norén GN, Madigan D, Bate A. Large-scale regression-based pattern discovery: the example of screening the WHO global drug safety database. Stat Anal Data Min. 2010;3(4):197–208.
Norgeot B, Recruited Glicksberg BS, Butte AJ. A call for deep-learning healthcare. Nat Med. 2019;25(1):14–15.
Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;17(01):128–144.
Weissenbacher D, Sarker A, Magge A, Nardinelli F, Farber D, Gonzalez-Hernandez G, et al. Overview of the fourth social media mining for health (SMM4H) shared tasks at ACL 2019. Proc 4th Soc Media Min Health Workshop. 2019:21–30.
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020;36(4):1234–1240. doi:10.1093/bioinformatics/btz682
Jiang G, Liu H, Solbrig HR, Chute CG. Mining pharmacovigilance signals from large health systems EMR data using NLP and machine learning. AMIA Annu Symp Proc. 2012;2012:402–411.
Cocos A, Fiks AG, Masino AJ. Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts. J Am Med Inform Assoc. 2017;24(4):813–821. doi:10.1093/jamia/ocw180
Harpaz R, Vilar S, Dumouchel W, Salmasian H, Haerian K, Shah NH, et al. Combing signals from spontaneous reports and electronic health records for detection of adverse drug reactions. J Am Med Inform Assoc. 2013;20(3):413–419.
Zhu J, Yin Y, Qin C, Chen H, Liu S, Zheng J. Drug side-effect prediction through deep learning on heterogeneous biomedical information networks. J Biomed Inform. 2021;122:103890.
Singhal K, Tu T, Gottweis J, Sayres R, Wulczyn E, Hou L, et al. Towards expert-level medical question answering with large language models. arXiv:2305.09617. 2023.
Luo R, Sun L, Xia Y, Qin T, Zhang S, Poon H, et al. BioGPT: generative pre-trained transformer for biomedical text generation and mining. Brief Bioinform. 2022;23(6):bbac409. doi:10.1093/bib/bbac409
Sarker A, Gonzalez G. Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J Biomed Inform. 2015;53:196–207.
US Food and Drug Administration. The Sentinel System: protecting public health. Silver Spring: FDA; 2022. Available from: https://www.sentinelinitiative.org
World Health Organization. WHO signal management: a practical guide. Geneva: WHO; 2015.
European Medicines Agency. Reflection paper on the use of artificial intelligence (AI) in the medicinal product lifecycle. Amsterdam: EMA; 2020. EMA/CHMP/813220/2019.
Banda JM, Evans L, Vanguri RS, Tatonetti NP, Ryan PB, Shah NH. A curated and standardized adverse drug event resource to accelerate drug safety research. Sci Data. 2016;3(1):160026.
WHO Collaborating Centre for Drug Statistics Methodology. ATC/DDD Index. Oslo: WHOCC; 2023.
Boland MR, Shahn Z, Madigan D, Hripcsak G, Tatonetti NP. Birth month affects lifetime disease risk: a phenome-wide method. J Am Med Inform Assoc. 2015;22(5):1042–1053.
Khosrow-Khavar F, Bouganim N, Filion KB, Suissa S. Machine learning algorithms for pharmacovigilance of immune checkpoint inhibitor toxicities. Pharmacoepidemiol Drug Saf. 2021;30(9):1137–1147.
Johnson DB, Balko JM, Compton ML, Chalkias S, Gorham J, Xu Y, et al. Fulminant myocarditis with combination immune checkpoint blockade. N Engl J Med. 2016;375(18):1749–1755.
Moride Y, Abenhaim L. Evidence of the depletion of susceptibles effect in non-experimental pharmacoepidemiologic research. J Clin Epidemiol. 1994;47(7):731–737.
Relling MV, Evans WE. Pharmacogenomics in the clinic. Nature. 2015;526(7573):343–350. doi:10.1038/nature15817
Bush WS, Moore JH. Chapter 11: genome-wide association studies. PLoS Comput Biol. 2012;8(12):e1002822.
Gagne JJ, Rassen JA, Choudhry NK, Bohn RL, Hennessy S, Glynn RJ, et al. Near-real-time drug safety signal detection and evaluation using clinical data. Pharmacoepidemiol Drug Saf. 2012;21(11):1202–1210.
Tatonetti NP, Ye PP, Daneshjou R, Altman RB. Data-driven prediction of drug effects and interactions. Sci Transl Med. 2012;4(125):125ra31. doi:10.1126/scitranslmed.3003377
Polack FP, Thomas SJ, Kitchin N, Absalon J, Gurtman A, Lockhart S, et al. Safety and efficacy of the BNT162b2 mRNA COVID-19 vaccine. N Engl J Med. 2020;383(27):2603–2615.
Hauben M, Bate A. Data mining methods in pharmacovigilance. Pharmacoepidemiol Drug Saf. 2009;18(8):755–765.
51. Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13(6):395–405.
Arrieta AB, Díaz-Rodríguez N, Del Ser J, Bennetot A, Tabik S, Barbado A, et al. Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI. Inf Fusion. 2020;58:82–115.
US Food and Drug Administration. Artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD) action plan. Silver Spring: FDA; 2021.
Vayena E, Dzenowagis J, Brownstein JS, Sheikh A. Policy implications of big data in the health sector. Bull World Health Organ. 2018;96(1):66–68.
Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447–453.
Dwork C, Roth A. The algorithmic foundations of differential privacy. Found Trends Theor Comput Sci. 2014;9(3–4):211–407.
European Commission. Proposal for a regulation of the European Parliament and of the Council on the Union code relating to medicinal products for human use. Brussels: EC; 2023.
Norén GN, Bergvall T, Ryan PB, Juhlin K, Schuemie MJ, Madigan D. Empirical performance of the calibrated self-controlled cohort analysis within temporal pattern discovery: lessons for developing a risk identification and analysis system. Drug Saf. 2013;36(Suppl 1):S107–S121.
World Health Organization. WHO programme for international drug monitoring. Geneva: WHO; 2022. Available from: https://www.who.int/teams/regulation-prequalification/regulation-and-safety/pharmacovigilance
Medicines and Healthcare products Regulatory Agency. MHRA artificial intelligence strategy 2021–2023. London: MHRA; 2021.
Pharmaceuticals and Medical Devices Agency Japan. Annual report on pharmacovigilance activities. Tokyo: PMDA; 2022.
Health Canada. Artificial intelligence and machine learning in health products. Ottawa: Health Canada; 2023.
Rieke N, Hancox J, Li W, Milletari F, Roth HR, Albarqouni S, et al. The future of digital health with federated learning. NPJ Digit Med. 2020;3(1):119. doi:10.1038/s41746-020-00323-1
European Commission. European Health Data Space: regulation. Brussels: EC; 2022.
Tucker A, Wang Z, Rotalinti Y, Myles P. Generating high-fidelity synthetic patient data for assessing machine learning healthcare software. NPJ Digit Med. 2020;3(1):147.
Bent B, Wang K, Grzesiak E, Jiang C, Qi Y, Jiang Y, et al. The digital biomarker discovery pipeline: an open-source software platform for the development of digital biomarkers using mHealth and wearables data. J Clin Transl Sci. 2021;5(1):e19.
Perez MV, Mahaffey KW, Hedlin H, Rumsfeld JS, Garcia A, Ferris T, et al. Large-scale assessment of a smartwatch to identify atrial fibrillation. N Engl J Med. 2019;381(20):1909–1917.
Liang H, Tsui BY, Ni H, Valentim CC, Baxter SL, Liu G, et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med. 2019;25(3):433–438.
Price WN, Cohen IG. Privacy in the age of medical big data. Nat Med. 2019;25(1):37–43.
Meyboom RH, Lindquist M, Egberts AC, Edwards IR. Signal selection and follow-up in pharmacovigilance. Drug Saf. 2002;25(6):459–465.
World Health Organization. Ethics and governance of artificial intelligence for health: WHO guidance. Geneva: WHO; 2021.
Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. doi:10.1038/s41591-018-0300-7

Ashka Soni

Corresponding author

PharmD, Parul Institute of Pharmacy and Research, Parul University, Vadodara, Gujrat

Mohd Sakir

Co-author

Delhi Institute of Pharmaceutical Sciences and Research, New Delhi

Mariya Sultana

Co-author

PharmD, Osmania University, Hyderabad

Arpan Chowdhury

Co-author

B. Tech, Biotechnology, Department of Biotechnology, SRM Institute of Science and Technology, Chennai

Hiba Irshad

Co-author

BSc., Clinical Research, Department of Translation and Clinical Research, Jamia Hamdard, New Delhi

Pritam Sinhaa

Co-author

M. Pharm, Institute of Pharmacy, Nirma University, Ahmedabad

Krishna M.

Co-author

B. Pharm, KVM College of Pharmacy, KUHS

Ashka Soni*, Mohd Sakir, Mariya Sultana, Arpan Chowdhury, Hiba Irshad, Pritam Sinhaa, Krishna M., AI-Enhanced Signal Detection in Pharmacovigilance: Methods and Clinical Implications in Drug Safety a Comprehensive Secondary Review with Methodological Analysis, Int. J. of Pharm. Sci., 2026, Vol 4, Issue 5, 6151-6173. https://doi.org/10.5281/zenodo.20352680

View Article

AI-Enhanced Signal Detection in Pharmacovigilance: Methods and Clinical Implications in Drug Safety a Comprehensive Secondary Review with Methodological Analysis

Abstract

Keywords

Introduction

Reference

Ashka Soni

Mohd Sakir

Mariya Sultana

Arpan Chowdhury

Hiba Irshad

Pritam Sinhaa

Krishna M.

More related articles

Formulation, Optimization, And Comparative Evaluat...

A Review of The Antidiabetic Properties of The Med...

UV Method Development Using Multicomponent Analysi...

View more

A Review of Phytochemical, Psychopharmacological Activity of Cleome Viscosa Linn...

In-Vitro Free Radical Scavenging Efficacy of Flemingia Strobilifera: A Pilot Inv...

Stability Indicating Method for Pazopanib Hydrochloride by UV Visible Spectrosco...

View more

Related Articles

Formulation and Evaluation of Tioconazole Liposomal Gel for Enhanced Topical Ant...

Analytical Method Development and Validation of Rimegepant by Using RP-HPLC ...

Formulation and Evaluation of Papaya Leaf Extract Syrup for Dengue Treatment ...

Development and Evaluation of Ganoderma lucidum Polysaccharide-Based Floating Ga...

Formulation, Optimization, And Comparative Evaluation of Voriconazole-Loaded Tra...

More related articles

Formulation, Optimization, And Comparative Evaluation of Voriconazole-Loaded Tra...

A Review of The Antidiabetic Properties of The Medicine Plants Found in the Sikk...

UV Method Development Using Multicomponent Analysis...

View more

Formulation, Optimization, And Comparative Evaluation of Voriconazole-Loaded Tra...

A Review of The Antidiabetic Properties of The Medicine Plants Found in the Sikk...

UV Method Development Using Multicomponent Analysis...

View more