Vidyabharti College of Pharmacy, Amravati, Maharashtra, India
Based on this, the review proposes a holistic quality assurance ecosystem necessary for the safe and trustworthy deployment of artificial intelligence capsule systems from manufacturing to real-world use. AI-enabled ingestible capsules transform non-invasive diagnostics of the GI tract by realizing 90-99?curacy in lesion detection; yet, at the same time, they present unprecedented challenges related to quality assurance, involving hardware reliability, algorithmic non-determinism, data security, and clinical integration. Synthesis is supported by recent literature on hardware in capsule endoscopy, AI validation protocols, regulatory frameworks such as FDA SaMD, ISO 13485, and IMDRF GMLP, risk management, and emerging multi-sensor technologies. The lack of unified standards for edge AI, continuous learning, and therapeutic autonomy keeps fragmented today's QA approaches. Several preclinical phantoms, multi-center trials, and post-market registries have demonstrated feasibility but underlined the lack of harmonization across jurisdictions. It advocates for the implementation of a unified QA ecosystem: from hardware controls down to the trustworthiness of AI, ethical governance, and adaptive regulation, a critically required multi-stakeholder collaboration lays the foundation for establishing global benchmarks for explainable multimodal models and human-AI symbiosis paradigms that realize full potential for autonomous GI diagnostics.
Wireless capsule endoscopy (WCE), brought into existence in the year 2001, has changed the entire landscape of gastrointestinal assessments by providing a non-invasive technique of visualization of the small bowel for more than 2.4 million patients annually across the globe. The recent integration of artificial intelligence into this modality has significantly heightened the sensitivity of lesion detection, which now takes only 2-5 minutes compared to 30-60 minutes earlier while still having a sensitivity of 90-99% for bleeding lesions and Crohn’s ulcers.[1]
Quality assurance is also important for each medical device for diagnostic purposes. The new challenges related to QA include algorithmic drift, decision-making with a 'black box,' and hardware-software dependencies, which are not yet considered in the current endoscopy quality assurance criteria. Artificial intelligence-based capsules face new challenges related to QA, such as algorithmic 'drift,' 'black box' decision-making, whereby the decision cannot be traced, and hardware-software dependencies. These challenges have opened new areas in the field of endoscopy.[2]
Current literature emphasizes isolated issues such as AI performance, hardware quality, or general capsules quality scores, without holistic QA systems that encompass the entire process from the manufacturing phase to post-market surveillance for AI ingestible devices.[3]
This review tries to offer an overall QA environment for AI-based ingestible capsules, combining strategies for validation, risk management, and regulatory compliance. Capsule technology is explained in Section 2, while the rest of the sections, up to Section 9, cover the implementation hurdles, current regulatory environment, and future scope. [4]
Fig.1. Quality Assurance Ecosystem Framework for AI-Enabled Ingestible Capsules in Gastrointestinal Diagnostics.
OVERVIEW OF AI ENABLED INGESTIBLE CAPSULE
"The following section provides a summary on the technological and clinical foundations for AI-powered, ingestible capsules that would be the foundation for future quality-related conversations."
The AI-assisted ingestion capsules are a fusion of miniaturized imaging equipment and sophisticated software algorithms that help in non-invasive imaging and analysis of the human gastrointestinal tract. These are based upon traditional wireless capsules but now involve artificial intelligence that increases sensitivity and quickens reading times.[1,5]
Capsule Hardware Components and Functionality
Most capsules are about 11×26 mm and commonly contain an optical lens, a CMOS image sensor, white LEDs for illumination, button batteries, a dedicated ASIC, RF transmitter, and an antenna on board within a biocompatible case. This ASIC controls, among others, the image acquisition, on-capsule processing, power management, and wireless data transmission to receivers outside the human body. It has a typical battery life in an order of approximately 8 hours. Frame rates typically vary between 2 and 6 frames per second depending on capsule motion. More recent “intelligent” capsule designs integrate various sensors like pH, pressure, and gas, among others, and feature advanced functionalities enabling active locomotion by magnetism or robotics, energy harvesting by wireless transfer, and hybrid localization for enhanced navigation and diagnostic coverage.[5,6,7]
AI Algorithms applied in Gastrointestinal Diagnostics
In the case of capsule endoscopy, the state of the art in AI solutions is currently dominated by deep learning, especially convolutional neural networks for the detection of small-bowed lesions like bleeding, ulcers, erosions, vascular malformations, polyps, and tumors, with more complex designs involving object detection networks and the newest vision transformation models capable of multiple lesion detection, bowel preparation evaluation, and the prioritization of the most important frames, with sensitivity and specificity values above 90% in research studies being reported. These systems can serve as Computer-Aided Detection (CADe) software for pointing out suspicious frames or Computer-Aided Detection (CADx) software for lesion typing and severity, with current efforts focusing on the application of Explainable AI for better transparency of the predictive process for the clinician.[8,9]
Current Clinical Applications and Impact
In clinical contexts, capsule endoscopy is mainly employed in the setting of suspected small bowel bleeding and in Crohn’s disease. As a reference standard in mucosal examination in these conditions, AI-assisted capsule endoscopy has already proved beneficial in increasing the detection rates while shortening the time required for reading the data. For the other parts of the alimentary tract, the utilization of AI in colon capsule endoscopy examination, panorama endoscopic examination, and quality scoring is being assessed. This has wide implications in the screening and management of GI disorders as AI-based capsule endoscopies become mainstream.[3,11]
These are the technological and clinical attributes that determine the quality assurance needs which will be dealt with in the next sections of this review.[2]
QUALITY ASSURANCE FUNDAMENTALS
The quality assurance for medical devices includes the systematic approach towards ensuring the performance, safety, and reliability of the various devices at all stages, from design to the surveillance stages.[2]
Principles of QA in Medical Devices
Core principles include risk-based thinking, process validation, traceability, and continuous improvement, as outlined in ISO 13485 which mandates documented procedures for design controls, supplier qualification, and adverse event reporting in device manufacturing. These principles ensure traceability from capsule manufacturing to clinical outcomes, critical for post-market surveillance of AI performance drift. They adapt traditional manufacturing QA to diagnostic devices by emphasizing clinical utility metrics like diagnostic accuracy (>90% sensitivity reported for AI capsules) alongside physical durability.[2]
Unique QA Challenges for AI-Enabled Ingestible Capsules
AI introduces non-deterministic behavior, algorithmic bias (up to 15% variation across demographics), and "black box" opacity, exacerbated by physiological variables including GI transit unpredictability, battery failure risks (2-5% retention rates), and variable image quality from bowel preparation. Unlike static devices, capsules require real-time QA for wireless data integrity and locomotion reliability, with edge AI processing demanding onboard validation absent in conventional endoscopy.[1]
Relevant Frameworks and Standards
These frameworks provide the foundation for a cohesive QA ecosystem, bridging hardware reliability with AI trustworthiness essential for clinical adoption.[2]
KEY QUALITY ASSURANCE COMPONENTS IN AI CAPSULE SYSTEMS
The essential QA elements must be considered from a hardware perspective, a perspective of artificial intelligence model development, a perspective of data flow, and from a perspective of clinical practice, where they must be viewed as an entire system to ensure proper capsule function.[1]
1. Hardware Quality Controls
Hardware QA ensures the capsule is mechanically safe, reliable, and capable of capturing diagnostic?grade data throughout GI transit.
2. AI Algorithm Validation
AI validation confirms that capsule-generated interpretations are accurate, robust, and clinically trustworthy.
3. Data Management
Data QA secures the full pipeline from image capture to reporting, ensuring integrity, security, and regulatory compliance.
4. Clinical Workflow Integration
QA also concerns how the AI capsule fits into daily practice, from reading to remote follow?up.
VALIDATION AND VERIFICATION STRATEGIES
Validation and verification ensure that AI-enabled capsule systems are safe, effective, and maintain performance across their lifecycle, from preclinical experiments to real world use.[5]
Preclinical Testing Methods :- Preclinical validation typically starts with benchtop and phantom studies, where GI tract mimicking models (straight and curved paths, variable friction, and fluids) are used to test capsule locomotion, orientation stability, and image quality under controlled conditions. Animal models, especially porcine GI tracts, are employed to evaluate in vivo safety, transit behavior, retention risk, and feasibility of imaging or sensing, including specialized capsules such as ultrasound or thermometric devices.[19,20,21]
Clinical Trials Design for Efficacy and Safety :- Prospective clinical trials usually compare AI-assisted capsule reading with standard expert reading as a reference, focusing on endpoints such as lesion detection rate, per?patient sensitivity and specificity, reading time, and adverse event rates (retention, aspiration, obstruction). Multicenter designs and adequately powered sample sizes enable assessment of generalizability across different populations and centers, while subanalyses examine performance across bowel preparation quality, lesion types, and indications such as suspected small?bowel bleeding or Crohn’s disease.[3,22]
Post-Market Performance Monitoring and Real-World Evidence :- After approval, real?world evidence is gathered through registries, routine clinical data, and post?market surveillance systems that track diagnostic yield, complication rates, and device or AI malfunctions over time. Real?world monitoring also supports detection of performance drift, rare failure modes, and biases that may not appear in trials, prompting periodic re?validation or updates to labeling, user training, or technical controls.[23,24,25]
Continual Learning and Software Updates Under QA Frameworks :- Adaptive or continually learning AI models pose regulatory challenges because performance can change after deployment, leading agencies such as the FDA and MHRA to propose “predetermined change control plans” and lifecycle oversight for AI?SaMD. Within a QA ecosystem, any model updates should follow a controlled process: versioning, pre?release validation on locked datasets, cybersecure deployment, and post?update monitoring with rollback capability if predefined safety or accuracy thresholds are not met.[23,24,25]
RISK MANAGEMENT AND ETHICAL CONSIDERATIONS
Risk management in AI capsule systems must address technical failure modes and broader ethical, legal, and social implications to ensure safe, fair, and trustworthy use in GI diagnostics.[5]
Failure Mode Effects Analysis (FMEA) for AI Capsule Systems
FMEA can be applied across the capsule pathway (prescription, ingestion, recording, AI reading, reporting) to systematically list potential failures, such as capsule retention, battery depletion, wireless link loss, mislabeling of critical lesions, or incorrect report generation, and score each by severity, occurrence, and detectability to derive a risk priority number (RPN). High?RPN items (e.g., missed active bleeding due to AI misclassification or unencrypted cloud upload failure) are then targeted with controls such as redundant alarms, human over?read policies, standardized escalation pathways, and equipment maintenance or recall procedures adapted from endoscopy FMEA case studies.[26,27]
Cybersecurity Risk Assessments
AI capsule systems create an Internet of Medical Things chain—from ingestible device to body?worn recorder, hospital network, and cloud platform—that is vulnerable to eavesdropping, spoofing, ransomware, and unauthorized access if not properly secured. Cybersecurity risk assessment requires threat modeling (assets, attack vectors, attacker motives), enforcement of end?to?end encryption, strong authentication, role based access control, patch management, and incident response plans consistent with guidance for connected medical devices and cloud based capsule platforms.[18,28,29]
Addressing AI Bias, Transparency, and Trustworthiness
Capsule endoscopy AI models may perpetuate disparity when trained on unrepresentative data, resulting in variances in performance based on gender, age, or ethnic origin, which have been earmarked as areas that demand priority attention in current consensus statements in GI endoscopy. Trustworthy AI development necessitates diverse training sets, the provision of performance data for subgroups, the use of probability output scales, and the application of transparency aids such as graphical and textbased descriptions of data sets and limitations, facilitating the practitioner in deciding when to follow or override AI algorithm recommendations and preventing automation bias and algorithm dependence.[30,31]
Patient Consent, Data Ethics, and Regulatory Compliance
The ethical use of AI capsules requires specific, informed consent regarding not only the capsule process but also AI engagement, cloud storage, and secondary use of de-identified data to enhance models that patients consider complex, as found in recent studies of consent. AI use of data also requires consistency with GDPR, HIPAA, and developing regulations regarding AI, which relate to data minimization, limiting purposes of processing, conducting Data Protection Impact Assessments (DPIA), and recognizing patient rights to access, correct, and erase data, as found to present substantial legal and reputational risks to trust in AI-assisted diagnoses.[32,33]
REGULATORY LANDSCAPE AND STANDARDIZATION EFFORTS
AI-enabled capsule endoscopy is at the crossroads between the regulation of ingestible medical devices and Software as a Medical Device, and most countries are still using existing device and software regulations, rather than developing specific regulations concerning capsules. The performed narrative reviews of SaMD in digestive healthcare underline that AI analysis software for standalone capsule images is considered to be a medical device in its own right, requiring evidence of clinical validity, robust data protection, and integration into existing endoscopy workflows. Recent reviews on capsule endoscopy and ingestive devices note that even though electromagnetic safety, radiofrequency transmission, and interoperability (e.g., with hospital PACS/EHR systems) are covered by traditional device and telecom standards, no global unified standard exists that specifically addresses AI-driven, wireless ingestible imaging platforms.[34,35,36]
Harmonization of quality assurance standards for AI?enabled capsules is therefore difficult, because regulators and professional societies use different performance metrics, validation thresholds, and expectations for external testing. Some reviews highlight that, unlike drugs, AI models for capsule endoscopy do not undergo a fully standardized training and validation framework; instead, manufacturers rely on heterogeneous datasets, endpoints, and study designs, which complicates comparisons and international homologation of devices. Authors argue that without common benchmarks and agreed definitions of clinically acceptable sensitivity, specificity, and miss rates for capsule AI, it is challenging to define universal QA indicators or to transfer approvals between regions.[1,37]
Emerging guidance is starting to bridge this gap: papers on SaMD in digestive healthcare and on AI in capsule endoscopy propose clear AI governance, with model documentation that is transparent, continuous post?market evaluation, and standardized reporting of algorithm updates within a regulated change?control framework. Current analyses of the regulatory environment for ingestible devices also point to adaptive processes and integration efforts from the perspective of regulatory bodies, industry, and practicing clinicians with a view to afford learning AI systems, performance data from the real world, and a new crop of versatile ingestible sensors new regulatory frameworks in the future while continuing safety and quality at the same high level. [8, 35, 38]
FUTURE DIRECTIONS AND RESEARCH NEEDS
Next-generation QA environments should be developed in concert with next-generation ingestive technologies, as well as guidelines that actively strive to ensure that efficacy and safety are maintained.[1,5]
Emerging Technologies
Edge AI enables onboard processing within capsules, reducing data transmission latency to milliseconds and enabling real-time lesion detection without cloud dependency, though this demands miniaturized computing with robust thermal and power management. Multi-sensor capsules integrating optical imaging with pH, pressure, temperature, and gas sensors promise comprehensive GI phenotyping, but require unified data fusion validation protocols to ensure sensor-AI interoperability. Autonomous therapeutic capsules with biopsy needles, drug delivery micropumps, or hemostasis mechanisms represent the horizon, necessitating QA frameworks that verify not only diagnostic accuracy but also interventional precision and sterility under uncontrolled GI conditions.[5,14,17,34]
Quality Assurance Challenges Posed by Evolving Capabilities
Edge and multi-modal AI introduce verification complexity, as models must perform reliably across heterogeneous sensor inputs, variable computational constraints, and physiological extremes without human oversight. Therapeutic autonomy amplifies risk profiles, demanding real-time safety interlocks, fail-safe expulsion mechanisms, and traceability for adverse events that span diagnosis-to-treatment sequences. Continual learning exacerbates regulatory uncertainty, as post-deployment adaptation challenges the "locked algorithm" paradigm central to current SaMD approvals.[5,23,38,39]
Recommendations for Standardized QA Frameworks and Collaborative Research
Create international standards benchmarks (e.g., multi-sensor GI atlas maps with expert consensus annotations), performance levels graded according to clinical risk to support replicable validation across all devices. Develop intelligent QA systems based on Predetermined Change Control Plans (PCCPs), with autonomous drift detection and rollback strategies. Mobilize global consortia involving all stakeholders—regulators (IMDRF), industry, healthcare professionals, and patients—to collaboratively develop standards for ingestible AI solutions, with high priority to real-world evidence registries stratified according to device “generation” and patient characteristics. Finally, high priority should be assigned to advancing research on explainable fusion approaches and paradigms involving collaboration between humans and AI agents to retain clinician control with autonomous capsule systems.[31,40]
CONCLUSION
This review demonstrates that while AI-enabled ingestible capsules achieve impressive diagnostic performance (90-99% sensitivity/specificity), their safe clinical translation demands a comprehensive quality assurance ecosystem spanning hardware reliability, algorithm validation, data integrity, clinical workflows, risk management, and adaptive regulatory compliance.[1]
An integrated QA framework is essential—not merely additive quality checks, but a interconnected system ensuring traceability from manufacturing through real-world deployment, addressing AI-specific challenges like non-determinism, bias propagation, and continual learning under physiological variability. Fragmented approaches risk eroding clinician trust, amplifying safety gaps, and stalling innovation in multi-sensor therapeutic capsules.[35,38]
It is a grave necessity to have a multidisciplinary collaborative effort between regulators (IMDRF/FDA), industries, gastroenterologists, ethicists, and patients to set a standard and benchmark dataset and post-market registry. Further research should focus on having a comprehensive understanding of artificial intelligence, human-artificial intelligence synergy, and adaptive QA models to harness the full potential of autonomous GI diagnostic systems to ensure successful outcomes for patients.
REFERENCES
Tejas Niwane, Shailesh Jawarkar, Ashish Dahekar, Monika Jadhao, Quality Assurance Ecosystems for AI-Driven Ingestible Capsules: A Systematic Review in Gastrointestinal Diagnostics, Int. J. of Pharm. Sci., 2026, Vol 4, Issue 2, 4558-4569. https://doi.org/10.5281/zenodo.18811106
10.5281/zenodo.18811106