1Department of Pharmaceutics, Vidya Siri College of Pharmacy, Bangalore, Karnataka 560035, India.
2B Pharm students of Vidya Siri College of Pharmacy, Bangalore, Karnataka 560035, India.
Data integrity plays a critical role in the pharmaceutical industry, as it directly influences product quality, ensures patient safety, and maintains compliance with regulatory standards. It is the corner stone of the industry, ensuring the reliability of data used in drug development, manufacturing, testing, distribution and regulatory submissions. With increasing digitalization, maintaining data integrity has become more challenging, requiring robust systems and practices. Regulatory bodies such as the United States Food and Drug Administration [USFDA], European Medicines Agency [EMA] and World Health Organization [WHO], mandate strict adherence to data integrity principles, such as ALCOA and ALCOA+. These principles guide the entire data lifecycle, emphasizing the need for robust systems to maintain data accuracy, traceability and security. Compliance with these expectations is critical, as breaches can lead to severe consequences, including warning letters, product recalls, and legal action against the organization. The pharmaceutical industry is also adapting to emerging technologies like block chain and artificial intelligence, which offer potential enhancements in data integrity management. This review outlines the benefits, challenges and key considerations of data integrity for paper based and computerized records, strategies for strengthening data integrity and importance of quality culture in upholding data integrity. By understanding the complexities of data management, pharmaceutical companies can ensure regulatory compliance and enhance trust in their products.
Data integrity is defined as “the extent to which the data are complete, consistent, accurate, trustworthy, and reliable and how these aspects of data are maintained throughout the lifecycle of the product” [1]. This is an important element and requirement of pharmaceutical quality system highlighting pharmaceutical industry’s responsibility to ensure the safety, efficacy, and quality of the drugs, and FDA’s role in safeguarding public health [1]. Inadequate data integrity practices and vulnerabilities compromise the quality of records and evidence, potentially jeopardizing the quality of medicinal products [2]. The principles of data management are applicable to paper-based and computerized records in the same way [3]. Integrity of data is essential in research and development, manufacturing, quality control and regulatory submissions. However, maintaining data integrity is fraught with challenges that can arise from various sources. Some of the key challenges are increasing digitalization, regulatory compliance, human error and legacy systems. ALCOA and ALCOA+ are acronyms introduced by USFDA to indicate data integrity in relation to pharmaceutical research, manufacturing, testing and supply chain as a best practice for ensuring data integrity and to overcome the challenges and data integrity breaches [4]. The term ALCOA was introduced to define its expectations of electronic data. Regulators adopted the term ALCOA Plus, emphasizing the consequences of not adhering to regulations and suggesting prevention methods such as simple checklists, self-audits, and self-inspections [2] ALCOA refers to the principles of data being Attributable, Legible, Contemporaneous, Original, and Accurate. ALCOA was further expanded to ALCOA Plus, Plus means Enduring, Available and Accessible, Complete, Consistent, Credible, and Corroborated [5].
KEY TERMINOLOGIES IN DATA INTEGRITY: DEFINITIONS3
Figure 1: Key Terminologies
It includes facts, figures, and statistics, which are gathered for reference or analysis. This encompasses all original records and their true copies, such as source data and metadata, along with any subsequent transformations and reports. These records are generated or documented during the GXP activity, enabling a thorough reconstruction and evaluation of the activity.
It refers to the initial record of information, captured for the first time, whether on paper or electronically. If the information is originally recorded in a dynamic state, it should remain accessible in that form. Raw data must allow for a complete reconstruction of activities.
It refers to the data that describe the attributes of other data, providing context and meaning. They typically detail the structure, data elements, inter-relationships, and other characteristics, such as audit trails. They form an integral part of the original record, and without the context provided by metadata, the data lacks meaningful interpretation.
It refers to the arrangements made to ensure that data, regardless of its format, is recorded, processed, retained, and utilized throughout its lifecycle. Data governance must establish clear data ownership and accountability, focusing on the design, operation, and monitoring of processes and systems to uphold data integrity principles, thereby ensuring that the data is readily traceable and directly accessible upon request from national competent authorities. Electronic data should be provided in a human-readable format.
It refers to all the phases in the life of a data, from generation and recording to processing [including analysis, transformation, or migration], use, retention, archiving/retrieval, and destruction. All these phases must be managed effectively. Data governance should be applied throughout the entire lifecycle to ensure data integrity.
Organizations should possess a thorough understanding of processes and technical knowledge of systems used for data collection and recording, including their capabilities, limitations, and vulnerabilities. The chosen method should ensure that data collected and retained is accurate, complete, and meaningful for its intended use.
Data transfer involves moving data between different storage types, formats, or computerized systems. Data migration, on the other hand, refers to relocating stored data from one durable storage location to another, which may include changing the data’s format but not its content or meaning.
It refers to a series of operations carried out on data to extract, present, or obtain information in a specified format.
Original copy refers to the initial capture of data or information, such as an original paper record of a manual observation or an electronic raw data file from a computerized system, along with all subsequent data needed to fully reconstruct the GXP activity. True copy, on the other hand, refers to a copy of the original record, regardless of the media type, that has been verified [either by a dated signature or through a validated process] to contain the same information, including data describing the context, content, and structure, as the original.
It refers to a type of metadata that records actions related to the creation, modification, or deletion of GXP records. It securely logs life-cycle details such as creation, additions, deletions, or alterations of information in a record, whether on paper or electronically, without obscuring or overwriting the original record. An audit trail enables the reconstruction of event histories by detailing ‘who, what, when, and why’ of each action was performed.
It refers to the method for reviewing specific record content, including critical data, metadata, cross-outs [in paper records], and audit trails [in electronic records], should comply with all applicable regulatory requirements and be based on risk. There should be a procedure outlining the process for data review and approval.
It refers to archiving [protected data for long-term storage] or backup [data for disaster recovery purposes]. Arrangements for data and document retention should ensure records are protected from deliberate or accidental alteration or loss.
It refers to a designated secure area or facility for the long-term retention of data and metadata to verify processes or activities. Archived records may be the original record or a ‘true copy’ and should be safeguarded against alteration or deletion without detection, as well as protected from accidental damage such as fire or pests.
BENEFITS OF DATA INTEGRITY
KEY CHALLENGES IN ENSURING DATA INTEGRITY
The pharmaceutical industry is rapidly adopting digital systems for data management. While this enhances efficiency, it also introduces vulnerabilities, such as cyber threats, unauthorized access, and software malfunctions, which can compromise data integrity [7].
Pharmaceutical companies must comply with various regulations and guidelines, including those from the U.S. Food and Drug Administration [USFDA], European Medicines Agency [EMA], Medicines and Healthcare Products Regulatory Agency [MHRA] and other regulatory agencies. These regulations often have complex and evolving requirements, making compliance challenging [7].
Maintaining data integrity is still greatly challenged by human error. Inaccuracies in data entry, documentation, and system operations can compromise the dependability of pharmaceutical products [7].
Many pharmaceutical companies still rely on outdated or legacy systems that lack robust data integrity controls. These systems can be prone to data loss, corruption, and unauthorized access, making them a significant risk to data integrity [8].
DI CONSIDERATIONS FOR PAPER BASED DOCUMENTS/RECORDS [2]
Effective management of paper-based documents is crucial for GMP/GDP compliance. Therefore, the documentation system should be structured in such a way that they fulfill these requirements, ensuring that the documents and records are effectively controlled and managed to maintain their integrity. Paper records should be managed to remain attributable, legible, contemporaneous, original, accurate, complete, consistent, enduring [indelible/durable], and available [ALCOA+] throughout the data lifecycle.
Figure 2: Considerations for DI of Paper Based Records
Expectations on Generation, Distribution and Control of Records [2]
Expectations on Distribution and Control of Records [2]
Expectations on Completion of Records [2]
Expectations on Making Corrections of Records [2]
Expectations on Verification of Records [2]
Expectations on How Records should be Verified [2]
Expectations on Where and How Records should be Archived [2]
Disposal of Original Records or True Copies [2]
DI CONSIDERATIONS FOR COMPUTERIZED SYSTEMS [2]
Companies use various computerized systems, from simple standalone to complex integrated systems, impacting product quality. Regulated entities must evaluate and control these systems as per GMP and GDP requirements. Organizations should assess and document each system’s use, function, and data integrity risks, focusing on criticality and product quality. Systems affecting product quality must be managed under a Pharmaceutical Quality System to prevent data manipulation. System design, evaluation, and selection should consider data management and integrity, ensuring vendors understand GMP/GDP and data integrity requirements. Legacy systems should meet these standards with additional controls if needed. A risk-based approach should be used to manage data risk and criticality, including metadata. Complete capture and retention of raw data and critical metadata are essential to reconstruct manufacturing events or analyses. Data vulnerability and risk should be assessed based on the computerized system’s role in the business process, evaluating inherent data integrity controls, especially those vulnerable to exploits. During inspections, company expertise should be utilized for system access and navigation. These principles also apply to outsourced computerized systems, ensuring compliance with GMP/GDP requirements and effective data management and integrity controls.
Figure 3: Considerations for DI for Computerized Records
Expectations on Validation and Maintenance of Computerized Systems [2]
Regulated companies must ensure data integrity from the start of system procurement through the entire lifecycle. Functional Specifications [FS] and User Requirement Specifications [URS] should address data integrity. Critical GMP/GDP equipment must be evaluated for data integrity controls before purchase.
Expectations on Data Transfer and Migration [2]
Expectations on System Security for Computerized Systems [2]
Expectations on Data Capture or Entry in Computerized Systems [2]
Expectations on Data capture and Entry [2]
Expectations on Review of Data within Computerized Systems [2]
Storage, Archival, and Disposal of Electronic Data [2]
BEST PRACTICES/ STRATEGIES FOR ENSURING DATA INTEGRITY
Figure 5a: ALCOA
Figure 5b: ALCOA Plus
DATA INTEGRITY VIOLATIONS
Data integrity violations in cGMP have led to regulatory actions like warning letters, import alerts, and seizures. Numerous serious DI issues have been uncovered, posing long-term risks to companies and affecting their culture. Managing DI is particularly challenging in the pharmaceutical industry due to rapidly increasing data volumes. Poor data quality can damage an organization’s reputation. Implementing data controls without understanding regulatory and business processes can lead to questionable data validity and regulatory action [9]. DI is crucial for ensuring data accuracy and consistency throughout its lifecycle. Without proper measures, there’s a high risk of corrupted results. DI errors often stem from human error, inadequate procedures, data transfers, software defects, and physical damage. Maintaining DI is essential for accountability, ensuring the safety, effectiveness, and quality of drug products, and is a critical aspect of regulatory compliance [10].
ABBREVIATIONS
CONCLUSION
Ensuring data integrity in the pharmaceutical sector is paramount to safeguarding public health and maintaining the credibility of the industry. The benefits of upholding data integrity include enhanced product quality, compliance with regulatory standards, and fostering trust among the stakeholders. However, the sector faces key challenges such as human error, inadequate training, outdated systems, and intentional data manipulation. Addressing these key challenges requires a strategic approach, involving robust data management systems, continuous employee training, and the adoption of modern technologies like blockchain and AI for real-time monitoring and verification. Furthermore, compliance with regulatory expectations from regulatory bodies such as FDA, EMA and others, is critical for preventing potential violations and ensuring patient safety. In conclusion, maintaining data integrity is not only a regulatory requirement but a fundamental responsibility for pharmaceutical companies. A proactive, technology driven strategy aligned with regulatory standards will enhance transparency, ensure compliance, and ultimately contribute to the delivery of safe and effective pharmaceutical products.
REFERENCES
Akshaya U. Bhandarkar, Balaji J., Jasvanth R., Dinesh Ragavendra S. , Ensuring Data Integrity In The Pharmaceutical Industry: Benefits, Challenges, Key Considerations And Best Practices, Int. J. of Pharm. Sci., 2024, Vol 2, Issue 10, 1198-1210. https://doi.org/10.5281/zenodo.13971925