Conformant collection and representation of tobacco product data is ensured by full adherence to standards per this guide. Conformance to standards is assessed by confirming implementation of standards per this guide and by evaluating tabulation and analysis data in relation to conformance rules.
The following table illustrates minimum conformance to standards for data collection using CRFs.
Metadataspec |
---|
Num | Conformance | Implementation |
---|
1 | Following best practices for CRF design | The design of the CRF follows both recommendations for creating data collection instruments and recommendations for CRF design. | 2 | Following data collection variable naming conventions | Data collection variable naming conventions are applied in the operational database as specified. | 3 | Following standard wording for question text or prompts | The wording of CRF questions is standardized per specified question text or prompt for the data collection fields. | 4 | Following core designations. | All HR (Highly Recommended) and applicable R/C (Recommended/Conditional) data collection fields are |
|
Determining whether a product faithfully implements a standard or specification is essential to creating robust, interoperable solutions.
Data Collection
The following guidelines should be followed to ensure implementation of CDASH compliant CRFs
Metadataspec |
---|
Guideline | Description |
---|
1 | Core designations must be followed. | All Highly Recommended and applicable Recommended/Conditional fields must be present in the CRF and/or operational database. | Match language in metadata description25 | Following guidance for CDISC Controlled Terminology | must be used. The CDISC Terminology that is included in the CDASHIG metadata tables must be used terminology is used as specified to collect the data | in All codelists displayed in the CRF must use or directly map to the current published CDISC CT submission values, when it is available. Subsets of published CT, such as those provided in CDASH terminology, can be used.- In Findings domains, values from the relevant CDISC CT lists must also be used to create appropriate Question Text, Prompts, and/or variable names. For example, if the question is about the subject's height, incorporate the value of "Height" from the VSTEST codelist as the prompt on the CRF, and incorporate "HEIGHT" from VSTESTCD in the variable name.
3 | Best practices must be followed. | - The design of the CRF must follow guidance in Section 4.1, Best Practices for Creating Data Collection Instruments, and Section 4.2, CRF Design Best Practices.
| 4 | The wording of CRF questions should be standardized; CDASH Question Text or Prompt must be used to ask the question | - In cases where the data collection is done in a denormalized presentation on the CRF, the relevant CDISC CT should be used in the question text or prompt as much as possible. It is acceptable to use synonym text that will directly map to a CDISC submission value (including an NCI Preferred Term), if the CDISC submission value is not appropriate for data collection. For example, "ALT" may be better than "Alanine Aminotransferase" as the prompt for this lab test. If there is no CDISC CT available, the question text or prompt must be standardized by the implementing organization and used consistently. One of the basic purposes of CDASH is to reduce unnecessary variability between CRFs and to encourage the consistent use of variables to support semantic interoperability; therefore, Question Text and Prompt must be used verbatim.
- Similarly, where SDTMIG variables exist in the operational database and the value conforms to controlled terminology, it is permissible to use a familiar synonym on the CRF without affecting conformance. For example, on the Demographics page, SEX may be displayed as "Male" or "Female", whereas in the operational database the controlled terminology values of "M" and "F" would be used.
- In some cases, CDASH Question Text and Prompt allow for flexibility while still being considered conformant. See Section 2.3, CRF Development Overview, for further details on the usage of Question Text and Prompt.
- CDASH Model Question Text may contain options for the tense; if the option is not provided, the tense of the Question Text may be modified to reflect the needs of the study.
- In cases where the CDASH Question Text or Prompt cannot be used due to culture or language, or a CRF must be translated for language or cultural reasons, the implementer must ensure the translation is semantically consistent with the CDASH Question Text and Prompt in the CDASHIG metadata table.
- In cases where a more specific question needs to be asked than that provided by Question Text or Prompt, CDASH recommends the use of a brief CRF Completion Instruction, as long as the instruction clarifies the data required by the study without altering the meaning of variable as defined by the standard. For example, "Sex at birth" is not the same question as "Sex" (which is loosely defined as "reported sex").
| 5 | CDASHIG variable naming conventions should be used in the operational database. | Use a consistent syntax that includes the root variable name and/or controlled terminology, and any other standardized concepts that are needed to support efficient mapping of the collected value to SDTM datasets. The goals are to have beginning-to-end traceability of the variable name from the data capture system to the SDTM datasets, and to support automating electronic data capture (EDC) set-up and downstream processes. It is recognized that (particularly in an EDC system) the variable name of a data collection field, as well as the name in the underlying database, may have various “system” components that become part of the item’s identifier. EDC systems, prior to exporting data in a defined format, may require the variable name to include such database “references” as the EDC page name, the item “group” name, or perhaps a combination. In cases where the data collection is done in a denormalized way, appropriate CDISC CT must be used when it is available. For example, when collecting vital signs results in a denormalized eCRF, the variable names can be created by using terms from the Vital Signs Test Code codelist. For example, temperature result can be collected in a variable called TEMP or TEMP_VSORRES; systolic blood pressure result can be collected in a variable called SYSBP or SYSBP_VSORRES. When a particular system’s constraints limit the variable name to 8 characters, a similar, consistent implementation that preserves either the normalized root variable (e.g., ORRES) or the controlled terminology (e.g., --TESTCD value) should be implemented. Other variable patterns that intentionally connect the data collection variable to the target SDTMIG variable are also acceptable. For example, targetDataset_targetVariable[_optionalTopic] is acceptable. Examples of this pattern include DM_AGE, DM_AGEU, VS_VSORRES_TEMP, VS_VSORRESU_TEMP, SUPPAE_QVAL_AEDIS.
Whereas all CDASHIG defined variable names are 8 characters or fewer to accommodate SDTM limits on variable names, QNAMs, and --TESTCDs, the maximum length of a variable name that may be implemented is determined by the data management system used, not by CDASH. When collecting data in a horizontal manner, to facilitate transformation to SDTM datasets, when possible it is recommended to create denormalized CDASH variables in the data collection system by incorporating the SDTMIG variable name target and/or the controlled terminology (e.g., --TESTCD) as part of the CDASH variable names. The domain-level metadata labeled as "Horizontal-Generic" in the Implementation Options column of the CDASHIG metadata tables are examples of how to implement this. There is no conformance requirement implied by these examples.
| 6 | - Data output by the operational database into an SDTMIG variable should require no additional processing if the CDASHIG and SDTMIG variable names are the same.
| - An SDTM data programmer should be able to assume that data in an SDTMIG variable is SDTMIG-compliant. Minimal processing (e.g., changing case) does not affect conformance. This helps to ensure a quality deliverable, even if the programmer is unfamiliar with data capture practices.
| 7 | Validated questionnaires, ratings, or scales must present the questions and reply choices in the manner in which these were validated. | - his must be followed to maintain the validity of a validated instrument. (See Section 8.3.12, QRS - Questionnaires, Ratings, and Scales).
- In some cases, this may result in CRFs that do not conform to CDASH best practices; however, restructuring these questionnaires should not be done because it could invalidate them.
- The use of such questionnaires in their native format should not be considered to affect conformance to CDASH.
| |
Implementers must determine what additional data fields to add to address study-specific and therapeutic area requirements, and applicable regulatory and business practices. See Section 3.4, How to Create New Data Collection Fields When No CDASHIG Field Has Been Defined, for more information on how to create data collection fields that have not already been described in this implementation guide.
Data Tabulation
From SDTMIG
Conformance with the SDTMIG domain models is minimally indicated by:
- Following the complete metadata structure for data domains
- Following SDTMIG domain models wherever applicable
- Using SDTM-specified standard domain names and prefixes where applicable
- Using SDTM-specified standard variable names
- Using SDTM-specified data types for all variables
- Following SDTM-specified controlled terminology and format guidelines for variables, when provided
- Including all collected and relevant derived data in one of the standard domains, special-purpose datasets, or general observation class structures
- Including all Required and Expected variables as columns in standard domains, and ensuring that all Required variables are populated
- Ensuring that each record in a dataset includes the appropriate Identifier and Timing variables, as well as a Topic variable
- Conforming to all business rules described in the CDISC Notes column and general and domain-specific assumptions
Conformance with the SENDIG domain models is minimally indicated by:
- Following the complete metadata structure for data domains
- Following SENDIG domain models wherever applicable
- Using SENDIG-specified standard domain names and prefixes per controlled terminology
- Using SENDIG-specified standard variable names
- Using SENDIG-defined variable labels for all standard domains
- Using SDTM-specified data types for all variables
- Following SDTM/SEND-specified controlled terminology and format guidelines for variables when provided
- Including all collected and relevant derived data in one of the standard domains, special-purpose datasets, or general-observation class structures
- Including all required and expected variables as columns in standard domains, and ensuring that all required variables are populated
- Ensuring that each record in a dataset includes the appropriate identifier and timing variables as well as a topic variable
- Conforming to all business rules described in the CDISC Notes column and general and domain-specific assumptions
- Ensuring that the datasets are in SAS v5 transport file format or other transport file format required by a regulatory agency
| 6 | Presenting validated QRS questions and reply choices as validated in the CRF. In some cases, this may result in CRFs that do not conform to CDASH best practices. The use of such questionnaires in their native format does not affect conformance. | All QRS questions and reply choices are presented as validated in the CRF. | 7 | Aligning data collection variables values and target tabulation variables values when collection and tabulation variable names are the same. Minimal processing, such as changing case when mapping a data collection variable value into a tabulation variable, does not affect conformance. | Data output by the operational database into a tabulation dataset variable requires minimal processing when the data collection and tabulation variable names are the same. |
|
The following table illustrates minimum conformance to standards for tabulation datasets.
Metadataspec |
---|
Num | Conformance | Implementation |
---|
1 | Representing all collected, assigned, and relevant derived data in applicable datasets | All data generated per scientific and regulatory requirements are included in tabulation datasets. | 2 | Using domain specifications in this guide wherever applicable | A dataset is created using a domain specification in this guide when the scientific nature or role of the data is within the scope of a domain. Domains are extended or custom domain specifications are only used when data are different in nature and are not in scope for domains in this guide. | 3 | Following conventions for dataset naming | The dataset name is standardized per naming conventions and per controlled terminology where applicable. | 4 | Following guidance for dataset record structure | Dataset content is aligned with the record structure specified per the domain specification. | 5 | Following core designations | All Required and Expected tabulation Jira |
---|
showSummary | false |
---|
server | Issue Tracker (JIRA) |
---|
serverId | 85506ce4-3cb3-3d91-85ee-f633aaaf4a45 |
---|
key | TOBA-791 |
---|
| variables are included as columns in the dataset. Required tabulation variables are populated for all records in the dataset. Permissible variables used to collect data are included in the dataset, even when no data for those variables were collected. | 6 | Following conventions for variable naming | The names of variables in the dataset are standardized per domain specifications and other applicable guidance. Controlled terminology for domain prefixes is used as specified for variable naming. | 7 | Following guidance for variable types | The variables in the dataset are standardized for either numeric or character values as specified per the domain specification. | 8 | Populating variable values in alignment with this guide | All variables in the dataset are populated as expected per this guide including per general and domain-specific guidance, controlled terminology, and formatting. |
|
The following table illustrates minimum conformance to standards for analysis datasets.
Metadataspec |
---|
Num | Conformance | Implementation |
---|
1 | ADaM fundamental principles are followed | - Datasets and associated metadata clearly and unambiguously communicate the content and source of the datasets supporting the statistical analyses performed in a clinical study.
- Datasets and associated metadata provide traceability to show the source or derivation of a value or a variable (i.e., the data's lineage or relationship between a value and its predecessor(s)). The metadata identify when and how analysis data have been derived or imputed.
- Datasets are readily usable with commonly available software tools.
- Datasets are associated with metadata to facilitate clear and unambiguous communication. Ideally the metadata are machine-readable.
- Datasets have a structure and content that allow statistical analyses to be performed with minimal programming. Such datasets are described as "analysis-ready." Datasets contain the data needed for the review and re-creation of specific statistical analyses. It is not necessary to collate data into analysis-ready datasets solely to support data listings or other nonanalytical displays.
| 2 | ADaM datasets follow the normative data found in the TIG | Datasets follow the fundamental principles defined in ADaM and adhere as closely as possible to TIG variable naming and other conventions. | 3 | Traceability principles are followed | In ADaM, it is assumed that the original data sources for ADaM datasets are SDTM datasets, even when ADaM datasets are derived from other ADaM datasets. ADaM has features that enable traceability from analysis results to ADaM datasets and from ADaM datasets to SDTM datasets. These conventions must be followed for ADaM datasets with a CLASS value of BASIC DATA STRUCTURE, OCCURRENCE DATA STRUCTURE, and SUBJECT LEVEL ANALYSIS DATASET. Other analysis datasets should follow this convention where practical and feasible. |
|
Tabulation and analysis dataset conformance can be formally evaluated in relation to defined sets of conformance rules. The CDISC TIG Conformance Rules Version 1.0 (available at <placeholder, link pending>)includes rules for both tabulation and analysis datasets. Conformance rules for tabulation datasets assess the conformance of dataset structures and contents to the TIG tabulation standards. Conformance rules for analysis datasets assess the conformance of dataset construction to the TIG analysis standards.
Data Analysis