Num | Variable Population | Implementation |
---|
1 | Text Strings Greater Than 200 Characters | When text strings greater than 200 characters are collected, the following conventions for general observation class variables and SUPP-- datasets will be adhered to: - The first 200 characters of text should be stored in the parent domain variable and each additional 200 characters of text should be stored in a record in the SUPP-- dataset.
- When splitting a text string into several SUPP-- records, the text should be split between words to improve readability.
- The value of the first QNAM representing text over 200 characters will be the original domain variable name without any numeric suffix.
- The values for subsequent QNAMs will be sequential variable names, formed by appending a 1-digit integer, beginning with 1, to the original domain variable name. In cases where the standard domain variable name is already 8 characters in length, applicants will replace the last character with a digit when creating values for QNAM.
- e.g., For Other Action Taken in Adverse Events (AEACNOTH), values for QNAM for the SUPPAE records would have the values AEACNOT1, AEACNOT2, and so on.
- The value for QLABEL should be the original domain variable label for all QNAM values.
| 2 | "Yes", "No", Values | - For variables where the response is "Yes" or "No", both "Y" and "N" will be Variables where the response is "Yes" or "No" ("Y" or "N") should normally be populated for both "Y" and "N" responses.
- This eliminates confusion regarding whether a blank response indicates "N" or is a missing value.
- However, some variables are collected or derived in a manner that allows only 1 response , such as when (e.g., a single checkbox indicates for "Yes"). In situations such as these, where it is unambiguous to populate only the only the response of interest, it is permissible to populate only 1 value ("Y" or "N") and leave the alternate value blank. An example of when it would be acceptable to use only a value of "Y" would be for Last Observation Before Exposure Flag (--LOBXFL) variables, where "N" is not necessary to indicate that a value is not the last observation before exposure.
| 3 | --FOCIDOften, | - Variable --FOCID is populated when a specific part of a subject or specimen is identified as a study-specific point of interest (e.g., injection site, biopsy site, treated site, region of the body)
and is commonly referenced in data collections and tabulationsThe FOCID variable enables representation of this concept, and is available within all general observation classes. - When used, the variable serves as a cross-domain identifier for the study-specific focus of interest; any records relating to the same focus would have the same FOCID value
. When populated, it is usually part of the natural key for the domain, providing a level of granularity that would otherwise not be available through the other variables. This is especially true when the foci are all within the same area on the subject.The following examples demonstrate records from various domains that tie to study-specific foci (i.e., injection site 1 and injection site 2) as identified in the protocol. In each case, the FOCID variable is populated identically across domains for the same focus. Note that, in this example, many variables have been omitted due to space considerations. At each of the 2 injection sites, subject 12345001 received a different treatment (Exposure domain) | 4 | --SEQ, --RECID | Variables --SEQ and --RECID are populated to explicitly identify domain records in different ways. Differences in variable population are described below. --SEQ | --RECID |
---|
Values uniquelyidentify records for subjects within a domain. | Values uniquely identify records within a domain. | The relationship between records and values is not one-to-one. - Values may change between versions of datasets.
- When a record is deleted, the value for the record may be reused to identify another record.
| There is a one-to-one relationship between records and values. - Values for records do not change between versions of datasets even when content is modified.
- When a record is deleted, the value for the record will not be reused to identify another record.
| Variable is numeric with numeric values. | Variable is character with numeric, character, or alphanumeric values. | Conventions for establishing and maintaining values are applicant-defined. Values may or may not be sequential depending on data processes and sources. |
| 5 | --GRPID | - The value of --GRPID is generally assigned during or after data collection at the discretion of the applicant.
| 6 | --REFID | - Values for --REFID are sponsor-defined and can be any alphanumeric strings the sponsor chooses, consistent with their internal practices.
| 7 | --CAT, --SCAT | - Values for --CAT and/or--SCAT are known (identified) about the data before it is collected.
- Variable --SCAT will be populated only when there is a value in variable --CAT.
| 8 | --STAT | - In general observation class domains, --STAT will be populated with "NOT DONE" when data are not collected for the topic of the observation.
|
|