- Created by Nate Freimark, last modified by Christine Connolly on Mar 27, 2023
You are viewing an old version of this page. View the current version.
Compare with Current View Page History
« Previous Version 8 Current »
Num | Convention |
---|---|
1 | To ensure compliance with SAS Version 5 transport file format and Oracle constraints, all ADaM variable names must be no more than 8 characters in length, start with a letter (not underscore), and be composed only of letters (A-Z), underscore ( _ ), and numerals (0-9). All ADaM variable labels must be no more than 40 characters in length. All ADaM character variables must be no more than 200 characters in length. |
2 | The lower-case letters "w", "xx", "y", and "zz" that appear in a variable name or label in this document must be replaced in the actual variable name or label using the following conventions:
|
3 | Any variable in an ADaM dataset whose name is the same as an SDTM variable must be a copy of the SDTM variable, and its label, meaning, and values must not be modified. ADaM adheres to a principle of harmonization known as "same name, same meaning, same values." However, to optimize file size, it is permissible that the length of the variables differ (e.g., trailing blanks may be removed). In many cases it makes sense to copy over a variable from an SDTM dataset. For example, the SDTM variable --SEQ may be useful for traceability. However, in other cases, it is also perfectly acceptable, and might be much better, to create an ADaM variable with a meaningful variable name and clear and unambiguous metadata. An SDTM variable may be somewhat meaningless when removed from its SDTM context. For example, the meaning of the SDTM variable DSDECOD may depend on other SDTM variables such as DSCAT and DSSCAT, and ultimately on how the data were collected and mapped to SDTM in a particular study; thus it may be better to create a clearly defined ADaM variable. In any case, whenever values are modified in any way, it is mandatory to do so in an ADaM variable, and it is prohibited to do so in a variable whose name is that of an SDTM variable. |
4 | When an ADaM standard variable name has been defined for a specific concept, the ADaM standard variable name must be used, even if the content of an ADaM variable is a direct copy of the content of an SDTM variable. For example, in the creation of an ADaM dataset based on an SDTM LB dataset, even if AVAL is just a copy of LBSTRESN, the dataset must contain AVAL. |
5 | For variable pairs designated as having a one-to-one relationship within a specified scope (e.g., within a parameter, within a study), if both variables are present in the dataset and there exists a row in that scope on which both variables are populated, then there must be a one-to-one relationship between the two variables on all rows within the scope on which both variables are populated. The scope noted in this document should be considered the minimum level for the mapping; it does not preclude the producer from using a broader level of scope. For example, if a one-to-one relationship is specified as within a PARAM, the producer may elect to use the same one-to-one relationship across all PARAMs within the dataset or study. In addition, note that "within a parameter" means "within a parameter within a dataset." |
6 | In a pair of corresponding variables (e.g., TRTP and TRTPN), the primary or most commonly used variable does not have the suffix or extension (i.e., N for numeric or C for character). The relevant suffix is used only on the name of the secondary member of the variable pair. For example, in the (TRTP, TRTPN) pair, the primary variable, TRTP, is character, but it is not named TRTPC. Similarly in the (APERIOD, APERIODC) pair, the primary variable, APERIOD, is numeric, but it is not named APERIODN. When a secondary variable is included in the dataset, then the primary variable must also be included. . If both variables of a variable pair are present, there must be a one-to-one relationship between the values of the two variables, as described in Item 5 above. |
7 | In general, if an SDTM character variable is converted to a numeric variable in an ADaM dataset, then it should be named as it is in the SDTM dataset with an "N" suffix added. For example, the numeric version of the DM variable SEX is SEXN in an ADaM dataset, and a numeric version of RACE is RACEN. As stated in item 6, the secondary variable of the variable pair cannot be present in the dataset unless the primary variable is also present. Applying Item 6 to the variable pairs being described in Item 7, the numeric equivalent of the variable cannot be present in the dataset unless the character version is also present. If necessary to keep within the eight-character variable name length limit, the last character may be removed prior to appending the N. Note that this naming scheme applies only to numeric variables whose values have a one-to-one relationship to the values of the equivalent character variables. Note also that this convention does not apply to SDTM date/time ISO8601-formatted character variables converted to ADaM numeric *DT, *TM, and *DTM variables. |
8 | Variables whose names end in FL are character flag (or indicator) variables with at most two possible non-missing values, Y or N (i.e., yes or no). The name of the corresponding numeric flag (or indicator) variable ends in FN. If the flag is included in an ADaM dataset, the character version (*FL) is required but the corresponding numeric version (*FN) can also be included. If both versions of the flag are included, there must be a one-to-one relationship between the values of the two variables, as described in Section 3.1.4, Flag Variable Conventions. |
9 | Variables whose names end in GRy, Gy, or CATy are grouping variables, where "y" refers to the grouping scheme or algorithm (not the category within the grouping). For example, SITEGR3 is the name of a variable containing site group (pooled site) names, where the grouping has been done according to the third site-grouping algorithm; SITEGR3 does not mean the third group of sites. Within this document, CATy is the suffix used for categorization of ADaM-specified analysis variables (e.g., CHGCATy categorizes CHG). |
10 | It is recommended that producer-defined grouping or categorization variables begin with the name of the variable being grouped and end in GRy (e.g., variable ABCGRy is a character description of a grouping or categorization of the values from the ABC variable for analysis purposes). If any grouping of values from an SDTM variable is done, the name of the derived ADaM character grouping variable should begin with the SDTM variable name and end in GRy (GRyN for the numeric equivalent) where y is an integer [1-99, not zero-padded] representing a grouping scheme. For example, if a character analysis variable is created to contain values of Caucasian and Non-Caucasian from the SDTM RACE variable, then it should be named RACEGRy and its numeric equivalent should be named RACEGRyN (e.g., RACEGR1, RACEGR1N). As described in Table 3.1.5.1, Gy can be used as an abbreviated form of GRy when the use of GRy would create a variable name longer than 8 characters. Truncation of the original variable name may be necessary when appending suffix fragments GRy, GRyN, Gy, or GyN. |
- No labels