Model Documentation

The GroupingFactor class is used in the reporting event's analysisGroupings attribute to define grouping factors, which are characteristics used to subdivide the subject population or analysis dataset records into groups for analysis. Examples of grouping factors include subject characteristics (such as sex or age group), planned or actual treatment, "system organ class" or "preferred term" in an adverse events analysis dataset, and "parameter" or "visit" in a vital signs analysis dataset. Each grouping factor may either contain a set of specified groups (e.g., “Male” and “Female” for a sex grouping) or be defined as "data-driven", where the group values are obtained from the data (e.g., "system organ class" or "preferred term" for adverse events).

ER Diagram: GroupingFactor

Each grouping factor is represented as an instance of the GroupingFactor class and, in each instance:

  • The id attribute contains the assigned identifier value for the grouping factor.
  • The name attribute contains the name of the grouping factor (e.g., "Treatment", "Sex", "System Organ Class" or "Vital Signs Parameter").
  • The description attribute may be used to record a detailed description of the grouping factor.
  • The label attribute may be used to record a short description of the grouping factor.
  • The groupingDataset and groupingVariable attributes may contain, respectively, the dataset name and variable name of the variable whose values are used to subdivide subjects or data records. These attributes:
    • May be blank for groupings that include predefined groups that are specified using compound expressions that reference more than one variable.
    • Must be populated for data-driven groupings.
  • The dataDriven attribute contains either "true" or "false" to indicate whether the grouping factor is data-driven. The groups within a data-driven grouping factor (where dataDriven = true) are not predefined; they are determined from the distinct values contained within the specified groupingVariable.
  • For grouping factors that are not data-driven (dataDriven = false), the groups attribute contains the specifications of 2 or more predefined groups.

When predefined groups are specified in the groups attribute, each predefined group is defined as an instance of the Group class. In each instance of the Group class:

  • The id attribute contains the assigned identifier value for the predefined group.
  • The name attribute contains the assigned name for the predefined group, for example:

    • "Placebo" or the active treatment name in a "Treatment" grouping factor,
    • "Male" or "Female" in a "Sex" grouping factor,
    • "Screening Visit" or "Visit 1" in a "Visit" grouping factor

  • The description attribute may be used to record a detailed description of the predefined group.
  • The label attribute may be used to record a short description of the predefined group, such as might be used in a display with limited space.
  • The level attribute is assigned a value of "1".
  • The order attribute contains an integer used to order the predefined group with respect to the other predefined groups in the grouping factor (i.e., "1" for the first predefined group, "2" for the second, and so on). The value of this attribute is assigned according to the order in which the predefined groups should be displayed in outputs.
  • Either:

Or:

    • The compoundExpression attribute contains the definition of a compound expression represented as an instance of the CompoundGroupExpression class, in which:
      • The logicalOperator attribute contains a value from the ExpressionLogicalOperatorEnum enumeration.
      • The whereClauses attribute contains 1 or more subclauses being combined or negated by the specified logical operator. Each subclause is represented as one of the following:
        • A where clause represented as an instance of the WhereClause class, with the subclause expressed using either the condition or compoundExpression attribute.
        • A reference to another predefined group represented as an instance of the ReferencedGroup class, in which the subClauseId attribute contains the identifier value of the referenced group.

Refer to the WhereClauseCompoundExpression section for more information about compound expressions.

The ARS Model is designed to support the specification of analyses of data in ADaM datasets, which are expected to be "analysis-ready." In analysis-ready datasets, any complex criteria needed to specify predefined groups within grouping factors will usually have been applied to create specific grouping variables. These grouping variables can usually then be referenced in the specification of simple conditions, so compound expressions should generally not be required for the definition of predefined groups within grouping factors.

Example

This example shows the definition of 4 grouping factors, each of which is represented as an instance of the GroupingFactor class within the analysisGroupings attribute of the reporting event.

Lines 2-29:

Show the attributes of the instance of the GroupingFactor class for the first grouping factor, "Sex" (line 3), which is defined as being based on the ADSL SEX grouping variable (lines 4 and 5). The grouping factor is not data-driven (line 6), so the groups attribute is included (line 7) and contains 2 predefined groups, each specified as a simple condition represented as an instance of the Group class:

  • Lines 8-17: Show the first predefined group (order = 1), which is identified as "AnlsGrouping_05_Sex_1", named as "Female" (line 9), labeled as "F" (line 10), and contains a simple condition specifying that the value of the SEX variable in the ADSL dataset is equal to "F", which could be expressed as:
    • ADSL.SEX EQ 'F'
  • Lines 18-27: Show the second predefined group (order = 2), which is identified as "AnlsGrouping_05_Sex_2", named as "Male" (line 20), labeled as "M" (line 21), and contains a simple condition specifying that the value of the SEX variable in the ADSL dataset is equal to "M", which could be expressed as:
    • ADSL.SEX EQ 'M'
Lines 30-34:

Show the attributes of the instance of the GroupingFactor class for the second grouping factor, "Country" (line 31), which is defined as being based on the ADSL COUNTRY grouping variable (lines 32 and 33). The grouping factor is data-driven (line 34), so the groups attribute is not included; the groups within this grouping factor are derived from the distinct values in the ADSL COUNTRY variable.

Lines 35-60:

Show the attributes of the instance of the GroupingFactor class for the first grouping factor, "Vital Signs Parameter" (line 36), which is defined as being based on the ADVS PARAMCD grouping variable (lines 37 and 38). The grouping factor is not data-driven (line 39), so the groups attribute is included (line 40) and contains 2 predefined groups, each specified as a simple condition represented as an instance of the Group class:

  • Lines 41-50: Show the first predefined group (order = 1), which is identified as "AnlsGrouping_03_Param_1", named as "Systolic Blood Pressure (mmHg)" (line 42), and contains a simple condition specifying that the value of the PARAMCD variable in the ADVS dataset is equal to "SYSBP", which could be expressed as:
    • ADVS.PARAMCD EQ 'SYSBP'
  • Lines 51-60: Show the second predefined group (order = 2), which is identified as "AnlsGrouping_03_Param_2", named as "Diastolic Blood Pressure (mmHg)" (line 52), and contains a simple condition specifying that the value of the PARAMCD variable in the ADVS dataset is equal to "DIABP", which could be expressed as:
    • ADVS.PARAMCD EQ 'DIABP'
Lines 61-65:

Show the attributes of the instance of the GroupingFactor class for the second grouping factor, "System Organ Class" (line 62), which is defined as being based on the ADAE AESOC grouping variable (lines 63 and 64). The grouping factor is data-driven (line 65), so the groups attribute is not included; the groups within this grouping factor are derived from the distinct values in the AESOC variable.

YAML Example
analysisGroupings:
- id: AnlsGrouping_01_Sex
  name: Sex
  groupingDataset: ADSL
  groupingVariable: SEX
  dataDriven: false
  groups:
  - id: AnlsGrouping_01_Sex_1
    name: Female
    label: F
    level: 1
    order: 1
    condition:
      dataset: ADSL
      variable: SEX
      comparator: EQ
      value:
      - F
  - id: AnlsGrouping_01_Sex_2
    name: Male
    label: M
    level: 1
    order: 2
    condition:
      dataset: ADSL
      variable: SEX
      comparator: EQ
      value:
      - M
- id: AnlsGrouping_02_Cntry
  name: Country
  GroupingDataset: ADSL
  groupingVariable: COUNTRY
  dataDriven: true
- id: AnlsGrouping_03_Param
  name: Vital Signs Parameter
  groupingDataset: ADVS
  groupingVariable: PARAMCD
  dataDriven: false
  groups:
  - id: AnlsGrouping_03_Param_1
    name: Systolic Blood Pressure (mmHg)
    level: 1
    order: 1
    condition:
      dataset: ADVS
      variable: PARAMCD
      comparator: EQ
      value:
      - SYSBP
  - id: AnlsGrouping_03_Param_2
    name: Diastolic Blood Pressure (mmHg)
    level: 1
    order: 2
    condition:
      dataset: ADVS
      variable: PARAMCD
      comparator: EQ
      value:
      - DIABP
- id: AnlsGrouping_04_Soc
  name: System Organ Class
  groupingDataset: ADAE
  groupingVariable: AESOC
  dataDriven: true

These grouping factors and their predefined groups could be represented in tabular form as shown in the following table where each instance of the Group class is represented on a separate line, with attribute values for the related instance of the GroupingFactor class represented on every row. If there are no predefined groups in the grouping factor (i.e., it is data-driven), a single row is shown with values only included for the GroupingFactor attributes.

$titleHtml
idnamegroupingDatasetgroupingVariabledataDrivengroup_idgroup_namegroup_labellevelorderdatasetvariablecomparatorvalue
AnlsGrouping_01_SexSexADSLSEXFALSEAnlsGrouping_01_Sex_1FemaleF11ADSLSEXEQF
AnlsGrouping_01_SexSexADSLSEXFALSEAnlsGrouping_01_Sex_2MaleM12ADSLSEXEQM
AnlsGrouping_02_CntryCountryADSLCOUNTRYTRUE








AnlsGrouping_03_ParamVital Signs ParameterADVSPARAMCDFALSEAnlsGrouping_03_Param_1Systolic Blood Pressure (mmHg)
11ADVSPARAMCDEQSYSBP
AnlsGrouping_03_ParamVital Signs ParameterADVSPARAMCDFALSEAnlsGrouping_03_Param_2Diastolic Blood Pressure (mmHg)
12ADVSPARAMCDEQDIABP
AnlsGrouping_04_SocSystem Organ ClassADAEAESOCTRUE








$warningHtml

See the WhereClauseCompoundExpression section for an example of predefined groups created as compound expressions containing instances of the ReferencedGroup class to reference other groups by identifier.
Once a grouping factor has been defined in the analysisGroupings attribute of the reporting event, it can be used in the specification of any analysis by including its identifier value in the groupingId attribute of 1 of the instances of the OrderedGroupingFactor class in the orderedGroupings attribute of the analysis. In general, grouping factors are defined with reference to a single grouping variable. However, multiple separate grouping factors may be referenced in the specification of an analysis, so it is possible to define analyses of subjects and data records that are grouped by multiple factors.
For example, to define an analysis of the occurrence of adverse events by treatment, system organ class and preferred term, 3 separate grouping factors would be defined (1 for treatment, 1 for system organ class, and 1 for preferred term). All 3 grouping factors would then be used in the definition of the analysis by including each of their identifier values in the groupingId attribute of 1 of 3 instances of the OrderedGroupingFactor class in the orderedGroupings attribute of the analysis definition.
Refer to the Analysis section for more information about the specification of analyses.

  • No labels