
...

Tabulation Datasets

Observations about tobacco products and study subjects, generated over the course of a study for a submission, are represented in a series of datasets, each aligned with a logical grouping of data called a domain. Each domain described in this guide is generally implemented as a single dataset that represents the data in scope for that domain. All datasets are structured as flat files, with rows representing observations and columns representing variables. In some cases, a dataset implemented for a domain may be split into physically separate datasets to support submission, when needed and as allowed by the regulatory authority.

...


Num

Guidance

Implementation

1

Dataset Content

Data represented in tabulation datasets will include the following per regulatory requirements and standards in this guide:

Data as originally collected or received.
Data from relevant external references (such as a protocol).
Assigned data.
Derived data.

2

Dataset Naming

Each domain dataset is distinguished by a unique, 2-character code that should be used consistently throughout the submission. This code, which is stored in the SDTM variable named DOMAIN, is used in 4 ways: as the dataset name, as the value of the DOMAIN variable in that dataset, as a prefix for most variable names in that dataset, and as a value in the RDOMAIN variable in relationship tables (see Section 8, Representing Relationships and Data).

3

Splitting Datasets

...
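The naming rule in item 2 (Dataset Naming) can be illustrated with a minimal Python sketch. It assumes a dataset is held in memory as a list of rows, each a mapping from variable name to value, which mirrors the flat structure of rows as observations and columns as variables. The domain code XX, the variables XXSEQ and XXTERM, all data values, and the three helper functions are hypothetical and are not defined by this guide. Because the guide says the code prefixes most, not all, variable names, the sketch only lists unprefixed variables for review and does not treat them as errors.

```python
from typing import Dict, List

Row = Dict[str, str]  # one observation: variable name -> value


def naming_problems(dataset_name: str, rows: List[Row]) -> List[str]:
    """Check that the dataset name is a 2-character code and matches DOMAIN in every row."""
    code = dataset_name.upper()
    problems: List[str] = []
    if len(code) != 2:
        problems.append(f"dataset name {dataset_name!r} is not a 2-character code")
    for index, row in enumerate(rows):
        if row.get("DOMAIN") != code:
            problems.append(
                f"row {index}: DOMAIN is {row.get('DOMAIN')!r}, expected {code!r}"
            )
    return problems


def unprefixed_variables(code: str, rows: List[Row]) -> List[str]:
    """List variables that do not start with the domain code (for review only)."""
    names = set()
    for row in rows:
        names.update(row)
    return sorted(name for name in names if not name.startswith(code))


def unknown_rdomains(relationship_rows: List[Row], known_codes: List[str]) -> List[str]:
    """List RDOMAIN values that do not match any known domain code."""
    return sorted({row["RDOMAIN"] for row in relationship_rows} - set(known_codes))


# Hypothetical dataset for a placeholder domain "XX".
xx_rows = [
    {"STUDYID": "S1", "DOMAIN": "XX", "USUBJID": "S1-001", "XXSEQ": "1", "XXTERM": "example"},
    {"STUDYID": "S1", "DOMAIN": "XX", "USUBJID": "S1-002", "XXSEQ": "1", "XXTERM": "example"},
]

print(naming_problems("xx", xx_rows))        # []
print(unprefixed_variables("XX", xx_rows))   # ['DOMAIN', 'STUDYID', 'USUBJID']
print(unknown_rdomains([{"RDOMAIN": "XX"}, {"RDOMAIN": "YY"}], ["XX"]))  # ['YY']
```

The three helpers correspond to the four uses of the code: dataset name and DOMAIN value are checked together, the variable prefix is reported separately, and RDOMAIN values are compared against the set of domain codes present in the submission.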
