The following examples illustrate how the list of ingredients of a tobacco product would be represented and quantified using three datasets:
Tobacco Ingredients (IT) is used for representing the tobacco ingredients and their attributes within the tobacco product under study.
Non-tobacco Ingredients (IN) is used for representing other ingredients/additives and their attributes as used in the tobacco product under study.
Ingredients Quantified (IQ) is used for representing the quantity of each ingredient (both tobacco and non-tobacco) by component of the tobacco product in which they are included. These values represent the design/formulation specification of the product, not experimentally determined values (as the latter would be represented in the Product Testing (PT) dataset, not shown here).
All three datasets make use of the SPTOBID variable to identify the tobacco product under study, and the STRFID (Substance Reference Identifier) variable to represent the ingredient name. STRFID must be assigned by study sponsors to be unique with regard to the attributes that describe it. If, for instance, the same ingredient varies on one or more attribute (e.g., two different manufacturers supplied the same ingredient, each with different purities, that is used in two different components of the product), STRFID must be unique for each use to reflect this difference, even if they share the same common name, and/or the same IUPAC name and CAS number. See IT example which illustrates this below.
The IT example below shows tobacco filler and the 4 types of tobacco of which it is comprised listed as tobacco ingredients (STRFID) for the Tobacco Product CIG01A (SPTOBID). The variable ITSPECIF shows the specification (variety) of the first three tobaccos. ITCAT is used to indicate that Reconstituted Tobacco is a complex ingredient. (it has two more components) ITCURMTH indicates the cure method. ITCPISP is used to indicate whether complex purchased ingredients were made to the applicant's specification. TMRF indicates the Tobacco Master Reference File.Note that there are two varieties of burley tobacco in this tobacco product, thus each is given a unique STRFID value (Burley Tobacco and Burley Tobacco 2). Note that following the team discussion on 12/13/22, we have added tobacco filler to the IT dataset below, so that all values of STRFID in the IQ dataset have a match in IT (just as the top-level non-tobacco ingredients "FSC Paper B" and "Cellulose Filter A" are listed in the IN dataset). This allows for validation checks that every ingredient listed in the IT and IN datasets is quantified in the IQ data and vice versa. See note 4 in the comments section at the bottom of the page for more info
it.xpt
it.xpt
Row
DOMAIN
STUDYID
SPTOBID
STRFID
ITSPECIF
ITCAT
ITCURMTH
ITCPISP
TMRF
1
IT
TOB07
CIG01A
Tobacco Filler
2
IT
TOB07
CIG01A
Burley Tobacco
NC 7LC
Air
MF5555555
3
IT
TOB07
CIG01A
Bright Tobacco
CC 1063
Flue
4
IT
TOB07
CIG01A
Oriental Tobacco
USDA Nicotiana Collection PI 552747
Sun
5
IT
TOB07
CIG01A
Reconstituted Tobacco
COMPLEX INGREDIENT
N
6
IT
TOB07
CIG01A
Burley Tobacco 2
KT215LC
Air
$warningHtml
The IN dataset example below shows the list of non-tobacco ingredients used in the tobacco product identified as CIG01A. Each unique ingredient is listed in STRFID. Where applicable, IUPAC name and CAS number are listed (IUPACNAM and CASNO, respectively). INCAT is used to indicate whether an ingredient is a single chemical substance or a complex purchased ingredient. INSPFL indicates whether complex purchased ingredients were made to the applicant's specification. Grade, purity, manufacturer name and ID number, and the Tobacco Product Master File number complete the dataset.
Finally, the IQ domain is used to represent the quantity (target, minimum and maximum) of each ingredient identified in STRFID that the product (SPTOBID) is designed to contain, with the units for these values represented in IQUNIT. The function of each ingredient is described in IQFUNCT. Ingredients that appear in more than one component of the tobacco product are quantified separately for each component. IQPARENT and IQLEVEL are used to identify the component/ingredient of which the ingredient in a record is a constituent component, and and at which level in the overall hierarchy the constituent ingredient appears. The following implementation rules apply:
IQPARENT must correspond to a value of STRFID in the same dataset. If parent is null, the ingredient is considered a top-level component and is assigned a value of "1" in IQLEVEL.
Records describing ingredients that are constituent ingredients of another ingredient/component listed in the dataset will have IQPARENT value equal to the STRFID of that ingredient/component. IQLEVEL will be set to n +1, where n=the parent record's value of IQLEVEL.
Every value of STRFID used in the IQ dataset must have a matching value of STRFID:
In the IT dataset (when IQCAT=TOBACCO INGREDIENT) or,
In the IN dataset (when IQCAT=NON-TOBACCO INGREDIENT)
Therefore, IQCAT is required to relate ingredient quantity records in IQ to the associated ingredient attributes described in either IT or IN.
The use of IQLPARENT and IQLEVEL help make explicit the relationship between complex ingredients and their constituents, and can assist when totaling the quantities of an ingredient that appears in multiple components of a tobacco product. The concept map below illustrates the representation of the hierarchical relationships between a complex ingredient and its constituent ingredients in the IQ domain structure.
Error rendering macro 'excerpt-include'
No link could be created for 'Concept Map. Relationships of Complex Ingredients in IQ'.
The IQ dataset example illustrates the principles discussed above using a cigarette tobacco product:
Rows 1, 13, 16:
Show three top-level components of a cigarette product: tobacco filler, filter, and cigarette paper, respectively. Their target, minimum and maximum quantities expressed in units of mg/cigarette are shown in IQTARG, IQMIN, IQMAX and IQUNIT. Because there is no parent component other than the finished product itself, IQPARENT is null, and IQLEVEL = 1, indicating that these are top-level ingredients/components of the finished product.
Rows 2-5:
Show the quantities of four types of tobacco used in the product. IQPARENT=Tobacco Filler and IQLEVEL=2, together indicating that these are constituents of the tobacco filler, a top level-component of the cigarette product.
Rows 6-8:
Show the three constituent tobacco ingredients that comprise reconstituted tobacco in row 5 (IQPARENT=Reconstituted Tobacco). The quantities of these tobaccos pertain to the quantity at which they appear in the reconstituted tobacco ONLY. Therefore, the total quantity of the individual tobaccos used in this tobacco product is the sum of these three plus the sum of the quantities shown in rows 1-3. IQLEVEL=3 to show that these ingredients are at the 3rd level of the hierarchy (constituents of an ingredient with IQLEVEL=2).
Rows 9-10:
Show the quantities of two non-tobacco ingredients, Methylcellulose and 1,3-Butanediol, which are further constituent ingredients of Reconstituted Tobacco as indicated by IQPARENT. IQFUNCT shows that Methylcellulose functions as a binder and 1,3-Butanediol functions as a humectant. IQLEVEL=3 to show that these ingredients are at the 3rd level of the hierarchy.
Rows 11-12:
Show the quantities of two constituent ingredients that both function as a flavorant in the tobacco filler (as indicated by IQFUNCT= Flavor and IQPARENT=Tobacco Filler).
Rows 14-15:
Show the quantities of two constituent ingredients that function as filtration and plasticizer, respectively, in the filter portion of the cigarette (Cellulose Filter A, a top-level component shown in row 13).
Rows 17-22:
Show the quantities and functions of six constituent ingredients of the top-level component FSC Paper B (the cigarette paper shown in row 16).