Composition of Foods
Raw, Processed, Prepared

USDA Nutrient Database for Standard Reference, Release 14

Table of Contents

Introduction
Specific Changes
Data Files
Reports
File Content

   Food Description File
   Refuse
   Nutrients
      Proximates
      Minerals
      Vitamins
      Lipid Components
      Amino Acids
   Weights and Measures
   Footnotes
   References
Explanation of File Format
   Relational File
      Food Description File
      Food Group Description File
      Nutrient Data File
      Nutrient Definition File
      Source Code File
      Derivation Code File
      Gram weight file
      Footnote file
      Reference file
   Abbreviated
   Update Files
References 
Appendix A. Abbreviations Used in Short Descriptions
Appendix B. Other Abbreviations

U.S. Department of Agriculture, Agricultural Research Service, USDA Nutrient Data Laboratory. 2001. USDA Nutrient Database for Standard Reference, Release 14.

USDA Nutrient Data Laboratory home page, http://www.nal.usda.gov/fnic/foodcomp

Mention of trade names, commercial products, or companies in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture over others not mentioned.

The United States Department of Agriculture (USDA) prohibits discrimination in all its programs and activities on the basis of race, color, national origin, gender, religion, age, disability, political beliefs, sexual orientation, and marital or familial status. (Not all prohibited bases apply to all programs.) Persons with disabilities who require alternative means for communication of program information (Braille, large print, audiotape, etc.) should contact USDA's TARGET Center at 202-720-2600 (voice and TDD).

To file a complaint, write USDA, Director, Office of Civil Rights, Room 326-W, Whitten Building, 14th and Independence Avenue, S.W., Washington, DC 20250-9410, or call 202- 720-5964 (voice and TDD). USDA is an equal opportunity provider and employer.

Issued July 2001

Introduction

The USDA Nutrient Database for Standard Reference (SR) is the major source of food composition data in the United States. It provides the foundation for most food composition databases in the public and private sectors. As information is updated, new versions of the database are released. This version, Release 14 (SR14), contains data on 6,039 food items and up to 113 food components. It replaces SR13, issued in February 2000.

This is the first release from NDL's new Nutrient Databank System (NDBS). We have changed some formats and have added fields to improve the descriptive information for food items and the statistical information about the nutrient values. While data in previous releases have been migrated into the new NDBS, they have not been processed through the complete system. Consequently, many of these new fields contain data for only a few items and it will take a number of years before they are populated for a majority of food items in the database.

SR14 includes data for all the food groups and nutrients published in the 21 volumes of Agriculture Handbook No. 8 (U.S. Department of Agriculture 1976-88), and its four supplements (U.S. Department of Agriculture 1990-93), which, in turn superseded the 1963 edition (Watt and Merrill 1963). Since 1992, updated data have been published electronically on NDL's Web site. This SR14 release supersedes all previous releases including the printed version in the event of any differences.

Data were compiled from published and unpublished sources. Published sources include the scientific and technical literature. Unpublished data are from the food industry, other government agencies, and research conducted under contracts initiated by the Agricultural Research Service (ARS). Values in the database may be based on the results of laboratory analyses or calculated by the use of appropriate factors or recipes, which is indicated by the source code in the Nutrient Data File. Every food item may not contain a complete nutrient profile.

Specific Changes

Several changes were made to the database since the last release.

Data Files

The data files for SR14 are presented in ASCII format. An abbreviated file, with fewer nutrients, is also provided. A description of each field in these files and the relationships between each is provided below. We have also made available a MS Access 2000 database. This database contains all the SR14 files and relationships, with a few sample queries and reports. The abbreviated file in MS Excel 2000 is also available.

Reports

The data in SR14 are also available as reports in two different presentations. The first is for all items in SR14 as page images, containing all the data for each food. These data are separated into files according to food groups. The second is for selected foods and nutrients in SR14. These reports are sorted either by food description or in descending order by nutrient content in terms of common household measures. The food items and weights in these reports are adapted from those in the upcoming revision of Home and Garden Bulletin No. 72, Nutritive Values of Foods

The Adobe Acrobat viewer is needed to see these files. These is a link to the internet site where it can be downloaded for free.

File Content

As mentioned, the database consists of several separate data files. Details about the information in each is provided in the sections below. More extensive details on many specific foods are available in the printed Agriculture Handbook No. 8 sections (U.S. Department of Agriculture 1976-88).

The four principal files are the Food Description File, Nutrient Data File, Gram Weight File, and Footnote File. The five support files are the Nutrient Definition File, Food Group Description File, Source Code File, Derivation Code File, and References File.

Food Description File

This file includes descriptive information about the food items. Food descriptions for brand name items are in upper case. A full description and a short description (containing abbreviations) are provided. Abbreviations used in creating short descriptions are given in appendix A. In creating the short description, the first word in the long description was not abbreviated. Also, if the long description was 25 characters or less, the short description contains no abbreviations. Abbreviations used elsewhere in the tables are given in appendix B. Scientific names, amounts of refuse, and refuse description are provided where appropriate. The factors used to calculate protein from nitrogen are included, as well as those used to calculate calories. There are no factors for items prepared using the recipe program of the Nutrient Data Bank System or for items where protein and calories are calculated by the manufacturer. Three new fields have been added to this file:

Refuse

The refuse and refuse description fields contain amounts and descriptions of inedible material (for example, seeds, bone, skin) for applicable foods. These amounts are expressed as a percentage of the total weight of the item as purchased, and they were used to compute the weight of the edible portion. Refuse data were obtained from USDA sponsored contracts and Agriculture Handbook Nos. 102 and 456 (Adams 1975, Matthews and Garrison 1975). To calculate "Amount in edible portion of 1 pound as purchased" use the following formula:

Y = V*4.536*[(100-R)/100]

where

Y = nutrient value per 1 pound as purchased
V = nutrient value per 100 g (Nutr_Val in the Nutrient Data File), and
R = percent refuse (Refuse in the Food Description File).

For meat cuts containing bone and connective tissue, the amount of connective tissue is included in the value given for bone. Separable fat is not shown as refuse if the meat is described as separable lean and fat. Separable fat generally refers to seam fat and intramuscular fat. Separable lean refers to muscle tissue that can be readily separated from the intact cut; it includes any fat striations (marbling) within the muscle. For boneless cuts, the refuse values apply to connective tissue or connective tissue plus separable fat. The percentage yield of cooked, edible meat from 1 pound of raw meat with refuse can be determined from the following formula:

Y = (Wc / 453.6)*100

where

Wc = weight of cooked, edible meat.

Nutrients

Nutrient values per 100 g are contained in the Nutrient File, along with fields to further describe the mean value. Nutrient values have been rounded to a specified number of decimal places for each nutrient. Number of decimal places for each nutrient are listed in the Nutrient Definition File (p.20). With the implementation of the new Nutrient Databank System, we have added a number of statistical attributes to better describe the mean. These include:

A few additional fields were added to expand the information on how the values are generated:

For more details on the Nutrient Data File, see the discussion under Explanation of File Formats 

Nutrient values represent the total amount of the nutrient present in the edible portion of the food, including any nutrients added in processing. The values do not necessarily represent the nutrient amounts available to the body. Table 1 gives an idea of the comprehensiveness of the database by listing for each nutrient the number of items that contain data.

Table 1. Number of foods in database (n=6,039) containing selected nutrients

Nutrient Number of foods Nutrient Number of foods
Protein 6039 Vitamin D 157
Total lipid (fat) 6039 Vitamin E, ATE 3301
Water 6039 Alpha-tocopherol 386
Carbohydrate, by difference 6039 Beta-tocopherol 21
Total dietary fiber 5362 Gamma-tocopherol 28
Starch 21 Delta-tocopherol 22
Total sugar 722 Ascorbic acid 5805
Sucrose 63 Thiamin 5656
Glucose 79 Riboflavin 5664
Fructose 79 Niacin 5657
Lactose 58 Pantothenic acid 5284
Galactose 6 Vitamin B6 5510
Ash 6016 Folate, total 5481
Calcium 5923 Folic acid 5282
Iron 5940 Food folate 5282
Magnesium 5628 Folate, DFE 5282
Phosphorus 5677 Vitamin B12 5539
Potassium 5765 Cholesterol 5931
Sodium 6034 Total saturated fatty acids 5849
Zinc 5608 Total monounsaturated fatty acids 5644
Copper 5536 Total polyunsaturated fatty acids 5651
Manganese 5049 Phytosterols 610
Selenium 4988 -Sitosterol 30
Vitamin A (IU) 5868 Stigmasterol 30
Vitamin A (RE) 5404 Campesterol 29

When nutrient data for prepared or cooked products were unavailable or incomplete, nutrient values were calculated from comparable raw items or by recipe. When values are calculated in a recipe or from the raw item appropriate nutrient retention and yield factors are applied (U.S. Department of Agriculture 1994). To obtain the content of nutrient per 100 g cooked food, the nutrient content per 100 g of raw food is multiplied by the nutrient retention factor and, when appropriate, yield factors.

Vc = (Vr*RF)/Yc

where

Vc = nutrient content of cooked food,
Vr = nutrient content of raw food,
RF = retention factor, and
Yc = yield of cooked food.


Nutrient retention factors are based on data from USDA research contracts, recent research reported in the literature, and USDA publications. Most retention factors were calculated by the True Retention Method (%TR) (Murphy et al. 1975). This method, as shown below, accounts for the loss of solids from foods that occurs during preparation and cooking.

%TR = (Nc × Gc) / (Nr × Gr) × 100

where

Nc = nutrient content per g of cooked food,
Gc = g of cooked food,
Nr = nutrient content per g of raw food, and
Gr = g of food before cooking.

In general, levels of fortified nutrients are the values calculated by the manufacturer or Nutrient Data Laboratory food specialists, based on the Nutrition Labeling and Education Act label declaration of %Daily Value (DV) (CFR, Title 21, Pts. 100-169). Such values represent the minimum nutrient level one can expect in the product. If analytical values were available to estimate levels of added nutrients, a number is present in the sample count field for these nutrients.

Proximates. Proximate components include water (moisture), protein, total lipid (fat), total carbohydrate, and ash. The values for protein were calculated from the level of total nitrogen (N) in the food, using the conversion factors recommended by Jones (1941). The specific factor applied to each food item is provided in the N_Factor field in the Food Description File. The general factor of 6.25 is used to calculate protein in items that do not have a specific factor. There is no factor for items prepared using the recipe program of the Nutrient Data Bank System or for items where protein is calculated by the manufacturer.

Protein values for chocolate, cocoa products, coffee, mushrooms, and yeast were adjusted for nonprotein nitrogenous material. The adjusted protein conversion factors used to calculate protein for these items are as follows:

chocolate and cocoa    4.74
coffee    5.3
mushrooms    4.38
yeast    5.7

When these items were ingredients, only their protein nitrogen content was used to determine their contribution to the protein and amino acid content of the food. Protein calculated from total nitrogen, which may contain nonprotein nitrogen, was used in determining carbohydrate by difference. This unadjusted protein value is not given in the Nutrient Data File for SR14; rather, it is given as a footnote in prior printed sections of Agriculture Handbook No. 8.

For soybeans, nitrogen values were multiplied by a factor of 5.71 (Jones 1941) to calculate protein. The soybean industry, however, uses 6.25 to calculate protein. The protein content of soy flours, soy meals, soy protein concentrates, and soy protein isolates is expressed both ways. The item calculated using the 6.25 factor is identified as "crude protein basis."

Total lipid content of most foods was determined by gravimetric methods, including extraction methods such as those which employ ether or a mixed solvent system of chloroform and methanol, or by acid hydrolysis.

Carbohydrate, when present, was determined as the difference between 100 and the sum of the percentages of water, protein, total lipid (fat), ash, and, when present, alcohol. Total carbohydrate values include total dietary fiber. Total dietary fiber content was determined by the following enzymatic-gravimetric methods: 985.29 and 991.43 of the Association of Official Analytical Chemists (AOAC, 1995). Total sugars were determined using AOAC methods (1995), either high-performance liquid chromatography (HPLC) or gas-liquid chromatography (GLC), and are the sum of individual monosaccharides (i.e., galactose, glucose, and fructose) and disaccharides (i.e., sucrose, lactose, and maltose). Data for total sugars are available primarily for formulated foods, but we anticipate that values for other foods will likely be added in future releases. Starch was analyzed by AOAC method 966.11 (1996).

Food energy is expressed in kilocalories (kcal) and kilojoules (kJ). One kcal equals 4.184 kJ. The data are for physiological energy, which is the energy value remaining after losses from digestion and metabolism are deducted from gross energy. Calorie values, with the exception of formulated foods, are based on the Atwater system for determining energy values. Derivation of the Atwater calorie factors is outlined in Agriculture Handbook No. 74 (Merrill and Watt 1973). For formulated foods, calorie values (source codes 8 or 9; for more information on source codes, see page 21) generally reflect industry practices (as permitted by the Nutrition Labeling and Education Act) of calculating calories from 4-4-9 kcal/g of protein, carbohydrate, and fat, respectively, or from 4-4-9 kcal/g of protein, carbohydrate minus insoluble fiber, and fat. The latter method is frequently used for high-fiber foods.

Calorie factors for protein, fat, and carbohydrates are included in the Food Description File. For foods containing alcohol, a factor of 6.93 was used to calculate calories per gram of alcohol. No calorie factors are presented for items prepared using the recipe program of the Nutrient Data Bank System. Instead, total calories for these items equal the sums of the calories contributed by each ingredient after adjustment for changes in yield, as appropriate. For formulated foods, if the calories calculated by the manufacturer are reported, no calorie factors are presented.

Calorie factors for fructose and sorbitol, not available in the Atwater system, were derived from the work of Livesay and Marinos (1988). Calorie factors for coffee and tea were estimated from seeds and vegetables, respectively.

Minerals. Minerals included in the database are: calcium, iron, magnesium, phosphorus, potassium, sodium, zinc, copper, manganese, and selenium. Levels of minerals for most foods were determined by methods of the Association of Official Analytical Chemists (1995). Phosphorus was determined colorimetrically. Sodium and potassium were usually determined by flame photometry. Calcium, iron, magnesium, zinc, copper, and manganese were determined by atomic absorption and inductively coupled plasma emission spectrophotometry (ICP).

Much of the analytical data for selenium in foods were published earlier (USDA 1992) and were determined by the modified selenium hydride and fluorometric methods. The selenium content of plants, in particular cereal grains, is strongly influenced by the quantity of biologically available selenium in the soil in which they grow, that is, by their geographical origin (Kubota and Allaway 1972). The selenium content of fruits and vegetables is normally very low. While the soil affects the selenium content of fruits and vegetables, it does not significantly increase the amount of selenium in them. The values given are national averages and should be used with caution when considering levels of selenium in locally grown foods.

Vitamins. Vitamins included in the database are: ascorbic acid (vitamin C), thiamin, riboflavin, niacin, pantothenic acid, vitamin B6, folate, vitamin B12, vitamin A, vitamin E, and vitamin D.

Ascorbic acid. In the current database system, all data for ascorbic acid are listed under nutrient number 401 (total ascorbic acid), although reduced ascorbic acid content is reported for many food groups, especially those which are major nutritional contributors of ascorbic acid such as fruits and vegetables. Total ascorbic acid was reported for food groups 1 (Dairy and Eggs), 2 (Spices and Herbs), 4 ( Fats and Oils), 12 (Nut and Seeds), and 17 (Lamb, Veal, and Game). Food group 10 (Pork and Pork Products) contains a mixture of total and reduced forms, which are reported under nutrient number 401. Reduced ascorbic acid was determined by the dichloroindophenol method, and total ascorbic acid by the fluorometric method.

Thiamin, Riboflavin and Niacin. Thiamin was determined chemically by the thiochrome procedure or by microbiological methods. Fluorometric or microbiological methods were used to measure riboflavin. The values for niacin are for preformed niacin only and do not include the niacin contributed by tryptophan, a niacin precursor. The term "niacin equivalent" applies to the potential niacin value, that is, to the sum of the preformed niacin and the amount that could be derived from tryptophan. In estimating the amounts of niacin available from foods, the mean value of 60 mg tryptophan is considered equivalent to 1 mg niacin (National Academy of Sciences 1989).

Pantothenic acid, Vitamins B6 and B12. Pantothenic acid was determined microbiologically. Vitamins B6 and B12 were determined by microbiological or chromatographic methods. Vitamin B12 is found in foods of animal origin or those containing some ingredient of animal origin; for example, cake that contains eggs or milk. For foods that contain only plant products, the value for vitamin B12 is assumed to be 0. Some reports contain values for vitamin B12 in certain fermented foods (beer, soy sauce, and miso). It is believed that this B12 is synthesized not by the microorganisms responsible for the fermentation of the food but, rather, by other contaminating microorganisms. Therefore, one should not consider these foods to be a consistent source of vitamin B12 (Liem et al. 1977).

Folate. In this release, in addition to a total folate value that has been reported in the past, we are reporting values for folic acid, food folate, and total folate reported as g of DFE.

This changes responds to new Dietary Reference Intakes (DRIs) for folate issued by the national Academy of Sciences, Institute of Medicine, (NAS-IOM, 1998). Recommended Dietary Allowances for folate are expressed in dietary folate equivalents (DFE). DFE take into account the greater bioavailability of synthetic folic acid compared to naturally occurring food folate.

To calculate DFE it is necessary to have separate values for naturally occurring food folate and added synthetic folic acid.

µg DFE = µg food folate + (1.7 X µg folic acid)

In 1998 (SR release 12), the folate values in the database were updated to reflect regulations requiring the addition of folic acid to enriched cereal grain products subject to standards of identity (CFR, Title 21, Pts. 136, 137). These products include flour, cornmeal and grits, farina, rice, macaroni, noodles, bread, rolls, and buns. Folic acid may continue to be added (with some restrictions on amounts) to breakfast cereals, infant formulas, medical foods, food for special dietary use, and meal replacement products. For the most part, values for this database were calculated based on enrichment levels specified in the regulations, since analytical values were not yet available. For those foods where the enrichment level is given as a range, the midpoint was used to set the value. Food items containing any of these enriched products as ingredients, such as baked products made with enriched flour, were also updated.

In enriched and fortified foods, total folate includes both food folate and added folic acid. The total folate values found in SR11-1 preceded the implementation of the new enrichment standards and represent the amount of folate naturally occurring in foods. For this release we calculated folic acid by subtracting the food folate value in SR11-1 from the total folate content we estimated in fortified foods for SR12.

For unenriched foods, the total folate value is food folate. Therefore the value for total folate with number of data points and standard error, if present, was also used for food folate. The folic acid value was assumed to be zero.

Enriched ready-to-eat (RTE) cereals have generally included folic acid fortification for over 25 years. Therefore, food folate values were not readily available for these products. Food folate was estimated by means of the databank formulation program for a variety of high consumption cereals. Mean folate values were calculated for categories of RTE cereals based on grain content. Added folic acid was then calculated by subtracting estimated food folate from the total folate content reported in SR13.

Most analytical values shown for folate were determined by the use of conjugase and Lactobacillus casei. Beecher and Matthews (1990) reported that the methodology used in determining folate values needed improvement, particularly, in the areas of extraction procedures and applications to specific foods. Research on determining the folate content of high-protein and high-carbohydrate foods indicates that additional improvements in methodology are needed (Martin et al. 1990). Limited amounts of data generated by USDA were obtained by the tri-enzyme extraction using additional enzymes, amylase and protease, to release bound forms of folate. Microbiological methods measure total folate; for enriched foods, folic acid and food folate are not distinguished from each other.

Vitamin A. The data for vitamin A include chemically determined preformed vitamin A and provitamin-A carotenoids as determined by methods of the Association of Official Analytical Chemists. Total vitamin A activity is expressed in international units (IUs) and retinol equivalents (REs). One IU is equivalent to 0.3 mcg retinol, 0.6 mcg beta carotene, or 1.2 mcg other provitamin-A carotenoids. One RE is equivalent to 1 mcg retinol, 6 mcg beta carotene, or 12 mcg other provitamin-A carotenoids. One RE is equal to 3.33 IU retinol or 10 IU beta carotene (National Academy of Sciences 1989).

In 2001 the Institute of Medicine, National Academy of Sciences issued new DRIs for vitamin A (NAS-IOM, 2001). Along with the new DRI's they recommended changing the factors used for calculating vitamin A activity from the individual carotenoids and developed a new unit for expressing vitamin A activity-Retinol Activity Equivalent (RAE). Vitamin A will be calculated in RAE in the future. Data on individual carotenoids are presented in a separate table (USDA-NCC, 1998) available on the NDL Web site. Plans are underway to add individual carotenoids to this database in future releases.

Vitamin E. Vitamin E was determined by GLC or HPLC. Total vitamin E activity is reported as mg alpha-tocopherol equivalents and was calculated from the amounts and relative activities of the various tocopherols and tocotrienols. In this release, data are also presented on the individual tocopherols when available. In the future we will be adding more data for alpha-tocopherol and dropping alpha-tocopherol equivalents to be in line with the new DRIs for vitamin E (NAS-IOM, 2000).

Lipid Components. Fatty acids are expressed as the actual quantity of fatty acid in g/100 food and do not represent fatty acids as triglycerides. Historically, most fatty acid data reported in USDA Nutrient Databases were obtained as the percentage of fatty acid methyl esters and were determined by gas-liquid chromatographic analyses. These data were converted to g fatty acid per 100 g total lipid using lipid conversion factors and then to g fatty acid per 100 g edible portion of food using the total lipid content. Details of the derivation of lipid conversion factors were published by Weihrauch et al. 1977. In the redesigned NDBS, fatty acid data may be imported in a variety of units and converted within the system. No conversions are required if data are received as g fatty acid/100 g edible portion of food. Data received as fatty acid esters and triglycerides are converted to fatty acids using Sheppard factors. Sheppard conversion factors are based on the differential molecular weights of the specific fatty acid and its corresponding esters (butyl or methyl) and triglyceride (Sheppard 1992). When fatty acid data are received as percentages of fatty acid methyl esters, methyl esters are converted to fatty acids using Sheppard factors and then multiplied by total lipid (nutrient 204) to give g fatty acid per 100 g edible portion of food. Occasionally total fat (nutrient 204) values are available from a variety of data sources, but individual fatty acids are available from fewer references. In those cases, it may be necessary to normalize the individual fatty acids to the mean fat value of the food item. In the case of normalized fatty acids, the sum of the individual fatty acids will equal the mean fat value multiplied by the Weihrauch lipid conversion factor for that food item. No statistics of variability are reported for normalized fatty acids.

The basic format for describing individual fatty acids is that the number before the colon indicates the number of carbon atoms in the fatty acid chain; the number after the colon indicates the number of double bonds. For unsaturated fatty acids, additional nutrient numbers have been added to accommodate the reporting of many specific positional and geometric isomers. Of the specific isomers, there are two basic classifications considered: omega double bond position and cis/trans configuration of double bonds.

Omega-3 and omega-6 isomers are denoted in shorthand nomenclature as n-3 and n-6. The n- number indicates the position of the first double bond from the methyl end of the carbon chain. The letter c or t indicates whether the bond is cis or trans. For polyunsaturated fatty acids, cis and trans configurations at successive double bonds may be indicated. For example, linoleic acid is an 18 carbon omega-6 fatty acid with 2 double bonds, both in cis configuration. When data are isomer specific, linoleic acid is described as 18:2 n-6 c,c. Other isomers of 18:2 for which new nutrient numbers have been assigned include 18:2 c,t; 18:2 t,c; and 18:2 t,t; 18:2 t not further defined; and 18:2 i. 18:2 i is not a single isomer, but includes isomers other than 18:2 n-6 c,c with peaks which cannot easily be differentiated in the particular food item. Systematic and common names for fatty acids are given in Table 2.

Fatty acid totals: Only a small portion of the fatty acid data received for release in SR14 contains specific positional and geometric isomers. Therefore, it has been necessary to maintain the usual nutrient numbers corresponding to fatty acids with no further differentiation than carbon length and number of double bonds. To aid users of our data, specific isomers are always summed to provide a total value for the undifferentiated fatty acid. Thus, mean values for the specific isomers of 18:2 would be summed to provide a mean for 18:2 undifferentiated (nutrient number 618). Other fatty acid totals provided in SR are: 1) the sum of saturated, monounsaturated and polyunsaturated fatty acids and 2) sum of trans-monoenoic, the sum of trans-polyenoic, and the sum of all trans fatty acids.

Values for total saturated, monounsaturated, and polyunsaturated fatty acids may include individual fatty acids not reported in SR; therefore, the sum of their values may exceed the sum of the individual fatty acids listed. In rare cases, the sum of the individual fatty acids may exceed the sum of the values given for the total saturated fatty acids (SFA), monounsaturated fatty acids (MUFA), and polyunsaturated fatty acids (PUFA). These differences are generally caused by rounding and may be relatively small.

For formulated brand name foods, industry data were often available for fatty acid classes (SFA, MUFA, and PUFA) but were lacking for individual fatty acids. In these cases, individual fatty acids were calculated from the fatty acids of the individual listed ingredients and normalized to the total fat level. A best-fit approximation was made to fatty acid classes but, unavoidably, calculated sums of individual fatty acid totals did not always match industry data for fatty acid classes. Zero values for individual fatty acids should be understood to mean that trace amounts may be present. When g fatty acids per 100 g total lipid were converted to g fatty acids per 100 g food, values of less than 0.0005 were rounded to 0.

Table 2 is provided for the convenience of users in attaching common names or systematic names to fatty acids in our database. Although individual fatty acids are more specific than in past releases, it is not possible to include every possible geometric and positional isomer. Where specific isomers exist for a fatty acid, the common name of the most typical isomer is listed for the undifferentiated fatty acid and an asterisk (*) designates the specific isomer by that name. Thus, the most typical isomer for 18:1 is oleic. The specific isomer by that name is 18:1 c, is designated in Table 2 as oleic *.

Table 2. Systematic and common names for fatty acids


Fatty acid
Systematic name Common Name of 
most typical Isomer
Nutrient
Number
Saturated fatty acids
4:0    butanoic    butyric    607
6:0    hexanoic    caproic    608
8:0    octanoic    caprylic    609
10:0    decanoic    capric    610
12:0    dodecanoic    lauric    611
13:0    tridecanoic        696
14:0    tetradecanoic    myristic    612
15:0    pentadecanoic        652
16:0    hexadecanoic    palmitic    613
17:0    heptadecanoic    margaric    653
18:0    octadecanoic    stearic    614
19:0    nonadecanoic        686
20:0    eicosanoic    arachidic    615
22:0    docosanoic    behenic    624
24:0    tetracosanoic    lignoceric    654
Monounsaturated fatty acids
14:1    tetradecenoic    myristoleic    625
15:1    pentadecenoic        697
16:1 undifferentiated    hexadecenoic    palmitoleic    626
16:1 c            673*
16:1 t            662
17:1     heptadecenoic        687
18:1 undifferentiated    octadecenoic    oleic    617
18:1 c            674*
18:1 t            663
20:1    eicosenoic    gadoleic    628
22:1  undifferentiated    docosenoic    erucic    630
22:1 c            676*
22:1 t            664
24:1 c     cis-tetracosenoic    nervonic    671
Polyunsaturated fatty acids
16:2 undifferentiated    hexadecadienoic         688
18:2 undifferentiated    octadecadienoic     linoleic    618
18:2 n-6 c,c            675*
18:2 c,t            668
18:2 t,c            667
18:2 t,t            669
18:2 i            666
18:2 t not further defined            665
18:3    octadecatrienoic    linolenic    619
18:3 n-3 c,c,c        alpha-linolenic    851*
18:3 n-6 c,c,c        gamma-linolenic    685
18:4 undifferentiated    octadecatetraenoic    parinaric    627
20:2 n-6 c,c    eicosadienoic        672
20:3 undifferentiated    eicosatrienoic    689
20:3 n-3            852*
20:3 n-6            853
20:4 undifferentiated    eicosatetraenoic    arachidonic    620
20:4 n-3            854
20:4 n-6            855*
20:5 n-3    eicosapentaenoic (EPA)    timnodonic    629
22:2    docosadienoic    brassic    698
22:5 n-3    docosapentaenoic (DPA)    clupanodonic    631
22:6 n-3    docosahexaenoic    (DHA)    621

For some fatty acids listed in the above table, there are no data in this release.

Cholesterol. Cholesterol values were generated primarily by gas-liquid chromatographic procedures. It is assumed that cholesterol is present only in foods of animal origin and foods containing at least one ingredient of animal origin (for example, cake that contains eggs). For mixtures containing ingredients derived from animal products, the cholesterol value may have been calculated from the value for those ingredients. For foods that contain only plant products, the value for cholesterol is assumed to be 0.

Plant sterols. Data on plant sterols (campesterol, stigmasterol, -sitosterol) were obtained by colorimetric or gas-chromatographic procedures and summed to calculate total phytosterols.

Amino Acids. Amino acid data for a class or species of food are aggregated to yield a set of values which serve as the pattern for calculating the amino acid profile of other similar foods in that class or species. The amino acid values for the pattern are expressed on a per-gram-of-nitrogen basis. Data to develop amino acid patterns for simple foods were obtained primarily by ion-exchange chromatography. The amino acid patterns and the total nitrogen content were used to calculate the levels of individual amino acids per 100 g of food, using the following formula:

AAf = (AAn*Vp ) / Nf

where

AAf = amino acid content per 100 g food,
AAn = amino acid content per g nitrogen,
Vp = protein content of food, and
Nf = nitrogen factor.

In the past, the number of data points appeared only on the food item for which the amino acid pattern was developed, not on other foods that used the same pattern. It referred to the number of observations used in developing the amino acid pattern for that food. For foods processed in the new NDBS, the number of observations used in developing an amino acid pattern will only be released with the release of the pattern. The amino acid profiles calculated from these patterns will show the number of data points to be zero.

If amino acid values are presented for an item with more than one protein-containing ingredient, the values may have been calculated on a per-gram-of-nitrogen basis from the amino acid patterns of the various protein-containing ingredients. Then the amino acid contents for an item on the 100-g basis were calculated as the sum of the amino acids in each protein-containing ingredient multiplied by total nitrogen in the item.

Weights and Measures

Information is provided on household measures for food items (for example, 1 cup, 1 tablespoon, 1 fruit, 1 leg). Weights are given for edible material without refuse. The Gram Weight File contains the gram weights and measure descriptions for each food item. This file can be used to calculate nutrient values for food portions from the values provided per 100 g food. The formula used to calculate the nutrient content per household measure is:

N = (V*W)/100

where

N = nutrient value per household measure,
V = nutrient value per 100 g (Nutr_Val in the Nutrient Data File), and
W = g weight of portion (Gm_wt in Weight File).

The Gram Weight file can be used to produce reports showing the household measure and nutrient values calculated for that portion. The weights were derived from published sources, industry files, studies conducted by USDA (Adams 1975, Fulton et al. 1977), and the weights and measures used in USDA's Continuing Survey of Food Intakes by Individuals (USDA 1995). Although special efforts were made to provide representative values, weights and measures obtained from different sources vary considerably for some foods. Starting with this release we have combined the Gram Weight File and the Measure Description File to make this information easier to use.

Footnotes

Footnotes are provided for a few items where information about food description, weights and measures, or nutrient values could not be accommodated in existing fields. Many of the footnotes published in Agriculture Handbook 8 are no longer needed because the information was moved to other fields and tables. For example, details about the measure description, once contained in footnotes, are now part of the measure description in the Gram Weight file. Values for additional nutrients once included in footnotes were given nutrient numbers, when appropriate, and included in the Nutrient Data File. The database also incorporates data that cover enrichment or fortification or cases where nutrient content is affected by plant part or color (yellow and white corn, for example).

References

This is a new file added with this release. It contains the sources of data for the nutrient values and links to an identification number on each nutrient record. Since much of the data in this release was carried forward from SR13, nutrient specific source documentation was not electronically available. However, as new data for these foods are generated and as additional documentation is entered into the new NDBS, data source information will increase in future releases.

Explanation of File Formats

The data are presented in two different organizational formats. One is a relational format of four principal and five support files comprising the database (figure 1). The relational format is complete and contains all food, nutrient, and related data. The other is a flat file, which is an abbreviated file, with fewer nutrients and related information.

Relational Files

The four principal and five support files of the relational database can be linked together in a variety of combinations to produce queries and generate reports. The formats of these files are identified in tables 3-11. Information on the various relationships that can be made among these files is also given. Fields that always contain data and fields that can be left blank or null are identified in the "blank" column; Y indicates a field that is always filled, N indicates a field that may be left blank (null) (see tables 3-11). An asterisk (*) indicates that the field is indexed. Although the files are not indexed, the file descriptions show where indices were used to sort and manage records within the Nutrient Data Bank System. When importing these files into a database management system, if files are to be indexed, it is important to use the indices listed here, particularly with the Nutrient Data File, which uses two.

Graphic showing relationships between files

ASCII.
ASCII files are delimited. All fields are separated by carets (^) and text fields are surrounded by tildes (~). A double caret (^^) appears when a field is null or blank. Format descriptions include the name of each field, its type [N=numeric with width and number of decimals (w.d) and A=alphanumeric], and its maximum length. The actual length in the data files may be less and most likely will change in subsequent releases.

Food Description File (file name = FOOD_DES). The Food Description File (Table 3) contains a long and a short description and food group for 6,039 food items, and when applicable, common names, manufacturer name, scientific name, percentage of refuse, and factors used for calculating protein and calories.

Table 3. Food Description File Format

Field name   

Type   

Blank   

Description

NDB_No   

A 5*   

N   

5-digit Nutrient Data Bank number which uniquely identifies a food item

FdGp_Cd   

A 4   

N   

4-digit code indicating food group to which a food item belongs.

Desc   

A 200   

N   

200 character description of food item

Shrt_Desc   

A 60   

N   

60 character abbreviated description of food item. Generated from the 200 character description using abbreviations in Appendix A. If short description was longer than 60 characters, additional abbreviations were made.

ComName A 100 Y Other names commonly used to describe a food. Ex. Hot dog for frankfurter
ManufacName A 45 Y Manufacturer's name for brand name foods
Survey A 1 Y Item used in the National Food and Nutrition Surveys

Ref_desc   

A 45   

Y   

Description of inedible parts of a food item (refuse), such as, seeds or bone.

Refuse   

N 2.0   

Y   

Percentage of refuse

SciName   

A 60   

Y   

Scientific name of the food item. Given for the least processed form of the food (usually raw), if applicable.

N_Factor   

N 4.2   

Y   

Factor for converting nitrogen to protein

Pro_Factor   

N 4.2   

Y   

Factor for calculating calories from protein.

Fat_Factor   

N 4.2   

Y   

Factor for calculating calories from fat.

CHO_Factor   

N 4.2   

Y   

Factor for calculating calories from carbohydrate.

 

 Food Group Description File  (file name = FD_GROUP) (Table 4)

Table 4. Food Group Description File Format

Field name   

Type   

Blank   

Description

FdGp_Cd   

A 4*   

N   

4-digit code identifying a food group. Only the first 2 digits are assigned. In the future the last 2 digits may be used.

FdGp_Desc   

A 60   

N   

Name of the food group

Nutrient Data File  (file name = NUT_DATA). The Nutrient Data File (Table 5) contains the nutrient values and information about the values, including expanded statistical information.

Table 5. Nutrient Data File Format

Field name   

Type   

Blank   

Description

NDB_No   

A 5*

N   

5-digit Nutrient Data Bank number.

Nutr_No   

A 3*   

N   

3-digit unique identifier code for a nutrient

Nutr_Val   

N 10.3   

N   

Amount in 100 grams, edible portion.

Num_Data_Pts   

N 5.0   

N   

Number of data points (previously called Sample_Ct)

Std_Error   

N 8.3   

Y   

Standard error of the mean. Null if could not be calculated

Src_Cd A 2 N Code indicating type of data
Deriv_Cd A 4 Y Data Derivation Code giving specific information on how the value was determined
Ref_NDB_No A 5 Y NDB number of the item used to impute a missing value. Not populated for this release
Add_Nutr_Mark A 1 Y Indicates a vitamin or mineral added for fortification or enrichment. Not populated for this release
Num_Studies N 2.0 Y Number of studies
Min N 10.3 Y Minimum value
Max N 10.3 Y Maximum value
DF N 2.0 Y Degrees of Freedom
Low_EB N 10.3 Y Lower 95% Error bound
Up_EB N 10.3 Y Upper 95% Error bound
Stat_Cmt A 10 Y Statistical comments. See definitions below.
CC A 1 Y Confidence Code indicating data quality, based on evaluation of sample plan, sample handling, analytical method, analytical quality control, and number of samples analyzed. Not populated for this release
Ref_ID A 41 Y Codes indicating citation to the sources of the data
† Nutrient values have been rounded to a specified number of decimal places for each nutrient. Number of decimal places are listed in the Nutrient Definition File. 

Definitions of each Statistical Comment included in the Nutrient Data table follow:

Nutrient Definition File (file name = NUTR_DEF). The Nutrient Definition File (Table 6) is the support file to the Nutrient Data File. It provides the 3-digit nutrient code, unit of measure, INFOODS tagname, and description.

Table 6. Nutrient Definition File Format

Field name   

Type   

Blank   

Description

Nutr_No   

A 3*   

N   

3-digit unique identifier code for a nutrient

Units   

A 7   

N   

Units of measure - mg, g, mcg, etc.

Tagname   

A 20   

N   

International Network of Food Data Systems (INFOODS) Tagnames *. A unique abbreviation for a food component developed by INFOODS to aid in the interchange of data.

NutrDesc A 60 N Name of the food component

Decimal

A 1 

N   

Number of decimal places to which a nutrient value is rounded

* Klensin et al, 1989

Source Code File (file name = SRC_CD) (table 7)

Table 7. Source Code File Format

Field Name   

Type   

Blank   

Description

Src_Cd   

A 2*   

N   

2-digit code

SrcCd_Desc   

A 60   

N   

Description of source code that identifies the type of nutrient data.

The Source Code File contains codes indicating the type of data (analytical, calculated, assumed zero, and so on) in the Nutrient Data File. To improve the usability of the database, Nutrient Data Lab staff imputed nutrient values for many proximate components, total dietary fiber, and vitamin and mineral values.

A description of each source code follows:

Source code   

Description

1   

Value is analytical or derived from the analytical

4   

Value is imputed

5   

Value upon which a manufacturer based their label claim for added nutrients (used primarily for Breakfast Cereals and Infant Formulas)

7   

Value is an assumed zero, because biologically, the nutrient could not be present (such as dietary fiber in animal products), or the nutrient is expected to be present in insignificant amounts (such as vitamin C in meat products).

8   

Value is calculated from the nutrient label by Nutrient Data Lab

9   

Value is calculated by the manufacturer, not adjusted or rounded for compliance to the Nutrition Labeling and Education Act

12   

Value is analytical, supplied by the manufacturer with partial documentation.

Data Derivation Code Description File: (File Name=DERIV_CD) (table 8)

Table 8. Data Derivation Code File Format

Field Name   

Type   

Blank   

Description

Deriv_Cd   

A 4*   

N   

Derivation code

Deriv_Desc

A 120

N

Description of derivation code giving specific information on how the value was determined


Gram Weight File
  (file name = WEIGHT). The format for the Gram Weight File (table 9) has been changed from previous releases. The measure description is now a field in this file eliminating the need for a Measure File.

Table 9. Gram Weight File Format

Field name   

Type   

Blank   

Description

NDB_No   

A 5*   

N   

5-digit Nutrient Data Bank number

Seq N 2 N Sequence number
Amount N 5.3 N Unit modifier (e.g. 1 in 1 cup)

Msre_Desc   

A 80   

N   

Description (e.g. Cup, diced, 1" pieces)

Gm_wt   

N 9.2   

N   

Gram weight

Num_Data_Pts N 3 Y Number of data points
Std_Dev N 7.3 Y Standard deviation

Footnote File
(file name = FOOTNOTE). This file (table 10) contains additional information about the food item, household weight, or nutrient value.

Table 10. Footnote File Format

Field name   

Type   

Blank   

Description

NDB_No   

A 5*   

N   

5-digit Nutrient Data Bank number

Footnt_no   

A 4*   

N   

Sequence number

Footnt_typ   

A 1   

N   

The type of footnote
D=indicates a footnote adding information to the description
N=indicates a footnote providing additional information on a nutrient value. If the Footnt_typ = N, the Nutr_No will also be filled in.

Nutr_No   

A 3   

Y   

3-digit unique identifier code for a nutrient to which footnote applies

Footnt_txt   

A 200   

N   

Footnote text


Reference File
( File Name=REFERNC) This file (Table 11) provides a citation to the Ref_ID in the Nutrient Data File.

Table 11. Reference File Format

Field name   

Type   

Blank   

Description

Ref_ID

A 6*   

N   

A unique number identifying the reference/source

Authors 

A 255

N

List of authors for a journal article or name of sponsoring organization for other documents

Title   

A 255

N   

Title of article or name document, i.e. a report from a company or trade association

Year 

A 4 N Year document or report was published

Journal   

A 135  

The name of the journal in which the article was published

Vol_City A 10 Y Volume number for journal articles or books; city where sponsoring organization is located
Issue_State A 5 Y Issue number for journal article; state where the sponsoring organization is located
Start_Page A 5 Y Starting page number in document
End_Page A 5 Y Ending page number in document

Flat Files

Abbreviated File (file name = ABBREV). The abbreviated file is available in ASCII formats. The ASCII file (table 12) is in free format. Fields are separated by a caret (^). Text fields are surrounded by a tilde (~). Data refer to 100 g of the edible portion of the food item. Decimal points are included in the fields. Missing values are denoted by the null value of two consecutive carets (^^). The file is sorted in ascending order by the NDB number.

This file is a flat file and is provided for those users who do not need a relational database. It contains the information in one record per food item and is suitable for importing into a spreadsheet. It contains less descriptive information and fewer nutrients and weights than the larger relational files. If additional information is needed, this file can be linked to the other files by the NDB_No.

Table 12. Abbreviated File Format

Field name   

Type   

Description

NDB No.   

A 5*   

5-digit Nutrient Data Bank number

Shrt_Desc   

A 60   

60-character abbreviated description of food item † 

Water   

N 10.3   

Water (g/100 g)

Energ_Kcal   

N 10.3   

Food energy (kcal/100 g)

Protein   

N 10.3   

Protein (g/100 g)

Tot_Lipid   

N 10.3   

Total lipid (fat; g/100 g)

Carbohydrt   

N 10.3   

Carbohydrate, by difference (g/100 g)

Fiber_TD   

N 10.3   

Total dietary fiber (g/100 g)

Ash   

N 10.3   

Ash (g/100 g)

Calcium   

N 10.3   

Calcium (mg/100 g)

Phosphorus   

N 10.3   

Phosphorus (mg/100 g)

Iron   

N 10.3   

Iron (mg/100 g)

Sodium   

N 10.3   

Sodium (mg/100 g)

Potassium   

N 10.3   

Potassium (mg/100 g)

Magnesium   

N 10.3   

Magnesium (mg/100 g)

Zinc   

N 10.3   

Zinc (mg/100 g)

Copper   

N 10.3   

Copper (mg/100 g)

Manganese   

N 10.3   

Manganese (mg/100 g)

Selenium   

N 10.3   

Selenium (mcg/100 g)

Vit_A   

N 10.3   

Vitamin A (IU/100 g)

Vit_E   

N 10.3   

Vitamin E (mg alpha-tocopherol equivalents)

Thiamin   

N 10.3   

Thiamin (mg/100 g)

Riboflavin   

N 10.3   

Riboflavin (mg/100 g)

Niacin   

N 10.3   

Niacin (mg/100 g)

Panto_acid   

N 10.3   

Pantothenic acid (mg/100 g)

Vit_B6   

N 10.3   

Vitamin B6 (mg/100 g)

Folate   

N 10.3   

Folate (mcg/100 g)

Vit_B12   

N 10.3   

Vitamin B12 (mcg per 100 g)

Vit_C   

N 10.3   

Vitamin C (mg/100 g)

FA_Sat   

N 10.3   

Saturated fatty acid (g/100 g)

FA_Mono   

N 10.3   

Monounsaturated fatty acids (g/100 g)

FA_Poly   

N 10.3   

Polyunsaturated fatty acids (g/100 g)

Cholestrl   

N 10.3   

Cholesterol (mg/100 g)

GmWt_1   

N 9.2   

First household weight for this item