1. Introduction With the spread of economic activity around the globe, accurate data on trade between countries is of increased interest to researchers in the private and public sectors. This was recognized in the United States by the Omnibus Trade and Competitiveness Act of 1988, which mandated that increased attention be devoted to the collection and dissemination of international statistics. Resulting from the legislation was the National Trade Data Bank (NTDB) program, which distributes the data in a CD-ROM format. While the NTDB includes detailed information on U.S. imports and exports, the earliest year provided is 1989. The only source for earlier years of import and export data are magnetic tapes and printed media, which are often difficult to work with for research purposes. The Omnibus Trade and Competitiveness Act of 1988 also mandated that the United States adopt the Harmonized System (HS) for the collection of import and export data by the Bureau of the Census. The HS is an international classification scheme for commodities. In earlier years, U.S. imports were classified according to the Tariff Schedule of the United States Annotated (TSUSA), while U.S. exports were collected under the "Schedule B" classification system. The fact that no information from either of these classification schemes is included in the NTDB means that researchers cannot easily make the link between earlier and later years. Furthermore, the TSUSA and Schedule B classifications each changed frequently, so that it is difficult to track the same commodity over successive years. These problems have meant that the TSUSA imports and Schedule B exports, which are a potentially very rich and detailed data source, have not been utilized in academic research to the extent that is possible. The objective of this database is to provide complete information on the HS and TSUSA imports as collected by the Bureau of the Census for 1972-1994, on a CD-ROM. It is anticipated that a second CD-ROM dealing with U.S. exports over 1972-1994 will be available at a future date. The import data are merged with various concordances so that in addition to the commodity number and descriptive information such as the commodity name, the Standard Industrial Classification (SIC) number and Standard International Trade Classification (SITC) numbers are provided. This allows that data to be compared across years, and across different classification schemes. In addition, summations of the imports according to the SIC and SITC systems are included on the CD-ROM. 2. Related Research There are several important areas of economic research where the Census trade data have been used. First, the recent concern in the United States that the consumer price index may overstate the extent of price inflation also applies to the import price index, so that it is important to have several sources of price information. The TSUSA and HS import data collected by the Census at the border includes both the value and the quantity of imports, so that the unit-value (=value/quantity) can be used as an estimate of the price. For many years, this was the only source of price trends for goods involved in U.S. trade. More recently, this source has come to be replaced by prices that are obtained through interviews of importing firms, performed by the Division of International Prices, Bureau of Labor Statistics (BLS). The BLS and Census programs are described and contrasted in Alterman (1991) and Walter (1991). While the prices obtained through interviews of firms are often thought to be preferable to the unit-values, these data series are actually complementary. The unit-values can be used to obtain estimates of the "quality-upgrading" that occurs in imports subject to quota restrictions, such as footwear (Aw and Roberts, 1986) and steel (Boorstein and Feenstra, 1991), or to compare the relative quality of U.S. imports from different countries, such as Korea and Taiwan (Feenstra, Hamilton and Yang, 1993; Rodrik, 1994). A number of authors including Knetter (1989) have used the Census data to study the impact of changing exchange rates on the prices of traded goods. Shiells (1991) compares the performance of the BLS import price indexes with the Census unit-values in the context of import demand. Feenstra (1994) used the Census data to construct an alternative estimate of the price indexes for selected imports, which were meant to correct for potential overstatement of price inflation. Feenstra and Shiells (1994) have applied this same idea to the entire range of U.S. merchandise imports, and conclude that the official index of import prices published by the U.S. Department of Commerce is upward biased by as much as one and one-half percentage points annually; this is the same magnitude of upward bias that is believed to apply to the consumer price index. Second, in cases of alleged "dumping" in the United States, whereby imports are sold at prices that are below their foreign prices or costs of production, the commodities in question are identified by their TSUSA or HS number. This means that the TSUSA or HS import data is ideally suited to track the changes in import value, quantity, or unit-value around the time of the alleged dumping and its investigation. This type of research has been undertaken by Harrison (1991), Prusa (1994, 1995), and Staiger and Wolak (1994). If a foreign firm is found to be guilty of dumping, then duties are applied by the United States and consumers face higher prices. However, even when duties are not applied, these researchers have found that the investigation itself and threat of possible duties leads to an increase in import prices. Third, during the decade of the 1980's there have been profound changes in the structure of wages in the United States, with blue-collar workers experiencing a decline in their wages both in real terms, and relative to the wages of white-collar workers. Many researchers are investigating the sources of this decline in blue-collar wages, with the leading explanations being the widespread use of computers in the workplace, and the increased import competition from developing countries. However, these research efforts are hampered by a lack of U.S. import and export data concorded to a domestic industry basis. The only source for this information has been Abowd (1991), who concorded the U.S. import and export data to a 4-digit domestic SIC basis; this data formed the basis for many of the studies in Abowd and Freeman (1991). Unfortunately, the last year of the trade data from Abowd is 1985, and this source does not provide imports by the source country. Included in the present database are the import values on a domestic SIC for 1972-1994, distinguished by the source country. As described in the section 5, the methodology established by Abowd was extended forward to 1994, but was also modified to obtain more accurate import figures for all industries back to 1972. 3. Methodology The disaggregate import data by TSUSA or HS classification was initially obtained on magnetic tape for 1972-1989, or CD-ROM for 1990-1994 (sources are listed in section 6). These sources contained logical record files, where each record had the TSUSA or HS number, and quantitative information on: the value and quantity or imports; the source country (designated by a Census number); the country sub-code (such as Generalized System of Preferences item, Caribbean Basin Initiative item, tariff code 806 or 807 item, etc.); the port of entry into the United States; and other information such as the rate of duty and method of shipment. Starting with these files, four changes were made: (i) The value and quantity data for each TSUSA or HS commodity was summed over all ports of entry (and also over the rates of duty and method of shipment, if these were included). This means that for each imported item, source country, and country sub-code, the total value and quantity of that item coming into the United States was measured; (ii) The quantitative information on the value and quantity of imports was merged with descriptive information such as the TSUSA or HS description, and the import-based SIC or SITC classification code. This descriptive information was stored on each record, so that imported items can be identified by their alphabetic description, and import-based SIC and SITC classifications, rather than only the TSUSA or HS number; (iii) Instead of the classification scheme for source countries used by Census, we instead introduced the United Nations (UN) system for country names and numbers. (This is the six digit Standard Classification of Customs Areas and Territories,and is the same as that used by Statistics Canada in their World Trade Database. See Appendix A.) In some cases, several source countries as identified by the Census classification were merged into a single country as defined by the UN. (iv) The 1977 import data were extensively corrected. The motivation for the first of these adjustments to the original data is that the value of imports by their port of entry is not that interesting a variable for economic analysis: the initial port of entry may have little relation to the location where this commodity is sold or consumed. By eliminating the disaggregation of the data according to port of entry, the size of the data files was reduced quite substantially. The second adjustment to the data is necessary because the TSUSA and HS numbers change frequently over time, so these codes alone cannot be used to identify a commodity over time. In fact, it even happens that the same TSUSA number is used for completely different commodities in two separate years! By including the alphabetic description of commodities, along with their import-based SIC and SITC classifications, it is much easier to track commodities over time. This descriptive information on each commodity was obtained from cumulative concordances, which were initially available for 1978-1988 and 1989-1994. The former concordance was extended back in time to 1972, using the information on TSUSA numbers and alphabetic descriptions contained in U.S. Bureau of the Census [1], for 1972-1977. This time-consuming task was the only way that the quantitative information for 1972-1977 could be linked to the commodity names, and to the import-based SIC numbers, which were obtained from U.S. Bureau of the Census [3]. The third adjustment made to the data was needed because the classification numbers for source countries used by the Census are unfamiliar, and also change over time. Typically, this classification scheme identifies countries in even more detail than the UN classification. For example, in 1988 the Census used three different numbers for France (distinguishing Andorra, Monaco, and the rest of France), three numbers for Italy (distinguishing San Marino, Vatican City, and the rest of Italy), four numbers for New Zealand (distinguishing various surrounding islands), etc. For each year, the Census numbers used for any given UN country were changed to that UN number and name, and the value and quantity of imports were summed within each UN country (and country sub-code). The complete list of UN numbers, along with country names and the corresponding Census numbers (for 1994) are given in Appendix A. The fourth adjustment affected the data for 1977 only, where it became apparent that a significant portion of the import data was either missing from the magnetic tape, or had imports values and quantities that were twice as large as the data listed in the bound volumes Bureau of the Census [1] for 1977. Taking the bound volumes as the accurate source, the data obtained from the magnetic tape was extensively corrected, as follows: (a) For all TSUSA numbers in the range 7000500--7066020 inclusive, the import data was missing from the tape, and for a few TSUSA numbers in the range 6571000--6966000 the import data was missing or had very low values. Correct data was added to the database using the information from the bound volume; (b) For all TSUSA numbers in the range 3823322--3906000; those beginning with 4 or 5; and those in the range 6571000--6966000 (except for the missing data noted above), the import value and quantity on the tape were twice as large as that reported in the bound volume. The tape data was corrected by dividing by two. 4. Comparison with Other Data Sources To check the integrity of the data after making these adjustments, the total value of imports over all source country and all commodities can be compared with the totals from other sources. Included in the database are both the customs value of imports, and the CIF (cost including freight) value of imports. The customs value does not include freight, and is the value on which duties are assessed. It is intended to serve as an arm's-length transactions value for the commodity, and is similar to the FAS (free alongside ship) value, which reflects the value at the foreign port of exportation. Both the customs and CIF values will be used for comparison. In the first three column of Table 1 we focus on the customs value for all merchandise imports. The first column reports the value of imports in our database, followed by the value of imports from the U.S. Bureau of the Census [1],[2], which are the sources that are in principal identical to the original tapes/CD-ROM from which our data were obtained. We therefore expect the first two columns to be very close in magnitude, and this is confirmed for most years. The largest difference in the early years is $333 million in 1977, which was the year in which the magnetic tape contained faulty data; even this difference is only about two-tenths of one percent of the import value. In the later years there is a surprising difference of $1 billion in the 1991 values (again, two-tenths of one percent), but otherwise the magnitude of imports in the database and the Census volumes are very close. In years where there is some difference, the total value of imports in our database is usually less than that in the Census volumes. This is no coincidence, and reflects one type of error in our database that we are aware of: in rare cases, the record(s) indicating the imports of a particular commodity from a particular source country are missing. For example, the TSUSA commodity 6837000, flashlights and parts, is primarily imported from Asian countries, but includes some imports from European nations. In 1982, however, the imports from Europe are missing from the database: the value of the missing imports is $1.0 million. (We thank Frank Wolak of Stanford University for identifying this source of error in the database.) Missing records of this type may reflect errors in the magnetic tape/CD-ROM that we started with, or could have appeared during the processing of the data. In any case, the very close magnitudes of total imports in our database and Census [1],[2] assures us that the number and importance of missing records is small. For comparison in the third column, we include the customs value of all merchandise imports taken from Economic Report of the President (1995, Table B-108). These values are higher than those in columns one or two, because they refer to "general imports" rather than "imports for consumption." The former category includes imports that enter into bonded warehouses or foreign trade zones within the U.S., whereas "imports for consumption" reflect the value of imports for immediate consumption (including withdrawals from bonded warehouses and foreign trade zones). These values are used in the database, and are generally smaller than "general imports." In addition to this difference between columns one or two and column three, there is another source of difference in 1984 and 1985. In these two year, the Economic Report of the President (1995, Table B-108) notes that the import values have been revised. We can judge that this revision dealt with the exact timing of imports during the year: while the import value for 1984 in column three exceeds that in columns one and two, and conversely for 1985, the sum of imports over these two years is quite close in all the columns. It is apparent that the import values in our database do not incorporate this particular revision. Table 1: Comparison of Value of Imports ($ million) see printed codebook for this table In the final two columns of Table 1, we report the import value of manufactured imports only, identified as those commodities with a Standard Industrial Classification (SIC) number beginning with 2 or 3. In the fourth column we report the value of manufactured imports in our database, and in the last column we report the value from the widely-used database of Abowd (1991). Surprisingly, the values from Abowd are significantly higher than those from our database, with a difference ranging from nearly $7 billion in 1978 (or 5% of imports) to $46 billion in 1985 (or about 15% of the import value). While these differences are large, it turns out that they can be traced to a very small number of SIC industries at the 4-digit level. In particular, these are industries in which it is especially difficult to measure or impute the value of imports, for the following reasons. The import data from Census [1],[2] is not collected or concorded to a domestic SIC basis. Rather, this data is concorded to a so-called import-based SIC. This distinction is necessary because a number of industries in the domestic SIC are identified by their method of processing, and this is unknown for imports entering the country. For example, SIC industry 2011, meat packing plants, and industry 2013, sausages and other prepared meats, produce many of the same products. (This example is drawn from Sachs and Shatz (1994, unpublished Appendix), who were the first to note that the total import values from Abowd (1991) exceeded that from published sources. They did not provide an explanation for this difference, however. All SIC numbers used in this section are based on the 1972 version.) What makes them different is that industry 2011 slaughters while industry 2013 purchased carcasses. There is no way to determine the source of materials for imported meat products, so most of the imports are grouped into the import-based SIC code (or MSIC) 2011, with only a very small amount of imports in MSIC 2013. The magnitude of imports in each of these categories for 1978 is shown in Table 2. Table 2: Comparison of MSIC and SIC Imports, 1978 ($ million) see printed codebook for this table Listed in the first and second columns of Table 2 are the CIF value of imports from our database, along with the correspondingly import-based SIC (or MSIC) number. These values can be directly compared to Census [4], U.S. Commodity Exports and Imports as Related to Output. In that publication, the value of imports is compared to U.S. industry supply, so that the import shares defined as (import value)/(domestic supply+import value) can be constructed. To achieve this, the SIC data on domestic output is concorded to the MSIC data on imports. For example, in the third column of Table 2 are listed the SIC numbers that correspond to the MSIC codes given in the first column: as already noted, most of the prepared meats are grouped into MSIC 2011, which incorporates SIC 2011+2013. The exception to this is SIC 2013A, natural sausage casings, which are given the MSIC number 2103 upon importation. It is apparent that there is a very close correspondence between the import values in columns two and four. In column five, the import shares calculated over the corresponding MSIC and SIC groups are given. In the last columns of Table 2, we show the 1978 import values for from Abowd (1991), which are listed according to domestic-based SIC numbers. Because imports are not collected separately by the SIC 2011 and 2013 categories, Abowd has imputed the imports to each in such a way that the import shares in the two SIC categories are equal, and the total value of imports approximately matches that in the earlier columns. (There is some difference between the total value of imports because Abowd (1991) used import shares from a source that was similar, but not identical, to Census [4]. Using these import shares along with the value of domestic production, the import values were then computed.) We will use precisely the same kind of imputation in our database when there is not a one-to-one correspondence between the MSIC and SIC categories. In some cases, however, the imputation becomes more difficult, and it is these cases that account for the overstatement in the total import value from Abowd. The case with the greatest overstatement of total imports is shown next in Table 2. The SIC industry 333 deal with the primary refining and smelting of copper (3331), lead (3332), zinc (3333), aluminum (3334) and materials not elsewhere classified (3339), whereas the SIC industry 3341 deals with the secondary smelting and refining of these metals. Processing through primary or secondary smelting cannot be distinguished when these products are imported, and instead, the import data is simply collected by the type of metal, as listed in the first two columns of Table 2. The SIC numbers in the third column show clearly that the imports in MSIC 3331, for example, are drawn from both primary and secondary smelting of copper (SIC 3331+33412), imports in MSIC 3332 include both primary and secondary smelting of zinc (SIC 3332+33413), and similarly for the other metals. Note that the total value of imports from the database and Census [4] are in close agreement. Turning to the final columns, Abowd (1991) imputed imports into each of the primary smelting SIC categories 3331,2,3,4,9, using import shares that are quite close to those from Census [4]. The problem arises, however, in imputing an import value for secondary smelting (SIC 3341), where is no separate import data. Faced with the range of import shares for each type of metal in the final column of Table 2, Abowd evidently chose the maximum value of these shares (60.8%), and applied this to the value of domestic production, to impute imports for smelted metal of $5,878.7 million. When summing this together with the other imputed imports, the total value of imported nonferrous metals from Abowd is $9.1 billion, which exceeds the value of imported nonferrous metals in our database by $3.6 billion (and exceeds the total from Census [4] by slightly more). In other words, this single industry accounts for fully one-half of the overstatement in the total value of manufacturing imports for 1978. With 450 SIC industries at the 4-digit level, and about 435 having some imports, the concordance developed by Abowd (1991) identifies six cases similar to nonferrous metals, where one SIC industry corresponds to multiple MSIC categories, which themselves may include other SIC industries. In each of these cases, it appears that the maximum value of the possible import shares was chosen in order to compute imports for the SIC industry in question. But in addition, there are some industries not specially identified in the concordance developed by Abowd in which import shares that are too high were used. An example is the SIC industry 3915, jewelers' findings and materials and lapidary work. Census [4] lists an import share of 78% for the 5-digit category SIC 39152 (lapidary work including diamond cutting and polishing), and a much lower import share of 18% for the combined SIC category 3911 (jewelry, precious metals) plus 39151 (jeweler's findings and materials). In this case, Abowd applied the higher import share of 75.2% to the entire SIC 3915 category, which resulted in an overstatement of imports by $1.0 billion in 1978. For that year, we have also found that Abowd's import values exceed those in our database or Census [4] by more than $500 million for the following industries: rubber and plastic footwear (SIC 3021); radio and television receiving sets (SIC 3651); office, computing, and accounting machines (SIC 357); and engineering, laboratory, scientific, and research instruments (SIC 3811). While there are also some other industries in which imports from Abowd (1991) exceed those in our database or Census [4] by less than $500 million, or where Abowd's imports are lower in value, the industries that we have identified account for the bulk of the overstatement in the total import value in 1978. These industries are certainly small in number, and even in cases where the value of imports is overstated, the import shares are still within the range that occur in other related industries. This means that studies that have used Abowd's data in share form (such as some chapters in Abowd and Freeman, 1991), may not be affected to any significant extent by the overstatement. But it is potentially more serious for studies that have relied on the absolute magnitude of imports, such as those that estimate the factor-content of imports and its impact on the labor market. 5. Calculation of Domestic-based SIC Imports Included in the database are the (customs and CIF) value of imports by source country for all manufacturing industries at the 4- digit SIC level, using the 1972 version of the SIC. These values were computed by first summing the import data according to the 4-digit MSIC values. Then a concordance between the MSIC and SIC values was applied to convert the import data to the SIC basis. This concordance was itself developed by starting with the concordance of Abowd (1991), and making adjustments to it based on two sources of information: Census [4], and a remarkable Appendix in Census [5] entitled "Principal Differences Between the SIC-Based Output and Import Product Codes." As suggested by the title, it lists those cases in which the SIC and MSIC numbers differ for the same underlying commodity. This information is actually provided at the 5-digit SIC and MSIC level, though for the purpose of the concordance we worked at the 4-digit level. Then there are three types of matches that can occur between the SIC and MSIC numbers: (1) an "exact match" between one MSIC and one SIC; (2) one MSIC category corresponds to several SIC industries; (3) several MSIC categories correspond to several SIC industries. In the first case, the "exact match" between a MSIC and SIC means that either these two categories really were identical, or that any differences were judged as too small to be important. This occurred for 234 cases out of the 435 4-digit SIC industries with positive imports. In the second case, the MSIC imports are allocated to the various SIC industries in proportion to their domestic output, which means that the import share in each of the SIC industries would be identical. This occurred for another 84 4-digit SIC industries. In the third case, we impute imports to various SIC industries according to the following rules: (a) the total value of imports across the MSIC and SIC categories is identical; (b) imports are allocated across the SIC industries in proportion to their domestic output. An exception is the smelting and refining of nonferrous metals, detailed in Table 2. In that case, the allocation of total imports across primary and secondary smelting is done according to rules (a) and (b). But given this allocation, the division of primary smelting imports across the various types of metals (SIC 3331,2,3,4,9) can be done in proportion to the imports in the like-numbered MSIC categories. This methodology can be used to obtain imports by source country for the 4-digit SIC industries for 1972-1988, since our import data for those years include the MSIC numbers on a 1972 basis. However, for 1989 and later years, the import data includes only the MSIC numbers on a 1987 basis, which are the years in which the SIC codes were substantially revised. In order to convert the import totals for 1989 and later from the 1987 MSIC to the 1972 MSIC basis, we proceeded as follows. For the single year 1988, each TSUSA commodity in the import data includes a listing for both the 1972 MSIC and 1987 MSIC numbers. By aggregating across these, we obtain a "weighting matrix" for each source country between the 4- digit 1972 MSIC numbers and 4-digit 1987 MSIC numbers. For later years, we first sum the data according the 1987 MSIC numbers, then multiply by the weighting matrix for each source country, and then apply our concordance between the 1972 MSIC and 1972 SIC numbers. This gives us the value of imports for the year in question organized by the 1972 SIC industries, and distinguishing the various source countries for each industry. 6. Technical Information All of the data, concordances, and documentation in the import database are stored as xBase files. The data and concordances have the extension .DBF. For each datafile (or group of files) and each concordance, there is documentation with the identical name but the extension .TXT; this documentation is the same as that found on the following pages. This disaggregate import data includes both values and quantities, so that unit-values(=value/quantity) can be constructed. In addition, there are import values for the SIC and SITC aggregations. The value of import duties as estimated by the U.S. Customs are included according to TSUSA, HS, SIC and SITC categories, so that ad valorem tariff rates can be calculated by dividing the import duties and values. The complete list of data files included on the CD-ROM is displayed in Table 3. Table 3: Files on the CD-ROM File Description TSUSA import data, 1972-1988: IMP72.DBF,....,IMP88.DBF Import data by TSUSA commodity for the years 1972-1988 Harmonized System import data, 1989-1994: IMP89.DBF,....,IMP94.DBF Import data by HS commodity for the years 1989-1994 SIC (1972 basis, 4-digit) import data, 1972-1992: SIC72.DBF,....,SIC92.DBF SIC imports by source country for the years 1972-1992 SITC (Rev. 2 and 3, 5-digit) import data, 1972-1994: SITC72.DBF,....,SITC94.DBF SITC imports by source country for the years 1972-1994 Concordances: CON7288.DBF Concordance for 1972-1988 TSUSA data CON8994.DBF Concordance for 1989-1994 HS data HS2TSUSA.DBF, Concordances between HS and TSUSA TSUSA2HT.DBF SITC23.DBF, SITC32.DBF Concordance between SITC Revisions 2 and 3 Other: COUNTRY.DBF List of country names and codes SICNAME.DBF, SITCREV*.DBF Industry/product names for SIC and SITC (* = 2-3) UNIT7288.DBF, UNIT8994.DBF Units of quantity used in IMPYR.DBF HSDESC.DBF, TSUSA.DBF Classification hierarchies for HTS and TSUSA USIMPORT.TXT Full codebook The xBase CD-ROM includes software that supports browsing, viewing, and exporting the datafiles. This software is written in dBase 5 for DOS; it should run under DOS, Windows 3 or Windows 95. Software is not available for other platforms. The software requires at least an 80386 processor. There are two versions of the software, a standalone version and a compact version; both versions are functionally identical. While performance should be better with the standalone version, the standalone version requires more memory. To install the software, run the program INSTALL.EXE from the CD-ROM root directory. The installation program prompts for the drive destination letter of the CD-ROM drive and the target hard drive, the sub-directory in which to install the software, and the preferred implementation, standalone or compact. The program installs the standalone version by default; only install the compact version after determining that the standalone version will not run on the target system. The installation program will copy and configure the software for the target system. To run the IMPORTS.EXE program, type IMP7294 from the target installation directory. The following technical information is necessary for installations that do not use the provided installation program. The main executable is IMPORTS.EXE. This program requires three additional binary files: FDISK.BIN, GETDRIVE.BIN, and HDISK.BIN. In addition, the compact version requires the runtime libraries: DBASE32.RTL, DBASE1.RES, DBASE2.RES, and DBASE3.RES. The DOS environment variable WORKDIR points to the CD-ROM drive. This environment variable is required by the program. The DOS environment variable BINPATH points to the binary files. The BINPATH variable must terminate with a "\". This environment variable is optional. The IMPORTS.EXE returns one of three error codes to DOS if it fails to start. DOS interprets the errorlevel values from high to low. Look for errorlevel 3, before looking for errorlevel 2, etc. ERRORLEVEL 3 indicates that the program could not find the binary files FDISK.BIN, GETDRIVE.BIN, or HDISK.BIN. If the BINPATH environment variable is set, the binary files must be in the specified directory. If the BINPATH variable is not set, the binary files must be in the open directory. ERRORLEVEL 2 indicates that the program could not find the datafiles. This could be that the WORKDIR environment variable is incorrectly set or that the CDROM is not correctly loaded. ERRORLEVEL 1 indicates that the WORKDIR environment variable is not set. For example, the following batchfile sets the location of the CD-ROM drive to D:, sets the location of the binary files to C:\IMPORTS, and runs the program IMPORTS.EXE located in that directory. SET WORKDIR=D: SET BINPATH=C:\IMPORTS\ C:\IMPORTS\IMPORTS IF ERRORLEVEL 3 GOTO ERR3 IF ERRORLEVEL 2 GOTO ERR2 IF ERRORLEVEL 1 GOTO ERR1 GOTO END :ERR3 ECHO Cannot find binary files, check DOS environment variable BINPATH GOTO END :ERR2 ECHO Cannot find data files, check CDROM drive GOTO END :ERR1 ECHO DOS environment variable WORKDIR is not set END SET WORKDIR= SET BINPATH= 6A. TSUSA Import Data, 1972-1988 (IMPYR.DBF, YR = 72-88) For years up until 1988, imports to the United States at a disaggregate level were measured according to the Tariff Schedule of the United States Annotated (TSUSA) classification. These datafiles contain the U.S. import data according to TSUSA number, distinguished by source country, and including both quantitative information about imports and descriptive information about each commodity. The files IMPYR.DBF contain U.S. import data for 1972-88, sorted by TSUSA number, with YR = {a two-digit number in the range 72-88}. Record Layout: Database information for IMPYR.DBF (YR = 72-88) is as follows: Name Type Length Decimal Description TSUSA C 7 0 Tariff Schedule of the USA (TSUSA) no. (7-digit) CNTRYCODE N 6 0 Source country code (see Appendix A) CNTRYNAME C 8 0 Country name CNTRYSUBC C 1 0 Country Sub-Code 0 = Country of Origin 1 = Country of Shipment 2 = 807 Commodity 3 = 806 Commodity 4 = Generalized System of Preferences (GSP) Item 5 = Folklore Merchandise 9 = Caribbean Basin Initiative (CBI) Item I = Israel Free Trade Agreement T = 807 Commodity in CBI Country SITC2 C 5 0 Rev. 2 SITC no. (5-digit) SIC72 C 8 0 1972 import-based SIC no. (8-digit) SIC87 C 8 0 1987 import-based SIC no. (8-digit) UNIT C 3 0 Units for quantity (see Appendix B) QUANTITY N 12 0 Quantity CUSTVAL N 12 0 Customs value (dollars) CIFVAL N 12 0 CIF value (dollars) DUTIES N 12 0 Duties paid (dollars) DESC C 64 0 Commodity description YEAR N 2 0 Year C= Character, N=Numeric. Customs value does not include freight and is the value on which duties are assessed. It is intended as an arms-length transactions value for the commodity. It is similar to the FAS (free alongside ship) value, which was collected by the Census prior to 1982. CIF (cost including freight) includes freight but not tariff duties. Missing and Imputed Values: (i) Missing values for any alphabetic variable are indicated by a blank field, as occurs especially for the Units of quantity, indicating that either the units could not be measured, or were simply missing. When the units could not be measured, there will be a zero value for Quantity, but positive entries for Customs value or CIF value. In other cases, a zero value for Quantity and also the Customs and CIF values indicates that the good was not imported from that country in that year. (ii) For 1972 and 1973, the CIF value and Duties paid are missing, as indicated by a dot. Duties paid are also missing for a very small number of TSUSA commodities in other years. (iii) In some years, the Units of quantity have been imputed based on the units used for the same commodity in other years. (iv) For 1972-1977, the Rev. 2 SITC codes have been imputed using information on: (a) the Rev. 2 SITC codes used for a like-named TSUSA commodity in later years; or (b) the Rev. 1 SITC code used for that commodity, and a concordance between Rev. 1 SITC and Rev. 2 SITC. In a very small number of cases, the Rev. 2 SITC number is missing, as indicated by a blank field. (v) The 1987 SIC codes contain a number of cases where "Z" is used to indicate that the classification is not known. For example, "23ZZZZZZ" indicates that the commodity belongs to the 2-digit SIC category 23, but more precise information is not available. The 1987 SITC numbers are missing for those commodities with TSUSA numbers that were used only before 1978, as indicated by a blank field. Special Considerations: The 1972 and 1987 import-based SIC numbers are not the same as the SIC numbers used to identify U.S. industries. This is because industries in the United States are sometimes defined in terms of the processing that occurs in them, whereas the method of processing is not known for imports. As a result, a condensed set of SIC numbers called "import-based SIC" are used, and these are included in the IMPYR.DBF files. Related Files: (i) A complete list of the commodities, including the TSUSA number for each, 1972 and 1987 import-based SIC numbers, Rev. 2 SITC number (5-digit), and the first and last years that the TSUSA number is used for that commodity, is contained in the concordance CON7288.DBF, which is described in CON7288.TXT. (ii) The source country for each imported commodity is identified by the name and United Nations (UN) code. The complete list of names and UN codes, along with a correspondence to the country codes used by the U.S. Census, is provided in Appendix A. (iii) There is a cross-reference between the Tariff Schedule of the United States Annotated (TSUSA) codes that are used in the 1972- 1988 import files, and the Harmonized Tariff System (HTS) numbers that are used in the 1989-1994 import files, and. This cross-reference is contained in HTS2TSUS.DBF and TSUSA2HT.DBF described in HS_TSUSA.TXT. Sources: The data for 1972-1988 were obtained on magnetic tapes from: National Archives and Record Administration, Annual Import Databank, IA245, Record group 29, Washington, D.C. [magnetic tapes]. The same data are reported in printed form in: U.S. Imports for Consumption and General Imports, TSUSA Commodity by Country of Origin, FT246, Bureau of the Census, Dept. of Commerce, Washington, D.C., 1972-1988. Additional information on the variables listed above can be obtained from: Guide to Foreign Trade Statistics, Bureau of the Census, Department of Commerce, Washington, D.C., 1983. 6B. Harmonized System Import Data, 1989-1994 (IMPYR.DBF, YR = 89-94) In 1989 and later years, the Harmonized System of commodity classification has been used to measure disaggregate U.S. imports and exports. The particular application of the Harmonized System to U.S. imports is called the Harmonized Tariff Schedule (HTS). These datafiles contain the U.S. import data according to HTS number, distinguished by source country, and including both quantitative information about imports and descriptive information about each commodity. The files IMPYR.DBF (YR = 89-94) contain U.S. import data for 1989-94, sorted by HTS numbers, with YR = {a two-digit number in the range 89-94}. Record Layout: Database information for IMPYR.DBF (YR = 89-94) is as follows: Name Type Length Decimal Description HTS C 10 0 Harmonized Tariff System (HTS) no. (10-digit) CNTRYCODE N 6 0 Source country code (see Appendix A) CNTRYNAME C 8 0 Country name CNTRYSUBC C 1 0 Country Sub-Code 0 = Country of Origin 1 = Country of Shipment 4 = Generalized System of Preferences (GSP) Item 9 = Caribbean Basin Initiative (CBI) Item B = APTA Automotive Products Trade Act C = Agreement on Civil Aircraft I = Israel Free Trade Agreement J = Andean Trade Preference Act X = Canada Free Trade Agreement (suspended 1/1/94) W = Puerto-Rico product improved in a CBI country and returned to the United States Y = North American Free Trade Agreement (NAFTA) Z = Compact of Free Association Act with Marshall Islands and Federated States of Micronesia SITC3 C 5 0 Rev. 3 SITC no. (5-digit) SIC87 C 4 0 1987 import-based SIC no. (4-digit) UNIT C 3 0 Units for quantity (see Appendix B) QUANTITY N 12 0 Quantity CUSTVAL N 12 0 Customs value (dollars) CIFVAL N 12 0 CIF value (dollars) DUTIES N 12 0 Duties paid (dollars) DESC C 50 0 Commodity description YEAR N 2 0 Year C= Character, N=Numeric. Customs value does not include freight and is the value on which duties are assessed. It is intended as an arms-length transactions value for the commodity. CIF (cost including freight) includes freight but not tariff duties. Missing and Imputed Values: Missing values for any alphabetic variable are indicated by a blank field, as occurs especially for the Units of quantity, indicating that either the units could not be measured, or were simply missing. When the units could not be measured, there will be a zero value for Quantity, but positive entries for Customs value or CIF value. In other cases, a zero value for Quantity and also for the Customs and CIF values indicates that the commodity was not imported from that country in that year. Related Files: (i) A complete list of the commodities, including the HTS number for each, 1987 import-based SIC numbers (4-digit), Rev. 3 SITC number (5-digit), and the first and last years that the HTS number is used for that commodity, is contained in the concordance CON8994.DBF, which is described in CON8994.TXT. (ii) The source country for each imported commodity is identified by a name and United Nations (UN) code. The complete list of names and UN codes, along with a correspondence to the country codes used by the U.S. Census, is provided in Appendix A. (iii) There is a cross-reference between the Harmonized Tariff System (HTS) numbers that are used in the 1989-1994 import files, and the Tariff Schedule of the United States Annotated (TSUSA) codes that are used in the 1972-1988 import files. This cross-reference is contained in HTS2TSUS.DBF and TSUSA2HT.DBF and described in HS_TSUSA.TXT. Sources: The data for 1989 was obtained from: National Archives and Record Administration, Annual Import Databank, IA245, Record group 29, Washington, D.C. [magnetic tape]. Data for 1990 and later years were obtained from: U.S. Imports of Merchandise on CD-ROM [machine-readable data file] / prepared by the Bureau of the Census. - Washington: The Bureau [producer and distributor], 1990- 1994. The same data in printed form are reported in: U.S. Imports for Consumption, HTSUSA Commodity by Country of Origin, FT247, Bureau of the Census, Washington, D.C., 1989-1994. Additional information on the variables listed above can be obtained from: Guide to Foreign Trade Statistics, Bureau of the Census, Department of Commerce, Washington, D.C., 1991. 6C. SIC (1972 basis, 4-digit) Import Data, 1972-1992 (SICYR.DBF, YR = 72-92) These files contain the U.S. import data according to a 4-digit Standard Industrial Classification (SIC) basis, for 1972-1992, and related information. Later years are not included because converting the data to an SIC basis requires data on domestic shipments by 4-digit SIC industries, which is available only up until 1992. Also, note that this conversion is done only for manufactured goods, which are those having SIC numbers beginning with 2 or 3. Note: File SIC77.DBF: Bahamas/Jamaica (real country code 350440/353880) have "mistaken" country codes 350400/353800 in 1977 ONLY, and for ONLY the commodities with SITC code 79320/83100, respectively. These mistaken country codes occur in the original TSUSA files, the 1977 SITC files, and the 1977 SIC file as noted above. Two additional files were used to create the domestic SIC estimates provided in these datasets. These supplemental files are available as part of the ASCII version of this dataset only. One, SIC_MSIC.ASC provides information on the type of match between SIC and MSIC codes (one-to-one, several-to-one, exact match, etc.). The other, MSIC8772.ASC links 1987 to 1972 MSIC codes and provides the weights used to convert to a 1972-basis. (1) 1972-based SIC Import Data Using an SIC to MSIC concordance, the import data summed according to 4-digit MSIC was converted to a 4-digit SIC basis for 1972-1988. For 1989 and later years, the MSIC was on a 1987-basis, and so it first had to be converted to a 1972-basis,using the weighting information described above. These data are stored in the files SICYR.DBF (YR = 72-92) Record Layout: Name Type Length Decimal Description SIC72 C 4 0 1972 SIC no. (4-digit) CNTRYNAME C 8 0 Country name CNTRYCODE C 6 0 Source Country code (see Appendix A) CIFVAL N 12 0 CIF value (dollars) CUSTVAL N 12 0 Customs value (dollars) YR N 2 0 Year C= Character, N=Numeric. Customs value does not include freight and is the value on which duties are assessed. It is intended as an arms-length transactions value for the commodity. CIF (cost including freight) includes freight but not tariff duties. Missing and Imputed Values: (i) The CIF value of imports is missing for 1972 and 1973 as indicated by a dot. (ii) Included in these files are the CIF and Customs value of imports summed over all source countries, in which case the Country name is listed as WORLD, and the Country code is 100000. 6D. SITC (Rev. 2 and 3, 5-digit) Import Data, 1972-1994 (SITCYR.DBF, YR = 72-94) Using the files IMPYR.DBF for 1972-1994, the import data are summed according to the 5-digit Standard International Trade Classification (SITC) codes. The results are stored in SITCYR.DBF (YR = 72-94). Record Layout: Name Type Length Decimal Description SITC2 C 5 0 Rev. 2 SITC no. (5-digit) SITC3 C 5 0 Rev. 3 SITC no. CNTRYCODE N 6 0 Country code (see Appendix A) CNTRYNAME C 8 0 Source country name CUSTVAL N 12 0 Customs value (dollars) CIFVAL N 12 0 CIF value (dollars) DUTIES N 12 0 Duties paid (dollars) YR N 2 0 Year C= Character, N=Numeric. A 3-digit Rev. 3 SITC is used in the 1972-1988 files, and the 5-digit Rev. 3 SITC no. for 1989-1994. Customs value does not include freight and is the value on which duties are assessed. It is intended as an arms-length transactions value for the commodity. CIF (cost including freight) includes freight but not tariff duties. Special Considerations: (i) For 1972-1977, the Rev. 2 SITC codes have been imputed using information on: (a) the Rev. 2 SITC codes used for a like-named TSUSA commodity in later years; (b) the Rev. 1 SITC code used for that commodity. This imputation may cause some irregular movement in the import values across 5-digit Rev. 2 SITC categories over 1977- 1978. In cases where a 5-digit Rev. 2 SITC code could not be imputed, a 4-digit (or more aggregate) code is used instead, with zero appended as the fifth digit. (ii) For 1972-1988, the data were initially organized on a 5-digit Rev. 2 SITC basis. Then the Rev. 3 SITC codes were imputed using the concordance contained in SITCR32.DBF. Because this imputation is difficult to perform when going from Rev. 2 to 3 (because Rev. 3 is more disaggregate at the 5-digit level), only the 3-digit Rev. 3 codes are used. This imputation may cause some irregular movement in the import values across 3-digit Rev. 3 SITC categories over 1988-1989. In cases where a 3-digit Rev. 3 SITC code could not be imputed, a 2- digit code is used instead, with zero appended as the third digit. Any given 3-digit Rev. 3 SITC number will generally appear multiple times in a year, corresponding to different 5-digit Rev. 2 SITC numbers. Thus, to compute the value of imports by 3-digit Rev. 3 code, it is necessary to sum over all records for each such code. (iii) For 1989-1994, the data are initially organized on a 5-digit Rev. 3 SITC basis. In this case the concordance SITCR32.DBF gives a unique 5-digit Rev. 2 SITC code for each of the Rev. 3 codes. So these Rev. 2 codes were added into the file for each year, and then the records were re-sorted according to the Rev. 2 codes. As is apparent by inspection, each 5-digit Rev. 2 code can appear more than once. Thus, to compute the value of imports by 5-digit Rev. 2 code, it would be necessary to sum over all records for each such code. (iv) Also included in these files are the variables listed above summed over all source countries, in which case the Country name is listed as WORLD, and the Country code is 100000. Related Files: The commodity names corresponding to each SITC code, for each revision of the SITC, are found in SITCREV2.DBF, and SITCREV3.DBF. Concordances between the various revisions of the SITC are found in SITCR23.DBF, and SITCR32.DBF. 6E. Concordance for TSUSA Imports, 1972-1988 (CON7288.DBF) For years up until 1988, imports to the United States at a disaggregate level were measured according to the Tariff Schedule of the United States Annotated (TSUSA) classification. The file CON7288.DBF contains a complete list of the TSUSA numbers for 1972-88, along with various information about that commodity. Record Layout: Name Type Length Decimal Description TSUSA C 7 0 Tariff Schedule of the USA (TSUSA) no. (7-digit) SIC72 C 8 0 1972 SIC no. (8-digit) SIC87 C 8 0 1987 SIC no. (8-digit) SITC2 C 5 0 Rev. 2 SITC no. (5-digit) SITC3 C 3 0 Rev. 3 SITC no. (3-digit) YRFIRST C 2 0 First year TSUSA no. is used YRLAST C 2 0 Last year TSUSA no. is used UNITS C 3 0 Units for quantity (see Appendix B) DESC C 64 0 Commodity description (TSUSA) Missing and Imputed Values: (i) Some of the units of quantity are missing, and are indicated by blanks. (ii) In some years, the Units of quantity have been imputed based on the units used for the same commodity in other years. (iii) Some of the 1972 SIC numbers have been revised from the original source, to obtain a better match with the commodity description. For example, various articles of women's, children's and infant's clothing were classified in 1972 SIC numbers with the first four digits 2321, which refers to "men's and boy's shirts." The 1972 SIC number for these items was changed to 2361ZZZZ, which refers to "girl's, children's and infant's clothing." In addition, the category 3715ZZZZ referring to "truck trailers" was created. (iv) The Rev. 2 SITC codes have been imputed for TSUSA categories used during 1972-1977 by applying information on: (a) the Rev. 2 SITC codes used for a like-named TSUSA commodity in later years; or (b) the Rev. 1 SITC code used for that commodity. In a very small number of cases, the Rev. 2 SITC number is missing, as indicated by a blank. (v) The 1987 SIC numbers contain a number of cases where "Z" is used to indicate that the classification is not known. For example, "23ZZZZZZ" indicates that the commodity belongs to the 2-digit SIC category 23, but more precise information is not available. The 1987 SIC numbers are missing for the commodities with TSUSA numbers that were only used before 1978, as indicated by a blank. Special Considerations: It should be noted that the TSUSA numbers for any commodity change over time, so the only way to keep track of a given commodity is by its alphabetic description. One way to identify all the TSUSA numbers corresponding to a given commodity is to SORT the data by the description. However, the descriptions themselves also change slightly over time due to variations in spelling, etc. This means that the set of TSUSA numbers for a given description will not equal all the TSUSA numbers for that item, since there may be slightly different alphabetic descriptions for essentially the same item. Some attempt has been made to resolve this issue by using consistent alphabetic description for commodities over various years, but cases of slight changes in spelling still remain. Related Files: (i) The Description has been used to identify the TSUSA commodities in the files IMPYR.DBF, for the years YR = 72-88, and does not have any missing values. The SIC and Rev. 2 SITC numbers, and units of quantity in this concordance are identical to those in IMPYR.DBF for 1978-88. (ii) The Rev. 3 SITC numbers used in CON7288.DBF are obtained from the Rev. 2 to Rev. 3 SITC concordance reported in SITCR23.DBF. (iii) For years after 1988, imported commodities are identified by the Harmonized Tariff Schedule (HTS) numbers. A concordance of these numbers is contained in CON8994.DBF, as described in CON8994.TXT. A cross-reference between the TSUSA and HTS numbers is contained in HTS2TSUS.DBF and TSUSA2HT.DBF, as described in HS_TSUSA.TXT. Sources: A concordance for 1978-1988 was obtained from: U.S. International Trade Administration, COMPRO database [machine-readable file]. In order to extend this back to 1972, the TSUSA numbers and commodity descriptions for these years were typed into the concordance from the annual source: U.S. Imports for Consumption and General Imports, TSUSA Commodity by Country of Origin, FT246, Bureau of the Census, Washington, D.C., 1972-1977. The SIC and Rev. 1 SITC numbers corresponding to the 1972- 1977 TSUSA numbers were obtained from: U.S. Foreign Trade Statistics. Classifications and Cross- Classifications, 1974. Washington, D.C.: U.S. Dept. of Commerce, Bureau of the Census, December 1975. A concordance between Rev. 1 and Rev. 2 SITC was obtained from Robert Lipsey, National Bureau of Economic Research, and was used to convert the SITC codes for the 1972-1977 period to a Rev. 2 basis. 6F. Concordance for Harmonized System Imports, 1989-1994 (CON8994.DBF) In 1989 and later years, the Harmonized System of commodity classification has been used to measure disaggregate U.S. imports and exports. The particular application of the Harmonized System to U.S. imports is called the Harmonized Tariff Schedule (HTS). The file CON8994.DBF is a concordance that contains a complete list of the HTS numbers used identify U.S. imports over 1989-1994, along with various information about each of these commodities. Record Layout: Name Type Length Decimal Description HTS C 10 0 Harmonized Tariff System (HTS) no. (10-digit) SIC87 C 4 0 1987 SIC no. (4-digit) SITC2 C 5 0 Rev. 2 SITC no. (5-digit) SITC3 C 5 0 Rev. 3 SITC no. (5-digit) ENDUSE C 5 0 End-Use Classification YRFIRST C 2 0 First year HTS no. is used YRLAST C 2 0 Last year HTS no. is used UNITS C 3 0 Units for quantity (see Appendix B) DESC C 50 0 Commodity description (HTS) C= Character, N=Numeric. Missing Values: Some units of quantity are missing, which are indicated by a blank. Related Files: (i) The HTS description, SIC and SITC numbers, and units of quantity in this concordance are identical to those used in the files IMPYR.DBF (YR = 89-94). (ii) For years before 1989, imported commodities are identified by the Tariff Schedule of the United States Annotated (TSUSA) numbers. A concordance of these numbers is contained in CON7288.DBF, as described in CON7288.TXT. A cross-reference between the TSUSA and HTS numbers is contained in HTS2TSUS.DBF and TSUSA2HT.DBF, as described in HS_TSUSA.TXT. Source: U.S. International Trade Administration, COMPRO database [machine-readable file]. 6G. Harmonized System to TSUSA Concordance (HTS2TSUS.DBF and TSUSA2HT.DBF) In 1989, the United States changed its system for collecting import (and export) data. Prior to that year, the import data were collected according to the Tariff Schedule of the United States Annotated (TSUSA), whereas beginning in 1989 the data were collected according to the Harmonized System of commodity classification. The application of the Harmonized System to U.S. imports is called the Harmonized Tariff Schedule (HTS). The files HTS2TSUS.DBF and TSUSA2HT.DBF contain cross-references between the 7-digit TSUSA codes for 1987, and the 8-digit HTS codes proposed at that time. Since the actual conversion from the TSUSA codes for U.S. imports and the HTS codes occurred in 1989, so this file can be used to obtain the linkage between a 1987 or 1988 TSUSA code, and the corresponding HTS code in 1989. (HTS codes that were introduced in years later than 1989 would not be reflected in this cross-reference). The only difference between the two files is field position and sort order. Record Layout for HTS2TSUS.DBF: Name Type Length Decimal Description HTS C 8 0 HTS code (8-digit) TSUSA C 7 0 TSUSA code (7-digit) C= Character, N=Numeric. The records are sorted by HTS code. Record Layout for TSUSA2HT.DBF: Name Type Length Decimal Description TSUSA C 7 0 TSUSA code (7-digit) HTS C 8 0 HTS code (8-digit) C= Character, N=Numeric. The records are sorted by TSUSA code. Missing Values and Coding Errors: Both HTS2TSUS.DBF and TSUSA2HT.DBF have a missing value (blank) for the HTS codein the record, with a TSUSA value of 3201900. The record containing HTS number 99028471 also contains a blank for the TSUSA code, indicating missing data. Special Considerations: This source for this cross-reference, USITC (1988) listed below, contains the following explanatory notes: "The cross- references are designed to assist the international trade community in translating a known classification in the TSUSA into a likely classification under the HTS. The user is strongly cautioned against relying on the cross-reference in order to determine legally appropriate tariff classifications under the HTS. All TSUSA numbers have been aligned with at least one HTS subheading. Similarly, all HTS subheadings have been referenced to at least one TSUSA number or headnote. Nevertheless, these cross-references do not include all possible TSUSA/HTS combinations." Most importantly, within the cross-reference each TSUSA code can correspond to more than one HTS code, and vice-versa. Thus, this cross-reference does not give a unique correspondence between the TSUSA and HTS. Its usefulness will depend on the particular commodities than each researcher is interested in. The USITC (1988) publication listed below also provides the following information concerning "special situations": 1. Intangibles Certain HTS subheadings (for example, 2716.00, 8901.10 and 8904.00) have no equivalent TSUSA number since they describe goods listed in general footnote 5 to the Tariff Schedules of the United States that are identified as "intangibles." Intangibles are articles not subject to the provisions of the Tariff Schedule of the U.S. In these cases, the code GHDNT5G is used in place of a TSUSA number. 2. Certain woven fabrics There are an enormous number of 7-digit TSUSA numbers between 320.01 through 331.98, many of which have very little trade. In order to reduce the complexity of the cross-reference, TSUSA numbers at the 5-digit level are instead used for these commodities. Even at the five-digit level, the number of commodities is reduced by consolidating the fourth and fifth digits. For example, TSUSA items 320.01 through 320.09 are shown in the cross-reference under item 320.09. In general, the fourth and fifth digits of the TSUSA items have been replaced as follows: Range of actual 4th and 5th digits 4th and 5th digits as shown in cross reference 01-09 09 10-19 19 20-29 29 30-39 39 40-49 49 50-59 59 60-69 69 70-79 79 80-89 89 92-98 99 With these fourth and fifth digits, the sixth and seventh digits for the TSUSA numbers in the cross-reference are set at zero. The treatment of TSUSA item 338.50 in the cross-reference is similar to that of items 320.01 through 331.98, and the same adjustments apply. 3. Watches and clocks The tariff classification applicable to watches, clocks with watch movements, and watch movements under the TSUSA were too convoluted to apply the general principles for linking TSUSA and HTS. As a result, for these items certain numbers reported in the 7- digit TSUSA columns are not actually TSUSA numbers, but are instead Census trade reporting numbers. At the same time, some 7- digit TSUSA numbers covering these articles are not included in the listings. These exceptions are as follows: (a) All 7-digit numbers shown the in the TSUSA column beginning with 715 or 716. These numbers, which cover watches and clocks with watch movements, are the numbers used by Census in publishing import trade statistics, rather than the TSUSA numbers used on Customs entry forms. (b) TSUSA numbers beginning with 716.37, 716.38, 716.39, 716.40, 716.45 and 719. Of these numbers, which cover watch movements, only those for which import trade was reported in the period 1983-86 are listed. Source: United States International Trade Commission. Continuity of Import and Export Trade Statistics After Implementation of the Harmonized Commodity Description and Coding System. Report to the President on Investigation no. 332-250 under Section 332 of the Tariff Act of 1930. USITC Publication 2051, January 1988, Washington, D.C. The data in HTS2TSUS.DBF and TSUSA2HT.DBF was scanned from Annex II of this publication. Bruce A. Blonigen, Department of Economics, University of Oregon, arranged to have this scanning performed at the U.S. International Trade Commission. 6H. SIC and SITC Names and Concordances (SICNAME.DBF, SITCREV*.DBF (* = 2-3), SITCR23.DBF, SITCR32.DBF) The file SICNAME.DBF contains the commodity codes and names for the Standard Industrial Classification (SIC), 1972 basis, for manufactured goods only (SIC codes beginning with 2 and 3). These SIC codes are used in the files SICYR.DBF. The organization of the records is self-explanatory. This file was scanned from the document: Standard Industrial Classification Manual, 1972, Executive Office of the President, Office of Management and Budget, Government Printing Office. The file SITCREV2.DBF contains the codes and names for the Rev. 2 SITC, and was obtained from Bruce A. Blonigen, Dept. of Economics, University of Oregon. The file SITCREV3.DBF contains the codes and names for the Rev. 3 SITC, and was copied from: U.S. Imports of Merchandise on CD-ROM [machine-readable data file]/prepared by the Bureau of the Census. - Washington: The Bureau [producer and distributor], 1994. A concordance between between Rev. 3 and 2 is contained in SITC32.DBF. This file was obtained from Robert E. Lipsey, National Bureau of Economic Research. In general, it is easier to go from a later revision of the SITC to an earlier edition, rather than vice- versa, because the number of categories tends to expand over time. However, in some cases it was also necessary to go from Rev. 2 to Rev. 3. To achieve this, the concordance SITCR23.DBF was developed from and SITCR32.DBF. However, each Rev. 2 number does not correspond to a single Rev. 3 number, instead an aggregate 3-digit Rev. 3 number is used, with zero's appended to it to achieve 5- digits. The concordance between the Standard Industrial Trade Classification (SITC) Revision 2 and Revision 3 is provided in two files. One, SITCR23.DBF, links the 5-digit Rev. 2 to the corresponding 3-digit Rev. 3. Because each 5-digit Rev. 2 number can correspond to several different Rev. 3 numbers at the 5-digit or 4-digit level, only the 3-digit Rev. 3 numbers are used. In cases where a 5- digit Rev. 2 number does not correspond to a unique 3-digit Rev. 3 number, instead a 2-digit Rev. 3 number is used, with a zero appended onto it to achieve 3 digits. Record Layout: Name Type Length Decimal Description SITC2 C 5 0 Rev. 2 SITC no. (5-digit) SITC3 C 5 0 Rev. 3 SITC no. (5-digit) Source: Calculated from the file SITCREV32.DBF, and using additional information from the Rev. 2 to Rev. 3 concordance in: Standard International Trade Classification Revision 3, United Nations Statistical Papers, Series M, No, 34/Rev.3, 1986. The concordance between the Standard Industrial Trade Classification (SITC) Revision 3 and Revision 2 is contained in the file named SITC32.DBF. The Rev. 3 number is given in the first column, and the corresponding Rev. 2 number is in the second column. Both codes are 5-digits. This means that any code with a smaller number of digits has zero's appended onto it to achieve 5 digits. Special Considerations: In the files IMPYR.DBF for 1989-1994 and the concordance CON8994.DBF, the Rev. 3 category 95000 has been introduced for "Gold coin and other coin." These are distinguished separately from "Gold numismatic (collecter's) coins," which has the Rev. 3 number 89650, and the Rev. 2 number 89605. Accordingly, the Rev. 3 number 95000 has been introduced into the concordance, with the corresponding Rev. 2 number of 89605. Record Layout: Name Type Length Decimal Description SITC3 C 5 0 Rev. 3 SITC no. (5-digit) SITC2 C 5 0 Rev. 2 SITC no. (5-digit) C= Character, N=Numeric. Source: Robert E. Lipsey, National Bureau of Economic Research. The data in SITCYR.DBF is a summation of that found in IMPYR.DBF. Thus, the details about each variable provided in the documentation to IMPYR.DBF also applies to SITCYR.DBF. References: Abowd, John M. (1991) "Appendix: The NBER Immigration, Trade, and Labor Market Files," in John M. Abowd and Richard B. Freeman, eds. Immigration, Trade, and the Labor Market. Chicago: Univ. of Chicago Press and NBER, 407-422. Abowd, John M. and Richard B. Freeman, eds. Immigration, Trade, and the Labor Market. Chicago: Univ. of Chicago Press and NBER. Alterman, William (1991) "Price Trends in U.S. Trade: New Data, New Insights," in Peter Hooper and J. David Richardson, eds. International Economic Transactions. Studies in Income and wealth, no. 55. Chicago: Univ. of Chicago Press and NBER, 109-139. Aw, Bee Yan and Mark J. Roberts (1986) "Estimating Quality Change in Quota-Constrained Import Markets: The Case of U.S. Footwear," Journal of International Economics, 21, 45-60. Boorstein, Randi and Robert C. Feenstra (1991) "Quality Upgrading and Its Welfare Cost in U.S. Steel Imports, 1969-74," in Elhanan Helpman and Assaf Razin, eds. International Trade and Trade Policy. Cambridge: MIT Press, 1991, 167-186. Feenstra, Robert C. (1994) "New Product Varieties and the Measurement of International Prices," American Economic Review, 84(1), March, 157-177. Feenstra, Robert C., Gary Hamilton, and Tzu-han Yang (1993) "Market Structure and International Trade: Business Groups in East Asia," NBER Working Paper no. 4536. Feenstra, Robert C. and Clinton R. Shiells (1994) "Bias in U.S. Import Prices and Demand," NBER Working Paper no. 4841, August, forthcoming in T. Bresnahan and R. Gordon, eds. New Products: History, Methodology and Applications. Chicago: Univ. of Chicago Press and NBER. Harrison, Ann (1991) :The New Trade Protection: Price Effects of Antidumping and Countervailing Measures in the United States," The World Bank, Trade Policy Division, Working Paper no. 808. Knetter, Michael M. (1989) "Price Discrimination by U.S. and German Exporters," American Economic Review, 79(1), 198-210. Shiells, Clinton R. (1991) "Errors in Import-Demand Estimates Based Upon Unit-Value Indexes," Review of Economics and Statistics, 55, 456-464. Prusa, Thomas J. (1994) "Pricing Behavior in the Presence of Antidumping Law," Journal of Economic Integration, 9(2), 260- 289. Prusa, Thomas J. (1995) "The Trade Effects of U.S. Antidumping Actions," presented at the NBER conference, "Effects of U.S. Trade Protection and Promotion Policies," Richmond, VA, October 6-7, 1995. Rodrik, Dani (1994) "Industrial Organization and Product Quality: Evidence from South Korean and Taiwanese Exports," in Paul Krugman and Alasdair Smith, eds. Empirical Studies of Strategic Trade Policy. Chicago: Univ. of Chicago Press and NBER, 195- 210. Sachs, Jeffrey D., and Howard J. Shatz (1994) "Trade and Jobs in U.S. Manufacturing," Brooking Papers on Economic Activity, 1, 1- 69. Staiger Robert W. and Frank A. Wolak (1994) "Measuring Industry- Specific Protection: Antidumping in the United States," Brookings Paper on Microeconomics, 51-118. Walter, Bruce C. (1991) "Quality Issues Affecting the Compilation of the U.S. Merchandise Trade Statistics," in Peter Hooper and J. David Richardson, eds. International Economic Transactions. Studies in Income and wealth, no. 55. Chicago: Univ. of Chicago Press and NBER, 89-103. Census Publications [1] U.S. Bureau of the Census, U.S. Imports for Consumption and General Imports, TSUSA Commodity by Country of Origin, FT246, Washington, D.C.: Department of Commerce, annual, 1972-1988. [2] U.S. Bureau of the Census, U.S. Imports for Consumption, HTSUSA Commodity by Country of Origin, FT247, Washington, D.C.: Department of Commerce, annual, 1989-1994. [3] U.S. Bureau of the Census, U.S. Foreign Trade Statistics. Classifications and Cross-Classifications, 1974. Washington, D.C.: Department of Commerce, December, 1975. [4] U.S. Bureau of the Census, U.S. Commodity Exports and Imports as Related to Output, Washington, D.C.: Department of Commerce, annual, various years. [5] U.S. Bureau of the Census, U.S. Imports: SIC-Based Products, FT210, Washington, D.C.: Department of Commerce, 1980, 1982, 1983, 1984. Appendix A: Country Codes and Names (COUNTRY.DBF) The U.S. import data collected by the Bureau of the Census keeps track of the source country by certain Census codes. For the import database, the United Nations (UN) country codes and names are used instead. The file COUNTRY.DBF gives a complete list of the UN codes, UN country abbreviations, the corresponding Census codes for 1994, and the full name of the Census country. This file is printed below. Record Layout: Name Type Length Decimal Description CNTRYCODE C 6 0 Source country code (UN system) CNTRYNAME C 8 0 Country name (UN abbreviation) CENSUSCODE C 4 0 Census country code for 1994 CENSUSNAME C 30 0 Full Census country name C= Character, N=Numeric. The records are sorted by the six-digit UN codes. The first two-digits of that code are a regional identifier, the next three-digits are a specific country code, and the last digit is a special modifier than equals zero in nearly all cases. Missing values: In a few cases, the Census country code for 1994 is missing, as indicated by a blank. This means that the country in question did not appear in the Census import data for 1994, but did appear in some earlier year; an example is East Germany. Special considerations: There are more Census country codes than UN codes. This means that a given UN code may appear on several subsequent records, followed by the same abbreviated UN country name; on each of these records, a different Census country code and Census county name will appear. For example, South Africa is treated as one country in the UN codes, but is broken down into several smaller regions in the Census codes and names. Sources: The UN codes are the six-digit Standard Classification of Customs Areas and Terri-tories, and are nearly the same as those used by Statistics Canada in their World Trade Database. The Census codes and country names are taken from the file COUNTRY.DBF contained on: U.S. Imports of Merchandise on CD-ROM [machine-readable data file] / prepared by the Bureau of the Census. - Washington: The Bureau [producer and distributor], 1994. See separate file COUNTRY.DBF for country coding. Appendix B: Units for Quantity The following units for quantity are used in IMPYR.DBF for 1972-1988: Units Code Units for Quantity AOZ Avoirdupois ounce BBL Barrel BFT Board feet BU Bushel C One hundred CAR Carot CD Cord CFT Cubic feet CLB Content pound CRT Crate CTN Content ton CUR Curie CWT Hundredweight (100 pounds) CYD Cubic Yard DOZ Dozen DPC Dozen Pieces DPR Dozen Pair GAL Gallon GBX Gross boxes GR Gross GRL Gross lines GM Gram GTN Gross ton JWL Jewels LB Pound LF Leaf LFT Linear Foot LTN Long ton (2,240 pounds) LYD Long yard M Thousand (1,000) MBF Thousand board feet MC Milli-Curie MCF Thousand cubic feet MFT Thousand feet MLF Thousand linear feet MSF Thousand square feet NO Number PC Piece PFG Proof gallon PK Pack PR Pair SFT Square foot SQ Square SQI Square inch STN Short ton (2,000 pounds) SUF Superficial foot SYD Square yard TON Long Ton (2,240 pounds) TOZ Troy ounce YD Yard The following units for quantity are used in IMPYR.DBF for 1989-1994: Units Code Units for Quantity BBL Barrel CAR Carot CBM Cubic Meters CKG Content Kilogram CTN Content Metric Ton CUR Curie CYK Clean Yield Kilogram DOZ Dozen DPC Dozen Pieces DPR Dozen Pair FBM Fiber Meter GCN Gross Containers GKG Kilogram (gross) GM Gram GRS Gross HUN Hundred KG Kilograms KGS Kilogram Total Sugars KWH Kilowatt-hours LTR Liters M2 Square Meters MC Milli-Curie MCU Micro-Curie MTR Meter NO Number ODE Ozone Depletion Equivalent PCS Pieces PFL Proof Liter PKS Packs PRS Pairs RBA Running Bales SCM Square Centimeters SET Sets SQ Square SQM Square meters TBE Thousand Standard Brick Equivalent TCM Thousand Cubic Meter THS Thousands TON Metric Ton