Catalog of Holdings

Study Report

Study Number: AH-011-001-1-1-United States-USCEN-1940

Subject Area: Individual

Bibliographic Citation: Census of population and housing, 1940 [United States]: public use sample, 1/100 sample.  [machine-readable data file] / U.S. Department of Commerce. Bureau of the Census  [principal investigator(s)] / Washington, D.C.: U.S. Department of Commerce. Bureau of the Census  [distributor].

Originating Archive Number: 8236

Comments: Abstract taken from ICPSR. Obtain data files from ICPSR.

Access Status: Unrestricted access.

Documentation: 1 volume hard copy codebook (5/98 This ICPSR codebook replaced the Census copy of technical document and the CDE copy of porcedural document.)

Abstract: This data collection and its 1950 counterpart were assembled through a collaborative effort between the United States Bureau of the Census and the Center for Demography and Ecology of the University of Wisconsin. The 1940 and 1950 Census Public Use Sample Project was supported by The National Science Foundation under Grant SES-7704135. The collections contain a stratified 1-percent sample of households, with separate records for each household, for each 'sample line' respondent, and for each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1940 and 1950 Censuses of Population. The universe for the sample included all persons and households within the United States. Geographic identification of the location of the sampled households includes Census regions and divisions, States (except Alaska and Hawaii), Standard Metropolitan Areas (SMA's), and State Economic Areas (SEA's). The SMA's and SEA's are comparable for both the 1940 and 1950 Public Use Microdata Samples (PUMS). The data collections were constructed from and consist of 20 independently-drawn subsamples stored in 20 discrete physical files. Each of the 20 subsamples contains three record types (household, 'sample line', and person). Both collections had both a complete-count and a sample component. Individuals selected for the sample component were asked a set of additional questions. Only households with a 'sample line' person were included in the public use microdata sample. The collections also contain records of group quarters members who were also on the Census 'sample line'. For the 1940 and 1950 collections, each household record contains variables describing the location and composition of the household. The 'sample line' records for 1940 contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, wage deductions for social security, and occupation. The person records for 1940 contain such demographic variables as nativity, marital status, family membership, education, employment status, income, and occupation. Accompanying the data collections are code books which include an abstract, descriptions of sample design, processing procedures and file structure, a data dictionary (record layout), category code lists, and a glossary. Also included is a procedural history of the 1940 Census. The data collections are arranged by subsample with each subsample stored as a separate physical file of information. The 20 subsamples were selected randomly. Within each of the 20 subsamples, records are sequenced by State. Extracting all of the records for one State entails reading through all of the 20 physical files and selecting that State's records from each of the 20 subsamples. Record types are ordered within household (household characteristics first, 'sample line' next, and person records last). The 1940 collection consists of a total of 2,133,800 records: 391,034 household records, 391,034 'sample line' records, and 1,351,732 person records. Each record type has a logical record length of 138 characters.

Media/File Reports: