Beginner's Guide to Locating and Using Numeric Data



Quick Data Definitions

Summary-Level Data: Summary-level data are published data points in either print or electronic format. You would use summary- level data if you were looking for a quick statistic such as the unemployment rate for the current month or if you wanted to see a table of statistics, such as GNP for various countries during a specific time period.

Micro-Level Data: Micro-level data files are the numerically-coded results of individual responses to such files as the census questionnaires, public opinion surveys, etc. You have much more flexibility to work with the data and run statistical analyses on the extracted data. The data are in an unanalyzed, raw format of columns and rows, usually in ASCII format but not always. Some raw data files are accompanied by files in SPSS, SAS or other statistical software format for easier use in these packages. If you are working with only the raw data, you must consult the data documentation (codebook) and write a small program or use an extraction program to have the computer "read" in the data into a useable format.

Data Documentation/Codebooks: Codebooks provide information on the structure, content, and layout of a data file and the questionnaire, if any, used for the survey or study. Many codebooks are available electronically with the data file.

 

Return to top

Data Tutorials

Harvard-MIT Data Center
The “Data Services” tab provides access to data tutorials; FAQ’s; data search tools and data downloading and extraction methods; data class schedules and reference or consultation hours for data librarians and statistical consultants.

Inter-University Consortium for Political and Social Research (ICPSR)
This data use tutorial provides new users with explanations of data terminology, data formats and ways to search, access and download datasets.

Return to top

"Quick Look-Up" Data Resources

United States

Statistical Abstract of the United States
The "National Data Book" contains statistics on social and economic conditions in the United States. There is also a section on Comparative International Statistics, which includes information on finance, agriculture, population, communication, vital statistics and more. This resource is available in print, CD-ROM and online. The CD-ROM provides spreadsheet files, and the online edition provides information in excel and .pdf format. There is a "Summary Statistics" section which provides access to USA Statistics in Brief, Historical Statistics including population, immigration, financial, and health data for as far back as 1900, and State Rankings for various topics including population, income and employment. It also has a link to a guide to statistical sources.

Fedstats
Fedstats is a gateway to statistics from over 100 U.S. Federal agencies. It provides official statistical information collected by the Federal Government. This site provides links to data as well as the agencies that collect and disseminate the data. Data includes economic, healthcare, education, crime, foreign trade, and energy use. Searchable by topic or federal agency.

Lexis Nexis Statistical/Lexis Nexis (HOLLIS e-resource)
Statistical tables from selected United States government publications, state government publications, business and association publications, and international inter-governmental sources. Tables are in PDF and/or comma-separated variable (CSV) format.

TableBase/Responsive Database Services (HOLLIS e-resource)
TableBase provides data in a tabular format. TableBase collects data from over 1000 publications, including privately published statistical annuals, trade associations, government agencies, non-profit research groups, and industry reports prepared by investment research groups. The data that is available includes market share, company and brand rankings, industry and product forecasts, production and consumption statistics, imports and exports, usage and capacity, number of users/outlets, trends and demographics.

International

International Historical Statistics Series / B.R. Mitchell; London : Macmillan Reference; New York : Stockton Press, 2003.

Africa, Asia & Oceania, 1750-2000

Europe, 1750-2000

the Americas, 1750-2000

Lexis Nexis Statistical/Lexis-Nexis (HOLLIS e-resource)
Statistical tables from selected United States government publications, state government publications, business and association publications, and international inter-governmental sources. Tables are in PDF and/or comma-separated variable (CSV) format.

United Nations Common Database (UNCD)/United Nations (HOLLIS e-resource)
This United Nations Statistical Office database provides access to hundreds of statistical series for all countries and areas of the world. The series can be accessed alphabetically, by source (i.e. the originating international organization), or by topics. The topics include balance of payments, financial statistics, income, consumption and wealth, and national accounts.

United Nations Statistical Yearbook / United Nations (print/CD-ROM)
The United Nations Statistical Yearbook is an annual compilation of a wide range of international economic, social and environmental statistics on over 200 countries and areas of the world, compiled from sources including UN agencies and other international, national and specialized organizations.

World Development Indicators / The World Bank (HOLLIS e-resource)
WDI (World Development Indicators) Online contains statistical data from the World Bank for almost 600 development indicators and time series data from 1960 for over 200 countries and 18 country groups. Data includes social, economic, financial, natural resources, and environmental indicators. Results can be scaled, indexed against a particular year, viewed by percentage change, and charted. Data can be exported in Excel.

Return to top

Selected Data Indexes and Guides

United States

The Economist's Handbook: A Research and Writing Guide / Thomas L. Wyrick; Minneapolis/St. Paul: West Pub. Co., 1994.

Finding & Using Economic Information: a Guide to Sources and Interpretation / David B. Johnson; Mountain View, Calif.: Mayfield Pub. Co., 1993.

Lexis-Nexis Statistical Abstracts
Bibliographic database which indexes and abstracts the statistical content of selected United States government publications (ASI, 1973--), state government publications, business and association publications (SRI, 1980-), and international inter-governmental sources (IIS). Allows limiting to one or more components. Some full-text is available electronically (GIF, PDF, CSV, Excel). All publications are available in microfiche in Research Services in Lamont, Level B.

International

Global Data Locator / George Thomas Kurian; Lanham, Md.: Bernan Press, 1997.

Finding & Using Economic Information: a Guide to Sources and Interpretation / David B. Johnson; Mountain View, Calif.: Mayfield Pub. Co., 1993.

INSTAT--International Statistics Sources: Subject Guide to Sources of International Comparative Statistics / edited by Michael C. Fleming and Joseph G. Nellis; London, New York: Routledge, 1995.

International Business Information: How to Find It, How to Use It / Ruth A. Pagell and Michael Halperin; Phoenix, Ariz.: Oryx Press, 1998.

Lexis-Nexis Statistical Abstracts
Bibliographic database which indexes and abstracts the statistical content of selected United States government publications (ASI, 1973--), state government publications, business and association publications (SRI, 1980-), and international inter-governmental sources (IIS). Allows limiting to one or more components. Some full-text is available electronically (GIF, PDF, CSV, Excel). All publications are available in microfiche in Research Services in Lamont, Level B.

Return to top

Micro-level Data Resources

IQSS Dataverse Network
IQSS Dataverse provides access to Harvard's Murray Research Archive, as well as these other large data collections:  ICPSR, NARA (U.S. National Archives and Records Administration), Roper Center, and The DataWeb (U.S. Census Bureau).  Numeric Data Services and other smaller collections can also be searched.  Data can be downloaded if permissions allow and in some cases subset for analysis.

Inter-University Consortium for Political and Social Research (ICPSR)
Largest machine-readable social science data archive in the world; based at the University of Michigan.The ICPSR site includes a data use tutorial. The ICPSR datsets available at Harvard are archived in the IQSS Dataverse Network.

Roper Center for Public Opinion
The Roper Center for Public Opinion provides access to summary-level (aggregate) and micro-level (raw) public opinion data. While the data collection focuses strongly on United States public opinion, it also includes growing collections of (micro-level) European, Latin American (Latin American Databank) and Japanese (JPOLL) polls. The data archive (micro-level data) is searchable by keyword, date, and survey organization. The iPOLL database (summary-level data) is searchable by keyword, subject /or survey organization and survey sponsor; it provides question and response level data. The Roper Center resources require users to set up individual accounts in order to gain access to the data.

Internet Crossroads in the Social Sciences Data and Program Library Service, University of Wisconsin-Madison
This website provides over 600 links (each link includes a brief description) to summary-level and micro-level data sites. The site is organized by subject. Subjects include crime, education, labor, religion, history, health.

 

Return to top


Statistical Resources

R and S-Plus

An R and S-Plus Companion to Applied Regression John Fox. Thousand Oaks California: Sage Publications, c2002.

Modern Applied Statistics with S W.N. Venables and B.D. Ripley. 4th ed. New York: Springer, c2002.

Introduction to R: PowerPoint document presenting an overview of R and its applications for quantitative research prepared by the Harvard-MIT Data Center.

 

Return to top

STATA

A Handbook of Statistical Analyses Using Stata Sophia Rabe-Hesketh, Brian Everitt. Boca Raton .Chapman & Hall/CRC.

Stata Reference Manual: Release 7 College Station, Tex.: Stata Press

Introduction to Stata: PowerPoint document presenting an overview of Stata and its applications for quantitative research prepared by the Harvard-MIT Data Center.

Return to top

SPSS

An Introductory Guide to SPSS for Windows Eric L. Einspruch. Thousand Oaks, Calif.: Sage Publications, c1998.

Quantitative Data Analysis with SPSS Release 10 for Windows: A Guide for Social Scientists Alan Bryman and Duncan Cramer. London; Philadelphia, PA: Routledge, 2001.

SPSS 10.0 Guide to Data Analysis Marija J. Norusis. Upper Saddle River, N.J.: Prentice Hall, [2000]

Guide to SPSS: This more in-depth SPSS tutorial, developed by the Harvard-MIT Data Center, is divided into three sections including Why to use SPSS, How to use a data set, and How to produce basic statistics.

Return to top

SAS

SAS Language Reference: Concepts, Version 8 Cary, NC: SAS Institute, c2000.

Getting Started with the SAS System: Version 8 Cary, NC: SAS Institute, 1999.

The Little SAS Book : A Primer Lora D. Delwiche and Susan J. Slaughter. 3rd ed. Cary, NC: SAS Institute, c2003.

SAS Programming for Researchers and Social Scientists Paul E. Spector. 2nd ed.Thousand Oaks, Calif.: Sage Publications, c2001.

SAS Commands and References: A list of commands and references prepared by the Harvard-MIT Data Center.

SAS Statistics and Statistical Graphic Resources: This site, developed by Michael Friendly at the University of York, provides links to SAS information guides, tutorials and FAQs.

Return to top

General

Handbook of computational statistics: concepts and methods James E. Gentle, Wolfgang Härdle, Yuichi Mori, editors. Berlin ; London : Springer, c2004.

Principles and practice of structural equation modeling Rex B. Kline. 2nd ed. New York : Guilford Press, c2005.

Return to top

 

Page Last Reviewed: June 10, 2008