Quick Data Definitions
Summary-Level Data: Summary-level data are published data points in either print or electronic format. You would use summary- level data if you were looking for a quick statistic such as the unemployment rate for the current month or if you wanted to see a table of statistics, such as GNP for various countries during a specific time period.
Micro-Level Data: Micro-level data files are the numerically-coded results of individual responses to such files as the census questionnaires, public opinion surveys, etc. You have much more flexibility to work with the data and run statistical analyses on the extracted data. The data are in an unanalyzed, raw format of columns and rows, usually in ASCII format but not always. Some raw data files are accompanied by files in SPSS, SAS or other statistical software format for easier use in these packages. If you are working with only the raw data, you must consult the data documentation (codebook) and write a small program or use an extraction program to have the computer "read" in the data into a useable format.
Data Documentation/Codebooks: Codebooks provide information on the structure, content, and layout of a data file and the questionnaire, if any, used for the survey or study. Many codebooks are available electronically with the data file.
Data Tutorials
Harvard-MIT
Data Center
The “Data Services” tab provides access to data tutorials; FAQ’s; data search tools and data downloading and
extraction methods; data class schedules and reference or consultation
hours for data librarians and statistical consultants.
Inter-University
Consortium for Political and Social Research (ICPSR)
This data use tutorial provides new users with explanations of data terminology,
data formats and ways to search, access and download datasets.
"Quick Look-Up" Data Resources
United States
Statistical
Abstract of the United States
The "National Data Book" contains statistics on social and economic conditions
in the United States. There is also a section on Comparative International
Statistics, which includes information on finance, agriculture, population,
communication, vital statistics and more. This resource is available in
print, CD-ROM and online. The CD-ROM provides spreadsheet files, and the
online edition provides information in excel and .pdf format. There is
a "Summary Statistics" section which provides access to USA
Statistics in Brief, Historical Statistics including population, immigration,
financial, and health data for as far back as 1900, and State Rankings
for various topics including population, income and employment. It also
has a link to a guide to statistical sources.
Fedstats
Fedstats is a gateway to statistics from over 100 U.S. Federal agencies.
It provides official statistical information collected by the Federal
Government. This site provides links to data as well as the agencies that
collect and disseminate the data. Data includes economic, healthcare,
education, crime, foreign trade, and energy use. Searchable by topic or
federal agency.
Lexis
Nexis Statistical/Lexis Nexis (HOLLIS e-resource)
Statistical tables from selected United States government publications,
state government publications, business and association publications,
and international inter-governmental sources. Tables are in PDF and/or
comma-separated variable (CSV) format.
TableBase/Responsive
Database Services (HOLLIS e-resource)
TableBase provides data in a tabular format. TableBase collects data from
over 1000 publications, including privately published statistical annuals,
trade associations, government agencies, non-profit research groups, and
industry reports prepared by investment research groups. The data that
is available includes market share, company and brand rankings, industry
and product forecasts, production and consumption statistics, imports
and exports, usage and capacity, number of users/outlets, trends and demographics.
International
International Historical Statistics Series / B.R. Mitchell; London : Macmillan Reference; New York : Stockton Press, 2003.
Africa, Asia & Oceania, 1750-2000
Lexis
Nexis Statistical/Lexis-Nexis (HOLLIS e-resource)
Statistical tables from selected United States government publications,
state government publications, business and association publications,
and international inter-governmental sources. Tables are in PDF and/or
comma-separated variable (CSV) format.
United
Nations Common Database (UNCD)/United Nations (HOLLIS e-resource)
This United Nations Statistical Office database provides access to hundreds
of statistical series for all countries and areas of the world. The series
can be accessed alphabetically, by source (i.e. the originating international
organization), or by topics. The topics include balance of payments, financial
statistics, income, consumption and wealth, and national accounts.
United Nations Statistical Yearbook / United Nations (print/CD-ROM)
The United Nations Statistical Yearbook is an annual compilation of a wide range of international economic, social and environmental statistics on over 200 countries and areas of the world, compiled from sources including UN agencies and other international, national and specialized organizations.
World
Development Indicators / The World Bank (HOLLIS e-resource)
WDI (World Development Indicators) Online contains statistical data from
the World Bank for almost 600 development indicators and time series data
from 1960 for over 200 countries and 18 country groups. Data includes
social, economic, financial, natural resources, and environmental indicators.
Results can be scaled, indexed against a particular year, viewed by percentage
change, and charted. Data can be exported in Excel.
Selected Data Indexes and Guides
United States
The Economist's Handbook: A Research and Writing Guide / Thomas L. Wyrick; Minneapolis/St. Paul: West Pub. Co., 1994.
Finding & Using Economic Information: a Guide to Sources and Interpretation / David B. Johnson; Mountain View, Calif.: Mayfield Pub. Co., 1993.
Lexis-Nexis
Statistical Abstracts
Bibliographic database which indexes and abstracts the statistical content
of selected United States government publications (ASI, 1973--), state
government publications, business and association publications (SRI, 1980-),
and international inter-governmental sources (IIS). Allows limiting to
one or more components. Some full-text is available electronically (GIF,
PDF, CSV, Excel). All publications are available in microfiche in Research Services in Lamont, Level B.
International
Global Data Locator / George Thomas Kurian; Lanham, Md.: Bernan Press, 1997.
Finding & Using Economic Information: a Guide to Sources and Interpretation / David B. Johnson; Mountain View, Calif.: Mayfield Pub. Co., 1993.
INSTAT--International Statistics Sources: Subject Guide to Sources of International Comparative Statistics / edited by Michael C. Fleming and Joseph G. Nellis; London, New York: Routledge, 1995.
International Business Information: How to Find It, How to Use It / Ruth A. Pagell and Michael Halperin; Phoenix, Ariz.: Oryx Press, 1998.
Lexis-Nexis
Statistical Abstracts
Bibliographic database which indexes and abstracts the statistical content
of selected United States government publications (ASI, 1973--), state
government publications, business and association publications (SRI, 1980-),
and international inter-governmental sources (IIS). Allows limiting to
one or more components. Some full-text is available electronically (GIF,
PDF, CSV, Excel). All publications are available in microfiche in Research Services in Lamont, Level B.
Micro-level Data Resources
IQSS Dataverse Network
IQSS Dataverse provides access to Harvard's Murray Research Archive, as well as these other large data collections: ICPSR, NARA (U.S. National Archives and Records Administration), Roper Center, and The DataWeb (U.S. Census Bureau). Numeric Data Services and other smaller collections can also be searched. Data can be downloaded if permissions allow and in some cases subset for analysis.
Inter-University
Consortium for Political and Social Research (ICPSR)
Largest machine-readable social science data archive in the world; based
at the University of Michigan.The ICPSR site includes a data use tutorial.
The ICPSR datsets available at Harvard are archived in the IQSS Dataverse Network.
Roper Center
for Public Opinion
The Roper Center for Public Opinion provides access to summary-level (aggregate) and micro-level (raw) public opinion data. While the data collection focuses strongly on United States public opinion, it also includes growing collections of (micro-level) European, Latin American (Latin American Databank) and Japanese (JPOLL) polls. The data archive (micro-level data) is searchable by keyword, date, and survey organization. The iPOLL database (summary-level data) is searchable by keyword, subject /or survey organization and survey sponsor; it provides question and response level data. The Roper Center resources require users to set up individual accounts in order to gain access to the data.
Internet
Crossroads in the Social Sciences Data and Program Library Service, University
of Wisconsin-Madison
This website provides over 600 links (each link includes a brief description)
to summary-level and micro-level data sites. The site is organized by
subject. Subjects include crime, education, labor, religion, history,
health.
Statistical Resources
R and S-Plus
An R and S-Plus Companion to Applied Regression John Fox. Thousand Oaks California: Sage Publications, c2002.
Modern Applied Statistics with S W.N. Venables and B.D. Ripley. 4th ed. New York: Springer, c2002.
Introduction to R: PowerPoint document presenting an overview of R and its applications for quantitative research prepared by the Harvard-MIT Data Center.
STATA
A Handbook of Statistical Analyses Using Stata Sophia Rabe-Hesketh, Brian Everitt. Boca Raton .Chapman & Hall/CRC.
Stata Reference Manual: Release 7 College Station, Tex.: Stata Press
Introduction to Stata: PowerPoint document presenting an overview of Stata and its applications for quantitative research prepared by the Harvard-MIT Data Center.
SPSS
An Introductory Guide to SPSS for Windows Eric L. Einspruch. Thousand Oaks, Calif.: Sage Publications, c1998.
Quantitative Data Analysis with SPSS Release 10 for Windows: A Guide for Social Scientists Alan Bryman and Duncan Cramer. London; Philadelphia, PA: Routledge, 2001.
SPSS 10.0 Guide to Data Analysis Marija J. Norusis. Upper Saddle River, N.J.: Prentice Hall, [2000]
Guide
to SPSS: This more in-depth SPSS tutorial, developed by the Harvard-MIT
Data Center, is divided into three sections including Why to use SPSS,
How to use a data set, and How to produce basic statistics.
SAS
SAS Language Reference: Concepts, Version 8 Cary, NC: SAS Institute, c2000.
Getting Started with the SAS System: Version 8 Cary, NC: SAS Institute, 1999.
The Little SAS Book : A Primer Lora D. Delwiche and Susan J. Slaughter. 3rd ed. Cary, NC: SAS Institute, c2003.
SAS Programming for Researchers and Social Scientists Paul E. Spector. 2nd ed.Thousand Oaks, Calif.: Sage Publications, c2001.
SAS Commands and References: A list of commands and references prepared by the Harvard-MIT Data Center.
SAS Statistics and Statistical Graphic Resources: This site, developed by Michael Friendly at the University of York, provides links to SAS information guides, tutorials and FAQs.
General
Handbook of computational statistics: concepts and methods James E. Gentle, Wolfgang Härdle, Yuichi Mori, editors. Berlin ; London : Springer, c2004.
Principles and practice of structural equation modeling Rex B. Kline. 2nd ed. New York : Guilford Press, c2005.

