Title: Data for "R Graphics Cookbook"
Description: Data sets used in the book "R Graphics Cookbook" by Winston Chang, published by O'Reilly Media.
Authors: Winston Chang
Maintainer: Winston Chang <[email protected]>
License: GPL-2
Version: 2.0
Built: 2025-02-25 03:59:23 UTC
Source: https://github.com/wch/gcookbook
Weekly stock data for AAPL (Apple, Inc.) from 1984 to 2012.
adj_price: Price, adjusted for splits and dividends.
Data from an experiment on the homing performance of a desert ant, Cataglyphis bicolor.
angle: Angle between true home direction and the direction that the ant went in (positive is clockwise).
expt: Number of ants in the experimental condition that went in this direction.
ctrl: Number of ants in the control condition that went in this direction.
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.J. & Ostrowski, E. (1994), A Handbook of Small Data Sets, Chapman & Hall.
Duelli, P. and Wehner, R. (1973), The spectral sensitivity of polarized light orientation in Cataglyphis bicolor (Formicidae, Hymenoptera). Journal of Comparative Physiology, 86, 36-53.
This data set has groupwise means, standard deviations, counts, and standard error of the mean for the cabbages data set from the MASS package. The purpose of this summarized data set is to make it easy to use for example graphs.
The source data set in the MASS package, cabbages
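As a rough illustration of the four groupwise statistics named above (mean, standard deviation, count, and standard error of the mean), the following sketch computes them with Python's standard library. The group labels and values are made up for the example and are not taken from the cabbages data; the sample standard deviation (n - 1 denominator) is assumed.

```python
import math
import statistics

# Hypothetical measurements per group; labels and numbers are illustrative
# only and do not come from the cabbages data set.
measurements = {
    "group_a": [2.5, 2.2, 3.1, 4.3],
    "group_b": [2.0, 2.4, 1.9, 2.8],
}

def summarize(values):
    """Return mean, sample standard deviation, count, and standard error."""
    n = len(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 denominator), an assumption
    return {
        "mean": statistics.mean(values),
        "sd": sd,
        "n": n,
        "se": sd / math.sqrt(n),  # standard error of the mean
    }

summary = {group: summarize(values) for group, values in measurements.items()}
for group, stats in summary.items():
    print(group, stats)
```

The standard error is the standard deviation divided by the square root of the group size, which is why the count is kept alongside the other columns.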
This data set includes estimated global temperature anomaly data for the years 1800 through 2011. The anomaly is the difference from the baseline temperature, which is the mean of the yearly temperatures from 1951 through 1980.
Source: Data source (Berkeley, CRUTEM3, NASA).
Year: Year for the estimate.
Anomaly1y: Temperature anomaly in Celsius, smoothed over one year.
Anomaly5y: Temperature anomaly in Celsius, smoothed over five years.
Anomaly10y: Temperature anomaly in Celsius, smoothed over ten years.
Unc10y: Uncertainty for 10-year-smoothed anomaly.
Berkeley Earth Project: http://berkeleyearth.org/land-and-ocean-data/
Climatic Research Unit (CRUTEM3): http://www.cru.uea.ac.uk/cru/data/temperature/
NASA: http://data.giss.nasa.gov/gistemp/
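The anomaly definition used for the temperature data (difference from the mean of the 1951-1980 yearly temperatures) can be sketched as follows. The yearly temperatures below are hypothetical placeholders, and the sketch does not attempt the one-, five-, or ten-year smoothing, since the smoothing method is not described here.

```python
from statistics import mean

# Hypothetical yearly mean temperatures in degrees Celsius (not real data).
yearly_temp = {1950: 13.8, 1951: 14.0, 1965: 13.9, 1980: 14.1, 2011: 14.6}

# Baseline period: 1951 through 1980 inclusive.
baseline_years = range(1951, 1981)
baseline = mean(t for year, t in yearly_temp.items() if year in baseline_years)

# Anomaly = yearly temperature minus the baseline mean.
anomaly = {year: temp - baseline for year, temp in yearly_temp.items()}

for year in sorted(anomaly):
    print(year, round(anomaly[year], 2))
```

A positive anomaly means the year was warmer than the baseline period; a negative one means it was cooler.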
Corneal thickness of eight people who had glaucoma in one eye.
affected: Corneal thickness (in microns) of eye affected by glaucoma.
notaffected: Corneal thickness (in microns) of eye not affected by glaucoma.
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.J. & Ostrowski, E. (1994), A Handbook of Small Data Sets, Chapman & Hall.
Ehlers, N. (1970), On corneal thickness and intraocular pressure, II. Acta Ophthalmologica, 48, 1107-1112.
Health and economic data about countries around the world from 1960-2010, from the World Bank.
Name: Name of country
Code: Short country code
GDP: Per capita Gross Domestic Product, in adjusted 2011 U.S. Dollars
laborrate: Labor rate.
healthexp: Health expenditures in U.S. Dollars.
infmortality: Infant mortality per 1000 live births.
World Bank: http://data.worldbank.org/
Number of people convicted for drunkenness at Tower Bridge and Lambeth Magistrates' Courts from January 1 to June 27, 1970, classified by age and sex.
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.J. & Ostrowski, E. (1994), A Handbook of Small Data Sets, Chapman & Hall.
Cook, T. (1971). New Society, 20 May, 1971.
This package contains data sets used in the book "R Graphics Cookbook" by Winston Chang, published by O'Reilly Media.
Height and weight of schoolchildren
ageYear: Age in years.
ageMonth: Age in months.
heightIn: Height in inches.
weightLb: Weight in pounds.
Lewis, T., & Taylor, L.R. (1967), Introduction to Experimental Ecology, Academic Press.
This data is from a simulation of Hurricane Isabel in 2003. It includes temperature and wind data for a 2139 km (east-west) x 2004 km (north-south) x 19.8 km (vertical) volume. The simulation data is from the National Center for Atmospheric Research, and it was used in the IEEE Visualization 2004 Contest.
x: Longitude (x coordinate, east-west).
y: Latitude (y coordinate, north-south).
z: Height in km (z coordinate).
vx: x wind component in m/s
vy: y wind component in m/s
vz: z wind component in m/s
t: Temperature in Celsius
speed: wind speed, sqrt(vx^2 + vy^2 + vz^2)
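The speed column is the magnitude of the wind vector, as given by the formula above. A minimal sketch of that calculation, with made-up component values:

```python
import math

def wind_speed(vx, vy, vz):
    """Magnitude of the wind vector (same units as the components, m/s)."""
    return math.sqrt(vx**2 + vy**2 + vz**2)

# Illustrative components in m/s (not taken from the data set).
print(wind_speed(3.0, 4.0, 12.0))  # 13.0
```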
Each row of this data frame represents a pair of characters who had a sexual relationship on the TV show Mad Men, as of the end of season 4. This data can be displayed with an undirected graph.
The placement of names in column Name1 as opposed to Name2 is arbitrary, and not meaningful. In other words, for any row, you can swap the values of Name1 and Name2, and it will represent the same information.
Name1: Name of one sexual partner.
Name2: Name of another sexual partner.
Wired Magazine 20.02, February 2012
For a list of attempted sexual pairings, see madmen2
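Because the order of Name1 and Name2 carries no meaning in this data frame, each row can be treated as an unordered pair. One way to express that, sketched here with placeholder names rather than rows from the data, is to store each edge as a frozenset so that swapped rows compare equal:

```python
# Placeholder rows (Name1, Name2); the names are hypothetical.
rows = [("A", "B"), ("B", "A"), ("A", "C")]

# A frozenset ignores order, so ("A", "B") and ("B", "A") are the same edge.
edges = {frozenset(pair) for pair in rows}

print(len(edges))  # 2
```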
Each row of this data frame represents a pair of characters on the TV show Mad Men, as of the end of season 4. Each row represents an attempted sexual relation: the character in the first column, Name1, attempted to have sex with the character in the second column, Name2. If the relationship goes in both directions (the characters had sex with each other), then there will be two rows, representing each direction. This data can be displayed with a directed graph.
Name1: Character who made sexual advances.
Name2: Character who was the target of sexual advances.
Wired Magazine 20.02, February 2012
For a list of successful sexual pairings, see madmen
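In the directed form, a relationship that goes in both directions appears as two rows, one per direction. The sketch below, using placeholder names rather than real rows, separates such mutual pairs from one-way edges:

```python
# Placeholder directed rows (Name1 -> Name2); the names are hypothetical.
rows = [("A", "B"), ("B", "A"), ("A", "C")]
directed = set(rows)

# A pair is mutual when the reverse edge is also present.
mutual = {frozenset((a, b)) for a, b in directed if (b, a) in directed}

# Edges with no reverse counterpart go in one direction only.
one_way = {(a, b) for a, b in directed if (b, a) not in directed}

print(mutual)
print(one_way)
```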
This data set contains marathon and half-marathon running times for 520 people. Each row represents one person's times.
Half: Time in minutes, for half marathon.
Full: Time in minutes, for full marathon.
Downey, A.B. (2011), Think Stats, O'Reilly Media.
This data set simply has groupwise means of the PlantGrowth data set. The purpose of this summarized data set is to make it easy to use for example graphs.
Dobson, A. J. (1983) An Introduction to Statistical Modelling, Chapman & Hall.
The source data set, PlantGrowth
This is data from an experiment to investigate the effect of cutting length and planting time on the survival of plum root cuttings.
length: Cutting length.
time: Planting time.
survival: Survival status.
count: Number of plants.
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.J. & Ostrowski, E. (1994), A Handbook of Small Data Sets, Chapman & Hall.
Bartlett, M.S. (1935), Contingency table interactions, Journal of the Royal Statistical Society Supplement, 2, 248-252.
This data frame is in "long" format. See plum_wide for the same data in "wide" format.
This is data from an experiment to investigate the effect of cutting length and planting time on the survival of plum root cuttings.
length: Cutting length.
time: Planting time.
dead: Number of dead plants in this condition.
alive: Number of alive plants in this condition.
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.J. & Ostrowski, E. (1994), A Handbook of Small Data Sets, Chapman & Hall.
Bartlett, M.S. (1935), Contingency table interactions, Journal of the Royal Statistical Society Supplement, 2, 248-252.
This data frame is in "wide" format. See plum for the same data in "long" format.
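The relationship between the long layout (one row per length, time, and survival status, with a count) and the wide layout (one row per length and time, with dead and alive columns) can be sketched as a simple pivot. The category labels and counts below are illustrative assumptions, not values from the data set.

```python
# Long format: one row per (length, time, survival) with a count.
# Labels and counts are made up for illustration.
long_rows = [
    {"length": "long", "time": "at_once", "survival": "dead", "count": 10},
    {"length": "long", "time": "at_once", "survival": "alive", "count": 20},
    {"length": "short", "time": "at_once", "survival": "dead", "count": 15},
    {"length": "short", "time": "at_once", "survival": "alive", "count": 5},
]

# Pivot: the survival values become column names, the counts become cells.
grouped = {}
for row in long_rows:
    key = (row["length"], row["time"])
    grouped.setdefault(key, {})[row["survival"]] = row["count"]

# Wide format: one row per (length, time) with dead and alive columns.
wide_rows = [
    {"length": length, "time": time, **counts}
    for (length, time), counts in grouped.items()
]

for row in wide_rows:
    print(row)
```

Both layouts hold the same information; the long form tends to suit grouped plotting, while the wide form is easier to read as a table.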
This data set is for examples of R graphics.
This data frame is in "wide" format. See simpledat_long for the same data in "long" format.
This data set is for examples of R graphics.
This data frame is in "long" format. See simpledat for the same data in "wide" format.
Summarized ToothGrowth data
Batting statistics for the top 144 hitters in Major League Baseball in 2001.
id: Unique player id
first: First name
last: Last name
name: Full name (first and last)
year: Year of data
team: Abbreviation of team played for
lg: League (American League or National League)
g: Number of games
ab: Number of times at bat
r: Number of runs
h: Number of hits (times reached base because of a batted, fair ball without error by the defense)
2b: Hits on which the batter reached second base safely
3b: Hits on which the batter reached third base safely
hr: Number of home runs
rbi: Runs batted in
sb: Stolen bases
cs: Caught stealing
bb: Base on balls (walk)
so: Strike outs
ibb: Intentional base on balls
hbp: Times hit by pitch
sh: Sacrifice hits
sf: Sacrifice flies
gidp: Times grounded into double play
avg: Batting average (hits divided by at-bats)
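The avg column follows directly from the h and ab columns. A small sketch of that division, with illustrative numbers and a guard for zero at-bats (the guard is an assumption of this example, not something the data set documents):

```python
def batting_average(hits, at_bats):
    """Hits divided by at-bats; returns None when there are no at-bats."""
    if at_bats == 0:
        return None
    return hits / at_bats

# Illustrative values (not a real player's statistics).
print(round(batting_average(150, 500), 3))  # 0.3
```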
These values are estimates (not counts) from the U.S. Census Bureau.
Thousands: Number of people, in thousands.
U.S. Census Bureau, Statistical Abstract of the United States, 2003, HS-3: http://www2.census.gov/library/publications/2004/compendia/statab/123ed/hist/hs-03.pdf
This data set represents the percent change in population of states in the U.S. from 2000 to 2010.
Abb: Abbreviated state name.
Region: Region of country that the state is in.
Change: Percent change in population.
U.S. Census Bureau, Statistical Abstract of the United States, 2012, Table 14. https://www2.census.gov/library/publications/2011/compendia/statab/131ed/2012-statab.pdf
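The Change column for the state data is a percent change between two population figures. As a sketch of how such a value is computed, assuming the usual definition (difference divided by the earlier value, times 100) and using made-up populations:

```python
def percent_change(old, new):
    """Percent change from old to new, relative to old."""
    return (new - old) * 100 / old

# Hypothetical populations for 2000 and 2010 (not real state figures).
print(percent_change(1_000_000, 1_100_000))
print(percent_change(2_000_000, 1_900_000))
```

A negative result indicates that the population shrank over the period.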
This data set contains the wind speed and direction over the course of a single day in Chicago (February 29, 2012).
TimeUTC: Time of day in minutes; 0 is midnight.
Temp: Temperature in Celsius.
WindAvg: Average wind speed in m/s in this time block.
WindMax: Maximum wind speed in m/s in this time block.
WindDir: Average direction that wind comes from (0=north, 90=east).
SpeedCat: Average wind speed, categorized in 5 m/s groups.
DirCat: Average direction, categorized in 15-degree groups.
Great Lakes Environmental Research Laboratory: http://www.glerl.noaa.gov/metdata/chi/
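SpeedCat and DirCat in the wind data are binned versions of WindAvg and WindDir. The exact bin edges and labels used in the data set are not stated here, so the sketch below makes its own assumptions: half-open bins that start at 0, a "lower-upper" label for speed, and the lower edge in degrees for direction.

```python
def speed_category(speed, width=5):
    """Label a wind speed (m/s) by its bin, e.g. 7.3 -> '5-10'.

    Assumes half-open bins [lower, lower + width) starting at 0.
    """
    lower = int(speed // width) * width
    return f"{lower}-{lower + width}"

def direction_category(degrees, width=15):
    """Return the lower edge of the direction bin, in degrees.

    Angles are wrapped into [0, 360) first; 0 is north and 90 is east.
    """
    return int((degrees % 360) // width) * width

# Illustrative readings (not rows from the data set).
print(speed_category(7.3))        # 5-10
print(direction_category(100.0))  # 90
```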
World population estimates from 10,000 B.C. to 2000 A.D.
Population: Estimated population, in thousands