AE 17: Visualizing household income in New York

Suggested answers

Application exercise
Answers
Modified

March 26, 2025

Packages

library(tidyverse)
library(sf)
library(colorspace)
library(scales)

# set default theme
theme_set(theme_minimal())

# create reusable labels for each plot
map_labels <- labs(
  title = "Median household income in New York in 2023",
  subtitle = "By census tract",
  color = NULL,
  fill = NULL,
  caption = "Source: American Community Survey"
)

Load New York 2023 median household income

We will use two data files for this analysis. The first contains median household incomes for each census tract in New York from 2023. The second contains the boundaries of each county in New York.

# load data
ny_inc <- st_read(dsn = "data/ny-inc.geojson")
Reading layer `ny-inc.geojson' from data source 
  `/Users/soltoffbc/Projects/info-3312/course-site/ae/data/ny-inc.geojson' 
  using driver `GeoJSON'
Simple feature collection with 5396 features and 4 fields (with 16 geometries empty)
Geometry type: MULTIPOLYGON
Dimension:     XY
Bounding box:  xmin: -79.76215 ymin: 40.4961 xmax: -71.85648 ymax: 45.01585
Geodetic CRS:  NAD83
ny_counties <- st_read(dsn = "data/ny-counties.geojson")
Reading layer `ny-counties.geojson' from data source 
  `/Users/soltoffbc/Projects/info-3312/course-site/ae/data/ny-counties.geojson' 
  using driver `GeoJSON'
Simple feature collection with 62 features and 4 fields
Geometry type: MULTIPOLYGON
Dimension:     XY
Bounding box:  xmin: -79.76215 ymin: 40.4961 xmax: -71.85648 ymax: 45.01585
Geodetic CRS:  NAD83
ny_inc
Simple feature collection with 5396 features and 4 fields (with 16 geometries empty)
Geometry type: MULTIPOLYGON
Dimension:     XY
Bounding box:  xmin: -79.76215 ymin: 40.4961 xmax: -71.85648 ymax: 45.01585
Geodetic CRS:  NAD83
First 10 features:
         GEOID                                         NAME medincomeE
1  36015010800   Census Tract 108; Chemung County; New York      54354
2  36055010200    Census Tract 102; Monroe County; New York     135260
3  36055011705 Census Tract 117.05; Monroe County; New York     115568
4  36055013902 Census Tract 139.02; Monroe County; New York      57588
5  36055004702  Census Tract 47.02; Monroe County; New York      46250
6  36055013604 Census Tract 136.04; Monroe County; New York      52766
7  36055006300     Census Tract 63; Monroe County; New York      65167
8  36055006000     Census Tract 60; Monroe County; New York      56951
9  36055008100     Census Tract 81; Monroe County; New York      47273
10 36055012700    Census Tract 127; Monroe County; New York     120753
   medincomeM                       geometry
1        7404 MULTIPOLYGON (((-76.83044 4...
2       10008 MULTIPOLYGON (((-77.61592 4...
3       31498 MULTIPOLYGON (((-77.48616 4...
4       13986 MULTIPOLYGON (((-77.65909 4...
5        8729 MULTIPOLYGON (((-77.61634 4...
6       12776 MULTIPOLYGON (((-77.69645 4...
7       32012 MULTIPOLYGON (((-77.64697 4...
8       10918 MULTIPOLYGON (((-77.56453 4...
9       22281 MULTIPOLYGON (((-77.6001 43...
10       7602 MULTIPOLYGON (((-77.57503 4...
ny_counties
Simple feature collection with 62 features and 4 fields
Geometry type: MULTIPOLYGON
Dimension:     XY
Bounding box:  xmin: -79.76215 ymin: 40.4961 xmax: -71.85648 ymax: 45.01585
Geodetic CRS:  NAD83
First 10 features:
   GEOID                        NAME medincomeE medincomeM
1  36013 Chautauqua County, New York      56507       1981
2  36045  Jefferson County, New York      64978       2730
3  36059     Nassau County, New York     143408       1643
4  36111     Ulster County, New York      81804       2953
5  36051 Livingston County, New York      72464       2969
6  36123      Yates County, New York      67521       4456
7  36025   Delaware County, New York      60226       2922
8  36103    Suffolk County, New York     128329       1374
9  36047      Kings County, New York      78548       1052
10 36105   Sullivan County, New York      69826       2633
                         geometry
1  MULTIPOLYGON (((-79.76215 4...
2  MULTIPOLYGON (((-76.14744 4...
3  MULTIPOLYGON (((-73.76871 4...
4  MULTIPOLYGON (((-74.78069 4...
5  MULTIPOLYGON (((-78.06078 4...
6  MULTIPOLYGON (((-77.36711 4...
7  MULTIPOLYGON (((-75.42264 4...
8  MULTIPOLYGON (((-72.0377 41...
9  MULTIPOLYGON (((-74.04171 4...
10 MULTIPOLYGON (((-75.14474 4...

Part 1

Draw a continuous choropleth of median household income

Your turn: Create a choropleth map of median household income in New York. Use a continuous color gradient to identify each tract’s median household income.

Tip

Use the stored map_labels to set the title, subtitle, and caption for this and the remaining plots.

ggplot(data = ny_inc) +
  # use fill and color to avoid gray boundary lines
  geom_sf(aes(fill = medincomeE, color = medincomeE)) +
  # increase interpretability of graph
  scale_color_continuous(labels = label_dollar()) +
  scale_fill_continuous(labels = label_dollar()) +
  map_labels

Your turn: Now revise the map to use an optimized color gradient for improved readability.

ggplot(data = ny_inc) +
  # use fill and color to avoid gray boundary lines
  geom_sf(mapping = aes(fill = medincomeE, color = medincomeE)) +
  # increase interpretability of graph
  scale_fill_continuous_sequential(
    palette = "viridis",
    rev = FALSE,
    aesthetics = c("fill", "color"),
    labels = label_dollar(),
    name = NULL
  ) +
  map_labels

Overlay county borders

Your turn: To provide better context, overlay the NY county borders on the choropleth map.

ggplot(data = ny_inc) +
  # use fill and color to avoid gray boundary lines
  geom_sf(mapping = aes(fill = medincomeE, color = medincomeE)) +
  # add county borders
  geom_sf(data = ny_counties, color = "white", fill = NA) +
  # increase interpretability of graph
  scale_fill_continuous_sequential(
    palette = "viridis",
    rev = FALSE,
    aesthetics = c("fill", "color"),
    labels = label_dollar(),
    name = NULL
  ) +
  map_labels

Part 2

Your turn: Continuous color palettes can be hard to distinguish visibly. To improve readability, convert the continuous color palette into a discrete one with 6 levels. Additionally, modify the projection method to use this projection optimized for New York.

# binned_scale() - default breaks
ggplot(data = ny_inc) +
  geom_sf(mapping = aes(fill = medincomeE)) +
  geom_sf(data = ny_counties, color = "white", fill = NA) +
  scale_fill_binned_sequential(
    palette = "viridis",
    rev = FALSE,
    labels = label_dollar()
  ) +
  # increase interpretability of graph
  map_labels +
  coord_sf(crs = 2261)

sessioninfo::session_info()
─ Session info ───────────────────────────────────────────────────────────────
 setting  value
 version  R version 4.4.2 (2024-10-31)
 os       macOS Sonoma 14.6.1
 system   aarch64, darwin20
 ui       X11
 language (EN)
 collate  en_US.UTF-8
 ctype    en_US.UTF-8
 tz       America/New_York
 date     2025-03-28
 pandoc   3.4 @ /usr/local/bin/ (via rmarkdown)

─ Packages ───────────────────────────────────────────────────────────────────
 package      * version    date (UTC) lib source
 class          7.3-22     2023-05-03 [1] CRAN (R 4.4.2)
 classInt       0.4-10     2023-09-05 [1] CRAN (R 4.3.0)
 cli            3.6.3      2024-06-21 [1] CRAN (R 4.4.0)
 colorspace   * 2.1-1      2024-07-26 [1] CRAN (R 4.4.0)
 DBI            1.2.3      2024-06-02 [1] CRAN (R 4.4.0)
 dichromat      2.0-0.1    2022-05-02 [1] CRAN (R 4.3.0)
 digest         0.6.37     2024-08-19 [1] CRAN (R 4.4.1)
 dplyr        * 1.1.4      2023-11-17 [1] CRAN (R 4.3.1)
 e1071          1.7-14     2023-12-06 [1] CRAN (R 4.3.1)
 evaluate       1.0.3      2025-01-10 [1] CRAN (R 4.4.1)
 farver         2.1.2      2024-05-13 [1] CRAN (R 4.3.3)
 fastmap        1.2.0      2024-05-15 [1] CRAN (R 4.4.0)
 forcats      * 1.0.0      2023-01-29 [1] CRAN (R 4.3.0)
 generics       0.1.3      2022-07-05 [1] CRAN (R 4.3.0)
 ggplot2      * 3.5.1      2024-04-23 [1] CRAN (R 4.3.1)
 glue           1.8.0      2024-09-30 [1] CRAN (R 4.4.1)
 gtable         0.3.6      2024-10-25 [1] CRAN (R 4.4.1)
 here           1.0.1      2020-12-13 [1] CRAN (R 4.3.0)
 hms            1.1.3      2023-03-21 [1] CRAN (R 4.3.0)
 htmltools      0.5.8.1    2024-04-04 [1] CRAN (R 4.3.1)
 htmlwidgets    1.6.4      2023-12-06 [1] CRAN (R 4.3.1)
 jsonlite       1.8.9      2024-09-20 [1] CRAN (R 4.4.1)
 KernSmooth     2.23-24    2024-05-17 [1] CRAN (R 4.4.2)
 knitr          1.49       2024-11-08 [1] CRAN (R 4.4.1)
 labeling       0.4.3      2023-08-29 [1] CRAN (R 4.3.0)
 lifecycle      1.0.4      2023-11-07 [1] CRAN (R 4.3.1)
 lubridate    * 1.9.3      2023-09-27 [1] CRAN (R 4.3.1)
 magrittr       2.0.3      2022-03-30 [1] CRAN (R 4.3.0)
 pillar         1.10.1     2025-01-07 [1] CRAN (R 4.4.1)
 pkgconfig      2.0.3      2019-09-22 [1] CRAN (R 4.3.0)
 proxy          0.4-27     2022-06-09 [1] CRAN (R 4.3.0)
 purrr        * 1.0.2      2023-08-10 [1] CRAN (R 4.3.0)
 R6             2.5.1      2021-08-19 [1] CRAN (R 4.3.0)
 RColorBrewer   1.1-3      2022-04-03 [1] CRAN (R 4.3.0)
 Rcpp           1.0.14     2025-01-12 [1] CRAN (R 4.4.1)
 readr        * 2.1.5      2024-01-10 [1] CRAN (R 4.3.1)
 rlang          1.1.5      2025-01-17 [1] CRAN (R 4.4.1)
 rmarkdown      2.29       2024-11-04 [1] CRAN (R 4.4.1)
 rprojroot      2.0.4      2023-11-05 [1] CRAN (R 4.3.1)
 scales       * 1.3.0.9000 2025-03-19 [1] Github (bensoltoff/scales@71d8f13)
 sessioninfo    1.2.2      2021-12-06 [1] CRAN (R 4.3.0)
 sf           * 1.0-16     2024-03-24 [1] CRAN (R 4.4.0)
 stringi        1.8.4      2024-05-06 [1] CRAN (R 4.3.1)
 stringr      * 1.5.1      2023-11-14 [1] CRAN (R 4.3.1)
 tibble       * 3.2.1      2023-03-20 [1] CRAN (R 4.3.0)
 tidyr        * 1.3.1      2024-01-24 [1] CRAN (R 4.3.1)
 tidyselect     1.2.1      2024-03-11 [1] CRAN (R 4.3.1)
 tidyverse    * 2.0.0      2023-02-22 [1] CRAN (R 4.3.0)
 timechange     0.3.0      2024-01-18 [1] CRAN (R 4.3.1)
 tzdb           0.4.0      2023-05-12 [1] CRAN (R 4.3.0)
 units          0.8-5      2023-11-28 [1] CRAN (R 4.3.1)
 vctrs          0.6.5      2023-12-01 [1] CRAN (R 4.3.1)
 withr          3.0.2      2024-10-28 [1] CRAN (R 4.4.1)
 xfun           0.50.5     2025-01-15 [1] https://yihui.r-universe.dev (R 4.4.2)
 yaml           2.3.10     2024-07-26 [1] CRAN (R 4.4.0)

 [1] /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/library

──────────────────────────────────────────────────────────────────────────────