Type: | Package |
Title: | All Things Data and Springsteen |
Version: | 0.1.0 |
Maintainer: | Joey O'Brien <obrienjoeyd@gmail.com> |
Description: | An R data package containing setlists from all Bruce Springsteen concerts over 1973-2021. Also includes all his song details such as lyrics and albums. Data extracted from: http://brucebase.wikidot.com/. |
License: | CC0 |
URL: | https://github.com/obrienjoey/spRingsteen, https://obrienjoey.github.io/spRingsteen/ |
BugReports: | https://github.com/obrienjoey/spRingsteen/issues |
Encoding: | UTF-8 |
LazyData: | true |
Depends: | R (≥ 2.10) |
RoxygenNote: | 7.2.1 |
Suggests: | tidytext, dplyr, lubridate, readr, stringr, magrittr, tidyselect |
Imports: | devtools, rlang |
NeedsCompilation: | no |
Packaged: | 2022-08-18 20:27:03 UTC; joeyobrien |
Author: | Joey O'Brien [aut, cre] |
Repository: | CRAN |
Date/Publication: | 2022-09-15 21:26:08 UTC |
Metadata for concerts in the spRingsteen dataset.
Description
Metadata for concerts played by Bruce Springsteen both solo and
with numerous bands from the years 1973 to present day. Can be joined with
setlists
using gig_key
.
Usage
concerts
Format
A data frame with 6 variables:
- gig_key
Primary key of the data frame.
- date
Date of the concert.
- location
Full location of concert including venue name.
- state
State concert was performed in (if in USA).
- city
City in which the concert was performed (if not in USA).
- country
Country concert was performed in.
Source
Examples
library(dplyr)
# What countries have been played in the most?
concerts %>%
count(country, sort = TRUE)
# What decade did most shows take place in?
library(lubridate)
concerts %>%
select(date) %>%
mutate(decade = (year(date) %/% 10) * 10) %>%
count(decade)
Metadata for setlists in the spRingsteen dataset.
Description
Metadata for the setlists of concerts played by Bruce Springsteen both solo and with numerous bands from the years 1973 to present day.
Usage
setlists
Format
A data frame with4 variables:
- gig_key
Key associated with the concert which the setlist is from.
- song_key
Key associated with the song played.
- song
Name of the song played.
- song_number
Order of appearance for the song in the setlist.
Source
Examples
library(dplyr)
# what are the top five most played songs?
setlists %>%
count(song, sort = TRUE) %>%
slice(1:5)
# what is the average show length?
setlists %>%
count(gig_key) %>%
summarise(ave_length = mean(n))
Data for all songs in the spRingsteen dataset.
Description
Data describing all songs which have been played by Bruce Springsteen both
solo and with numerous bands from the year 1973 to present day. Can be joined with
setlists
using song_key
.
Usage
songs
Format
A data frame with 4 variables:
- song_key
Primary key of the data frame.
- title
Title of the song.
- lyrics
Lyrics of the song if available in the database.
- album
Name of the album on which the song appears if available in the database.
Source
Examples
library(dplyr)
# What are the most common albums?
songs %>%
filter(!is.na(album)) %>%
count(album, sort = TRUE)
# What word occurs most frequently in the lyrics from the album 'Born To Run'
library(tidytext)
songs %>%
filter(album == 'Born To Run') %>%
select(title, lyrics) %>%
unnest_tokens(word, lyrics) %>%
count(word, sort = TRUE) %>%
anti_join(stop_words, by = 'word')
Tour information for concerts in the spRingsteen dataset.
Description
Data describing the tours associated with concerts played by Bruce
Springsteen both solo and with numerous bands from the years 1973 to present day.
Note that concerts prior to 1973 and non-tour, e.g., practice shows,
promotion shows, have been removed. Furthermore some of the shows are
associated with more than one tour as such some of the entries from
concerts
appear twice. Can be joined with setlists
or concerts
using gig_key
.
Usage
tours
Format
A data frame with 2 variables:
- gig_key
Primary key of the data frame.
- tour
Tour associated with the concert. Note some concerts have more than one tour associated with them.
Source
Examples
library(dplyr)
# How many shows were on each tour?
tours %>%
count(tour, sort = TRUE)
Update the Package Datasets
Description
Checks if new data is available on the package dev version (Github). In case new data is available the function will enable the user the update the datasets
Usage
update_data()
Examples
## Not run:
data_update()
## End(Not run)