Motivation
The original LiPad site provides a SQL dump that requires you to configure PostreSQL on your own laptop or server.
Because some classmates asked me about it, I decided to host the database to simplify its access.
Description
Databases.pacha.dev provides a complete the PostgreSQL database of the Canadian Hansard dataset, including supplementary data. It covers the Canadian Hansard from 1901 to 2019 and doesn’t incldue modifications to LiPad contents, it only saves you the step of configuring PostgreSQL.
Usage
You can adapt this code for R. Please see my post about the census to get an idea of other functions you can use with SQL.
library(dplyr)
library(RPostgres)
con <- dbConnect(
Postgres(),
user = Sys.getenv("dbedu_usr"),
password = Sys.getenv("dbedu_pwd"),
dbname = "canadianhansard",
host = "databases.pacha.dev"
)
dbListTables(con)
tbl(con, "dilipadsite_basehansard") %>% glimpse()
dbDisconnect(con)Citing
For citation purposes, a full description of the project is available in the following article:
Beelen, K., Thijm, T. A., Cochrane, C., Halvemaan, K., Hirst, G., Kimmins, M., Lijbrink, S., Marx, M., Naderi, N., Rheault, L., Polyanovsky, R., and Whyte, T. (2017). “Digitization of the Canadian Parliamentary Debates.” Canadian Journal of Political Science, 50(3), 849–864. https://doi.org/10.1017/S0008423916001165.
Contributing
If you find this database to be useful, please consider donating on Buy me a Coffee. Additionally, I provide R and Python training services in English and Spanish language.