Wraps the SA Hansard API (http://parliament-api-docs.readthedocs.io/en/latest/south-australia/), producing "tidy" data.frame
objects or rendering the resulting HTML.
This package can be installed from GitHub using the devtools::install_github
method:
devtools::install_github("jonocarroll/SAHansard")
Obtain all SA Hansard records for a given year
hansard_year(year = "2017")
#> <Hansard 2017>
#> date chamber PdfDocId TocDocId
#> 1 2017-02-16 Legislative Council HANSARD-10-20106 HANSARD-10-20107
#> 2 2017-02-16 House of Assembly HANSARD-11-25481 HANSARD-11-25482
#> 3 2017-02-15 House of Assembly HANSARD-11-25420 HANSARD-11-25421
#> 4 2017-02-15 Legislative Council HANSARD-10-20049 HANSARD-10-20050
#> 5 2017-02-14 Legislative Council HANSARD-10-19979 HANSARD-10-19980
#> 6 2017-02-14 House of Assembly HANSARD-11-25353 HANSARD-11-25354
#> Uncorrected
#> 1 FALSE
#> 2 FALSE
#> 3 FALSE
#> 4 FALSE
#> 5 FALSE
#> 6 FALSE
From these, a given Table of Contents can be obtained (for a given docid
from hansard_year
)
# normally just evaluating hansard_toc is what you want,
# here I only want to show limited results
glimpse(hansard_toc(docid = 'HANSARD-10-19980'))
#> <Hansard HANSARD-10-19980>
#> Observations: 72
#> Variables: 16
#> $ pdfid <chr> "HANSARD-10-19979", "HANSARD-10-19979", "HANSARD-10...
#> $ type <chr> "Root", "Root", "Root", "Root", "Root", "Root", "Ro...
#> $ expanded <chr> "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TR...
#> $ date <chr> "2017-02-14", "2017-02-14", "2017-02-14", "2017-02-...
#> $ chamber <chr> "Legislative Council", "Legislative Council", "Legi...
#> $ name <chr> "Parliamentary Committees", "Ministerial Statement"...
#> $ type.1 <chr> "Proceeding", "Proceeding", "Proceeding", "Proceedi...
#> $ expanded.1 <chr> "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TR...
#> $ name.1 <chr> "Crime and Public Integrity Policy Committee", "Cry...
#> $ type.2 <chr> "Topic", "Topic", "Topic", "Topic", "Topic", "Topic...
#> $ docid <chr> "HANSARD-10-19939", "HANSARD-10-19940", "HANSARD-10...
#> $ expanded.2 <chr> "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TRUE", "TR...
#> $ xref <chr> "1", "1", "1", "2", "1", "2", "1", "2", "1", "2", "...
#> $ name.2 <chr> "The Hon. D.G.E. HOOD", "The Hon. P. MALINAUSKAS", ...
#> $ type.3 <chr> "Member", "Member", "Member", "Member", "Member", "...
#> $ id <chr> "3126", "5084", "1820", "3122", "1820", "3122", "27...
From these, a HTML fragment can be obtained (use the pdfid
from hansard_toc
)
# the print method for this result opens the
# rendered HTML fragment in the RStudio Viewer with
# CSS obtained from the Hansard site.
# Here I am just printing the content
hansard_fragment(pdfid = 'HANSARD-10-19961')$content
[1] "<fragment.text>
South Road Tram Overpass
The Hon. P. MALINAUSKAS (Minister for Police, Minister for Correctional Services, Minister for Emergency Services, Minister for Road Safety) (16:19:45): I table a copy of a ministerial statement relating to the tram overpass made earlier today in another place by my colleague the Minister for Transport and Infrastructure.
</fragment.text>"