Package: archiveRetriever
Title: Retrieve Archived Web Pages from the 'Internet Archive'
Version: 0.4.1
Authors@R: 
    c(person(given = "Lukas",
             family = "Isermann",
             role = c("aut", "cre"),
             email = "lukas.isermann@uni-mannheim.de",
             comment = c(ORCID = "https://orcid.org/0000-0002-7195-9302")),
             person(given = "Konstantin",
             family = "Gavras",
             role = "aut",
             email = "konstantin@gavras.de",
             comment = c(ORCID = "https://orcid.org/0000-0002-9222-0101")))
Description: Scraping content from archived web pages stored in
    the 'Internet Archive' (<https://archive.org>) using a systematic
    workflow.  Get an overview of the mementos available from the
    respective homepage, retrieve the Urls and links of the page and
    finally scrape the content. The final output is stored in tibbles,
    which can be then easily used for further analysis.
License: Apache License (>= 2.0)
URL: https://github.com/liserman/archiveRetriever/
Imports: anytime, dplyr, ggplot2, gridExtra, httr, jsonlite, lubridate,
        rvest, stringr, tibble, tidyr, utils, xml2
Suggests: vcr (>= 2.0.0), testthat, webmockr
Encoding: UTF-8
RoxygenNote: 7.3.3
NeedsCompilation: no
Packaged: 2025-10-16 13:00:33 UTC; liserman
Author: Lukas Isermann [aut, cre] (ORCID:
    <https://orcid.org/0000-0002-7195-9302>),
  Konstantin Gavras [aut] (ORCID:
    <https://orcid.org/0000-0002-9222-0101>)
Maintainer: Lukas Isermann <lukas.isermann@uni-mannheim.de>
Repository: CRAN
Date/Publication: 2025-10-16 14:40:02 UTC
Built: R 4.5.2; ; 2025-11-01 03:04:24 UTC; windows
