Write a table from PatentsView to S3: download_patents_view_to_s3_chunked (barketools)

Downloads a zipped table from patentsview.org, unzips it, reads it into R in chunks with readr::read_tsv_chunked(), and writes each chunk to s3_out.

Usage

download_patents_view_to_s3_chunked(
  table,
  s3_out,
  chunk_size = 1e+05,
  chunk_column_name = "chunk"
)

Arguments

table: Name of the PatentsView table to download, e.g., g_location_disambiguated.
s3_out: S3 path to write the chunks to, e.g., s3://<bucket>/path
chunk_size: The number of rows to include in each chunk. Defaults to 1e+05.
chunk_column_name: The name of the column used to partition the data on S3. Defaults to "chunk".
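
Examples

The sketch below illustrates the chunked read-and-write pattern described above. It is not the package's implementation. It assumes a small temporary TSV file in place of the unzipped PatentsView download and a local directory in place of s3_out, and it omits the download and unzip steps. The write_chunk helper and the "chunk=<n>" directory layout are hypothetical, chosen only to show how chunk_size and chunk_column_name could interact.

```r
library(readr)

# Stand-in for an unzipped PatentsView TSV file (assumption: tiny toy data).
tsv_path <- tempfile(fileext = ".tsv")
write_tsv(data.frame(id = 1:10, value = letters[1:10]), tsv_path)

# Local directory standing in for s3_out (assumption: no S3 access here).
out_dir <- tempfile("patents_view_sketch_")
dir.create(out_dir)

chunk_size <- 4
chunk_column_name <- "chunk"

# Hypothetical callback, run once per chunk. readr passes the chunk as `x`
# and the row number of the chunk's first row as `pos`.
write_chunk <- function(x, pos) {
  chunk_id <- as.integer((pos - 1) %/% chunk_size + 1)
  # Tag every row with its chunk number so the column can act as a partition key.
  x[[chunk_column_name]] <- chunk_id
  # One directory per chunk; this naming scheme is an assumption.
  part_dir <- file.path(out_dir, paste0(chunk_column_name, "=", chunk_id))
  dir.create(part_dir, showWarnings = FALSE)
  write_tsv(x, file.path(part_dir, "part.tsv"))
}

# Read the file chunk_size rows at a time, writing each chunk as it arrives.
read_tsv_chunked(
  tsv_path,
  callback = SideEffectChunkCallback$new(write_chunk),
  chunk_size = chunk_size,
  col_types = cols(id = col_integer(), value = col_character())
)

# Relative paths of the files written, one per chunk.
written <- list.files(out_dir, recursive = TRUE)
print(written)
```

Because each chunk is written as soon as it is read, only chunk_size rows need to be held in memory at a time, which is the main reason to prefer a chunked reader for large tables.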