Skip to contents

Downloads zipped table from patentsview.org, unzips it, reads it into R using readr::read_tsv_chunked, and writes each chunk to s3_out.

Usage

download_patents_view_to_s3_chunked(
  table,
  s3_out,
  chunk_size = 1e+05,
  chunk_column_name = "chunk"
)

Arguments

table

Table name, e.g., g_location_disambiguated.

s3_out

S3 path, e.g., s3://<bucket>/path

chunk_size

The number of rows to include in each chunk.

chunk_column_name

The column name that will be used to partition the data on S3.