slide_etl
Ingest slide by adding them to a file or s3 based storage location and generating metadata about them
Parameters:
Name | Type | Description | Default |
---|---|---|---|
slide_urlpath |
str
|
path to slide image |
'???'
|
project_name |
str
|
project name underwhich the slides should reside |
''
|
comment |
str
|
comment and description of dataset |
''
|
subset_csv_urlpath |
str
|
url/path to subset csv |
''
|
debug_limit |
int
|
limit number of slides |
0
|
output_urlpath |
str
|
url/path to output table |
''
|
storage_options |
dict
|
storage options to pass to reading functions |
{}
|
output_storage_options |
dict
|
storage options to pass to writing functions |
{}
|
local_config |
str
|
url/path to YAML config file |
''
|
no_copy |
bool
|
determines whether we copy slides to output_urlpath |
False
|
metadata_extension(str) |
file extension of generated metadata file (either 'csv' or 'parquet') |
required |