Multipart upload of one large file instead of a directory with per-partition files #15

@ryan-williams

Just wondering if my interpretation of some relevant docs and the code here is correct:

  • it seems like uploading a single large file in parts is possible / supported by S3 (docs):

    In a distributed development environment, it is possible for your application to initiate several updates on the same object at the same time. Your application might initiate several multipart uploads using the same object key. For each of these uploads, your application can then upload parts and send a complete upload request to Amazon S3 to create the object.

  • However, my reading of the code here is that s3committer doesn't support this; it only cares about jobs that output a directory containing one output file per task.

Is that all correct?
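
For concreteness, here is a rough sketch of the single-object flow I have in mind. It is not s3committer code, just the plain S3 multipart sequence (initiate, upload parts, complete). It assumes `s3` is a boto3 S3 client, and `upload_in_parts` and its parameters are hypothetical names made up for illustration. S3 also requires every part except the last to be at least 5 MiB, which the sketch does not check.

```python
from concurrent.futures import ThreadPoolExecutor


def upload_in_parts(s3, bucket, key, chunks, max_workers=4):
    """Upload `chunks` (a list of bytes objects) as ONE S3 object.

    `s3` is assumed to be a boto3 S3 client, e.g. boto3.client("s3").
    Hypothetical helper for illustration only.
    """
    # 1. Initiate the multipart upload and remember its upload ID.
    upload_id = s3.create_multipart_upload(Bucket=bucket, Key=key)["UploadId"]

    def upload_one(numbered_chunk):
        part_number, body = numbered_chunk
        resp = s3.upload_part(
            Bucket=bucket,
            Key=key,
            UploadId=upload_id,
            PartNumber=part_number,  # part numbers start at 1
            Body=body,
        )
        return {"PartNumber": part_number, "ETag": resp["ETag"]}

    try:
        # 2. Upload the parts concurrently; each one could just as well
        #    come from a separate task. map() returns results in input
        #    order, so the list is sorted by part number.
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            parts = list(pool.map(upload_one, enumerate(chunks, start=1)))

        # 3. Complete the upload; S3 assembles the parts into one object.
        s3.complete_multipart_upload(
            Bucket=bucket,
            Key=key,
            UploadId=upload_id,
            MultipartUpload={"Parts": parts},
        )
    except Exception:
        # Abort so the already-uploaded parts are not left behind.
        s3.abort_multipart_upload(Bucket=bucket, Key=key, UploadId=upload_id)
        raise
    return parts
```

In a job, each task would presumably own one part number and something at commit time would issue the single complete request, which is the piece I don't see in the code here.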
