Class S3FileIO
- All Implemented Interfaces:
Closeable,Serializable,AutoCloseable,CredentialSupplier,DelegateFileIO,FileIO,SupportsBulkOperations,SupportsPrefixOperations,SupportsRecoveryOperations
Locations used must follow the conventions for S3 URIs (e.g. s3://bucket/path...). URIs with
schemes s3a, s3n, https are also treated as s3 file paths. Using this FileIO with other schemes
will result in ValidationException.
- See Also:
-
Constructor Summary
ConstructorsConstructorDescriptionS3FileIO()No-arg constructor to load the FileIO dynamically.S3FileIO(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3) Constructor with custom s3 supplier and S3FileIO properties.S3FileIO(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3, S3FileIOProperties s3FileIOProperties) Constructor with custom s3 supplier and S3FileIO properties. -
Method Summary
Modifier and TypeMethodDescriptionsoftware.amazon.awssdk.services.s3.S3Clientclient()voidclose()Close File IO to release underlying resources.voiddeleteFile(String path) Delete the file at the given path.voiddeleteFiles(Iterable<String> paths) Deletes the given paths in a batched manner.voiddeletePrefix(String prefix) This method provides a "best-effort" to delete all objects under the given prefix.protected voidfinalize()Returns the credential stringvoidinitialize(Map<String, String> props) Initialize File IO from catalog properties.listPrefix(String prefix) Return an iterable of all files under a prefix.newInputFile(String path) Get aInputFileinstance to read bytes from the file at the given path.newInputFile(String path, long length) Get aInputFileinstance to read bytes from the file at the given path, with a known file length.newOutputFile(String path) Get aOutputFileinstance to write bytes to the file at the given path.Returns the property map used to configure this FileIObooleanrecoverFile(String path) Perform a best-effort recovery of a file at a given pathMethods inherited from class java.lang.Object
clone, equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.iceberg.io.FileIO
deleteFile, deleteFile, newInputFile, newInputFile, newInputFile
-
Constructor Details
-
S3FileIO
public S3FileIO()No-arg constructor to load the FileIO dynamically.All fields are initialized by calling
initialize(Map)later. -
S3FileIO
Constructor with custom s3 supplier and S3FileIO properties.Calling
initialize(Map)will overwrite information set in this constructor.- Parameters:
s3- s3 supplier
-
S3FileIO
public S3FileIO(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3, S3FileIOProperties s3FileIOProperties) Constructor with custom s3 supplier and S3FileIO properties.Calling
initialize(Map)will overwrite information set in this constructor.- Parameters:
s3- s3 suppliers3FileIOProperties- S3 FileIO properties
-
-
Method Details
-
newInputFile
Description copied from interface:FileIOGet aInputFileinstance to read bytes from the file at the given path.- Specified by:
newInputFilein interfaceFileIO
-
newInputFile
Description copied from interface:FileIOGet aInputFileinstance to read bytes from the file at the given path, with a known file length.- Specified by:
newInputFilein interfaceFileIO
-
newOutputFile
Description copied from interface:FileIOGet aOutputFileinstance to write bytes to the file at the given path.- Specified by:
newOutputFilein interfaceFileIO
-
deleteFile
Description copied from interface:FileIODelete the file at the given path.- Specified by:
deleteFilein interfaceFileIO
-
properties
Description copied from interface:FileIOReturns the property map used to configure this FileIO- Specified by:
propertiesin interfaceFileIO
-
deleteFiles
Deletes the given paths in a batched manner.The paths are grouped by bucket, and deletion is triggered when we either reach the configured batch size or have a final remainder batch for each bucket.
- Specified by:
deleteFilesin interfaceSupportsBulkOperations- Parameters:
paths- paths to delete- Throws:
BulkDeletionFailureException- in case of failure to delete at least 1 file
-
listPrefix
Description copied from interface:SupportsPrefixOperationsReturn an iterable of all files under a prefix.Hierarchical file systems (e.g. HDFS) may impose additional restrictions like the prefix must fully match a directory whereas key/value object stores may allow for arbitrary prefixes.
- Specified by:
listPrefixin interfaceSupportsPrefixOperations- Parameters:
prefix- prefix to list- Returns:
- iterable of file information
-
deletePrefix
This method provides a "best-effort" to delete all objects under the given prefix.Bulk delete operations are used and no reattempt is made for deletes if they fail, but will log any individual objects that are not deleted as part of the bulk operation.
- Specified by:
deletePrefixin interfaceSupportsPrefixOperations- Parameters:
prefix- prefix to delete
-
client
public software.amazon.awssdk.services.s3.S3Client client() -
getCredential
Description copied from interface:CredentialSupplierReturns the credential string- Specified by:
getCredentialin interfaceCredentialSupplier
-
initialize
Description copied from interface:FileIOInitialize File IO from catalog properties.- Specified by:
initializein interfaceFileIO- Parameters:
props- catalog properties
-
close
public void close()Description copied from interface:FileIOClose File IO to release underlying resources.Calling this method is only required when this FileIO instance is no longer expected to be used, and the resources it holds need to be explicitly released to avoid resource leaks.
-
finalize
-
recoverFile
Description copied from interface:SupportsRecoveryOperationsPerform a best-effort recovery of a file at a given path- Specified by:
recoverFilein interfaceSupportsRecoveryOperations- Parameters:
path- Absolute path of file to attempt recovery for- Returns:
- true if recovery was successful, false otherwise
-