ConfiguredAssetS3DataConnector
- class great_expectations.datasource.data_connector.ConfiguredAssetS3DataConnector(name: str, datasource_name: str, bucket: str, assets: dict, execution_engine: Optional[great_expectations.execution_engine.execution_engine.ExecutionEngine] = None, default_regex: Optional[dict] = None, sorters: Optional[list] = None, prefix: str = '', delimiter: str = '/', max_keys: int = 1000, boto3_options: Optional[dict] = None, batch_spec_passthrough: Optional[dict] = None, id: Optional[str] = None)#
-
Extension of ConfiguredAssetFilePathDataConnector used to connect to S3.
Being a Configured Asset Data Connector, it requires an explicit list of each Data Asset it can connect to. While this allows for fine-grained control over which Data Assets may be accessed, it requires more setup.
- Parameters:
-
-
name (str) – required name for DataConnector
-
datasource_name (str) – required name for datasource
-
bucket (str) – bucket for S3
-
assets (dict) – dict of asset configuration (required for ConfiguredAssetDataConnector)
-
execution_engine (ExecutionEngine) – optional reference to ExecutionEngine
-
default_regex (dict) – optional regex configuration for filtering data_references
-
sorters (list) – optional list of sorters for sorting data_references
-
prefix (str) – S3 prefix
-
delimiter (str) – S3 delimiter
-
max_keys (int) – S3 max_keys (default is 1000)
-
boto3_options (dict) – optional boto3 options
-
batch_spec_passthrough (dict) – dictionary with keys that will be added directly to batch_spec
-
- get_available_data_asset_names() List[str] #
-
Return the list of asset names known by this DataConnector.
- Returns:
-
A list of available names