Class RedisMetadataHandler
- java.lang.Object
-
- com.amazonaws.athena.connector.lambda.handlers.MetadataHandler
-
- com.amazonaws.athena.connector.lambda.handlers.GlueMetadataHandler
-
- com.amazonaws.athena.connectors.redis.RedisMetadataHandler
-
- All Implemented Interfaces:
FederationRequestHandler,com.amazonaws.services.lambda.runtime.RequestStreamHandler
public class RedisMetadataHandler extends GlueMetadataHandler
Handles metadata requests for the Athena Redis Connector using Glue for schema.For more detail, please see the module's README.md, some notable characteristics of this class include:
1. Uses Glue table properties (redis-endpoint, redis-value-type, redis-key-prefix, redis-keys-zset, redis-ssl-flag, redis-cluster-flag, and redis-db-number) to provide schema as well as connectivity details to Redis. 2. Attempts to resolve sensitive fields such as redis-endpoint via SecretsManager so that you can substitute variables with values from by doing something like hostname:port:password=${my_secret}
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class com.amazonaws.athena.connector.lambda.handlers.GlueMetadataHandler
GlueMetadataHandler.DatabaseFilter, GlueMetadataHandler.TableFilter
-
-
Field Summary
Fields Modifier and Type Field Description static StringDEFAULT_REDIS_DB_NUMBERprotected static StringKEY_COLUMN_NAMEprotected static StringKEY_PREFIX_SEPERATORprotected static StringKEY_PREFIX_TABLE_PROPprotected static StringKEY_TYPEprotected static StringQPT_CLUSTER_ENV_VARprotected static StringQPT_COLUMN_NAMEprotected static StringQPT_DB_NUMBER_ENV_VARprotected static StringQPT_ENDPOINT_ENV_VARprotected static StringQPT_SSL_ENV_VARprotected static StringREDIS_CLUSTER_FLAGprotected static StringREDIS_DB_FLAGprotected static StringREDIS_DB_NUMBERprotected static StringREDIS_ENDPOINT_PROPprotected static StringREDIS_SSL_FLAGprotected static StringSPLIT_END_INDEXprotected static StringSPLIT_START_INDEXprotected static StringVALUE_TYPE_TABLE_PROPprotected static StringZSET_KEYS_TABLE_PROP-
Fields inherited from class com.amazonaws.athena.connector.lambda.handlers.GlueMetadataHandler
COLUMN_NAME_MAPPING_PROPERTY, DATETIME_FORMAT_MAPPING_PROPERTY, DATETIME_FORMAT_MAPPING_PROPERTY_NORMALIZED, GET_TABLES_REQUEST_MAX_RESULTS, GLUE_TABLE_CONTAINS_PREVIOUSLY_UNSUPPORTED_TYPE, SOURCE_TABLE_PROPERTY, VIEW_METADATA_FIELD
-
Fields inherited from class com.amazonaws.athena.connector.lambda.handlers.MetadataHandler
configOptions, DISABLE_SPILL_ENCRYPTION, KMS_KEY_ID_ENV, SPILL_BUCKET_ENV, SPILL_PREFIX_ENV
-
-
Constructor Summary
Constructors Modifier Constructor Description RedisMetadataHandler(Map<String,String> configOptions)protectedRedisMetadataHandler(software.amazon.awssdk.services.glue.GlueClient awsGlue, EncryptionKeyFactory keyFactory, software.amazon.awssdk.services.secretsmanager.SecretsManagerClient secretsManager, software.amazon.awssdk.services.athena.AthenaClient athena, RedisConnectionFactory redisConnectionFactory, String spillBucket, String spillPrefix, Map<String,String> configOptions)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected org.apache.arrow.vector.types.pojo.FieldconvertField(String name, String type)Overrides the default Glue Type to Apache Arrow Type mapping so that we can fail fast on tables which define types that are not supported by this connector.GetDataSourceCapabilitiesResponsedoGetDataSourceCapabilities(BlockAllocator allocator, GetDataSourceCapabilitiesRequest request)Used to describe the types of capabilities supported by a data source.GetTableResponsedoGetQueryPassthroughSchema(BlockAllocator allocator, GetTableRequest request)Used to get definition (field names, types, descriptions, etc...) of a Query PassThrough.GetSplitsResponsedoGetSplits(BlockAllocator blockAllocator, GetSplitsRequest request)If the table is comprised of multiple key prefixes, then we parallelize those by making them each a split.GetTableResponsedoGetTable(BlockAllocator blockAllocator, GetTableRequest request)Retrieves the schema for the request Table from Glue then enriches that result with Redis specific metadata and columns.ListSchemasResponsedoListSchemaNames(BlockAllocator blockAllocator, ListSchemasRequest request)Returns an unfiltered list of schemas (aka databases) from AWS Glue DataCatalog.ListTablesResponsedoListTables(BlockAllocator blockAllocator, ListTablesRequest request)Returns an unfiltered list of tables from AWS Glue DataCatalog for the requested schema (aka database)voidenhancePartitionSchema(SchemaBuilder partitionSchemaBuilder, GetTableLayoutRequest request)This method can be used to add additional fields to the schema of our partition response.voidgetPartitions(BlockWriter blockWriter, GetTableLayoutRequest request, QueryStatusChecker queryStatusChecker)Even though our table doesn't support complex layouts or partitioning, we need to convey that there is at least 1 partition to read as part of the query or Athena will assume partition pruning found no candidate layouts to read.-
Methods inherited from class com.amazonaws.athena.connector.lambda.handlers.GlueMetadataHandler
doGetTable, doListSchemaNames, doListTables, getAwsGlue, getCatalog, getColumnNameMapping, getSourceTableName, populateSourceTableNameIfAvailable
-
Methods inherited from class com.amazonaws.athena.connector.lambda.handlers.MetadataHandler
doGetTableLayout, doHandleRequest, doPing, getCachableSecretsManager, getRequestOverrideConfig, getSecret, handleRequest, makeEncryptionKey, makeSpillLocation, onPing, resolveSecrets, resolveWithDefaultCredentials
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface com.amazonaws.athena.connector.lambda.handlers.FederationRequestHandler
getAthenaClient, getRequestOverrideConfig, getS3Client, getSessionCredentials
-
-
-
-
Field Detail
-
KEY_COLUMN_NAME
protected static final String KEY_COLUMN_NAME
- See Also:
- Constant Field Values
-
SPLIT_START_INDEX
protected static final String SPLIT_START_INDEX
- See Also:
- Constant Field Values
-
SPLIT_END_INDEX
protected static final String SPLIT_END_INDEX
- See Also:
- Constant Field Values
-
QPT_COLUMN_NAME
protected static final String QPT_COLUMN_NAME
- See Also:
- Constant Field Values
-
QPT_ENDPOINT_ENV_VAR
protected static final String QPT_ENDPOINT_ENV_VAR
- See Also:
- Constant Field Values
-
QPT_SSL_ENV_VAR
protected static final String QPT_SSL_ENV_VAR
- See Also:
- Constant Field Values
-
QPT_CLUSTER_ENV_VAR
protected static final String QPT_CLUSTER_ENV_VAR
- See Also:
- Constant Field Values
-
QPT_DB_NUMBER_ENV_VAR
protected static final String QPT_DB_NUMBER_ENV_VAR
- See Also:
- Constant Field Values
-
KEY_TYPE
protected static final String KEY_TYPE
- See Also:
- Constant Field Values
-
VALUE_TYPE_TABLE_PROP
protected static final String VALUE_TYPE_TABLE_PROP
- See Also:
- Constant Field Values
-
KEY_PREFIX_TABLE_PROP
protected static final String KEY_PREFIX_TABLE_PROP
- See Also:
- Constant Field Values
-
ZSET_KEYS_TABLE_PROP
protected static final String ZSET_KEYS_TABLE_PROP
- See Also:
- Constant Field Values
-
KEY_PREFIX_SEPERATOR
protected static final String KEY_PREFIX_SEPERATOR
- See Also:
- Constant Field Values
-
REDIS_ENDPOINT_PROP
protected static final String REDIS_ENDPOINT_PROP
- See Also:
- Constant Field Values
-
REDIS_DB_FLAG
protected static final String REDIS_DB_FLAG
- See Also:
- Constant Field Values
-
REDIS_SSL_FLAG
protected static final String REDIS_SSL_FLAG
- See Also:
- Constant Field Values
-
REDIS_CLUSTER_FLAG
protected static final String REDIS_CLUSTER_FLAG
- See Also:
- Constant Field Values
-
REDIS_DB_NUMBER
protected static final String REDIS_DB_NUMBER
- See Also:
- Constant Field Values
-
DEFAULT_REDIS_DB_NUMBER
public static final String DEFAULT_REDIS_DB_NUMBER
- See Also:
- Constant Field Values
-
-
Constructor Detail
-
RedisMetadataHandler
protected RedisMetadataHandler(software.amazon.awssdk.services.glue.GlueClient awsGlue, EncryptionKeyFactory keyFactory, software.amazon.awssdk.services.secretsmanager.SecretsManagerClient secretsManager, software.amazon.awssdk.services.athena.AthenaClient athena, RedisConnectionFactory redisConnectionFactory, String spillBucket, String spillPrefix, Map<String,String> configOptions)
-
-
Method Detail
-
doGetDataSourceCapabilities
public GetDataSourceCapabilitiesResponse doGetDataSourceCapabilities(BlockAllocator allocator, GetDataSourceCapabilitiesRequest request)
Description copied from class:MetadataHandlerUsed to describe the types of capabilities supported by a data source. An engine can use this to determine what portions of the query to push down. A connector that returns any optimization will guarantee that the associated predicate will be pushed down.- Overrides:
doGetDataSourceCapabilitiesin classMetadataHandler- Parameters:
allocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details about the catalog being used.- Returns:
- A GetDataSourceCapabilitiesResponse object which returns a map of supported optimizations that the connector is advertising to the consumer. The connector assumes all responsibility for whatever is passed here.
-
doListSchemaNames
public ListSchemasResponse doListSchemaNames(BlockAllocator blockAllocator, ListSchemasRequest request) throws Exception
Description copied from class:GlueMetadataHandlerReturns an unfiltered list of schemas (aka databases) from AWS Glue DataCatalog.- Overrides:
doListSchemaNamesin classGlueMetadataHandler- Parameters:
blockAllocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details on who made the request and which Athena catalog they are querying.- Returns:
- The ListSchemasResponse which mostly contains the list of schemas (aka databases).
- Throws:
Exception- See Also:
GlueMetadataHandler
-
doListTables
public ListTablesResponse doListTables(BlockAllocator blockAllocator, ListTablesRequest request) throws Exception
Description copied from class:GlueMetadataHandlerReturns an unfiltered list of tables from AWS Glue DataCatalog for the requested schema (aka database)- Overrides:
doListTablesin classGlueMetadataHandler- Parameters:
blockAllocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details on who made the request and which Athena catalog they are querying.- Returns:
- The ListTablesResponse which mostly contains the list of table names.
- Throws:
Exception- See Also:
GlueMetadataHandler
-
doGetTable
public GetTableResponse doGetTable(BlockAllocator blockAllocator, GetTableRequest request) throws Exception
Retrieves the schema for the request Table from Glue then enriches that result with Redis specific metadata and columns.- Overrides:
doGetTablein classGlueMetadataHandler- Parameters:
blockAllocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details on who made the request and which Athena catalog, database, and table they are querying.- Returns:
- A GetTableResponse mostly containing the columns, their types, and any table properties for the requested table.
- Throws:
Exception
-
doGetQueryPassthroughSchema
public GetTableResponse doGetQueryPassthroughSchema(BlockAllocator allocator, GetTableRequest request) throws Exception
Description copied from class:MetadataHandlerUsed to get definition (field names, types, descriptions, etc...) of a Query PassThrough.- Overrides:
doGetQueryPassthroughSchemain classMetadataHandler- Parameters:
allocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details on who made the request and which Athena catalog, database, and table they are querying.- Returns:
- A GetTableResponse which primarily contains:
1. An Apache Arrow Schema object describing the table's columns, types, and descriptions.
2. A Set
of partition column names (or empty if the table isn't partitioned). - Throws:
Exception
-
enhancePartitionSchema
public void enhancePartitionSchema(SchemaBuilder partitionSchemaBuilder, GetTableLayoutRequest request)
Description copied from class:MetadataHandlerThis method can be used to add additional fields to the schema of our partition response. Athena expects each partitions in the response to have a column corresponding to your partition columns. You can choose to add additional columns to that response which Athena will ignore but will pass on to you when it call GetSplits(...) for each partition.- Overrides:
enhancePartitionSchemain classMetadataHandler- Parameters:
partitionSchemaBuilder- The SchemaBuilder you can use to add additional columns and metadata to the partitions response.request- The GetTableLayoutResquest that triggered this call.
-
getPartitions
public void getPartitions(BlockWriter blockWriter, GetTableLayoutRequest request, QueryStatusChecker queryStatusChecker) throws Exception
Even though our table doesn't support complex layouts or partitioning, we need to convey that there is at least 1 partition to read as part of the query or Athena will assume partition pruning found no candidate layouts to read. We also use this 1 partition to carry settings that we will need in order to generate splits.- Specified by:
getPartitionsin classMetadataHandler- Parameters:
blockWriter- Used to write rows (partitions) into the Apache Arrow response.request- Provides details of the catalog, database, and table being queried as well as any filter predicate.queryStatusChecker- A QueryStatusChecker that you can use to stop doing work for a query that has already terminated- Throws:
Exception
-
doGetSplits
public GetSplitsResponse doGetSplits(BlockAllocator blockAllocator, GetSplitsRequest request)
If the table is comprised of multiple key prefixes, then we parallelize those by making them each a split.- Specified by:
doGetSplitsin classMetadataHandler- Parameters:
blockAllocator- Tool for creating and managing Apache Arrow Blocks.request- Provides details of the catalog, database, table, andpartition(s) being queried as well as any filter predicate.- Returns:
- A GetSplitsResponse which primarily contains:
1. A Set
which represent read operations Amazon Athena must perform by calling your read function. 2. (Optional) A continuation token which allows you to paginate the generation of splits for large queries.
-
convertField
protected org.apache.arrow.vector.types.pojo.Field convertField(String name, String type)
Overrides the default Glue Type to Apache Arrow Type mapping so that we can fail fast on tables which define types that are not supported by this connector.- Overrides:
convertFieldin classGlueMetadataHandler- Parameters:
name- The name of the field in Glue.type- The type of the field in Glue.- Returns:
- The corresponding Apache Arrow Field.
-
-