Connect to Data Sources
After adding an AWS account, you can connect to AWS Glue Data Catalogs to scan data sources that use AWS Glue as a data catalog (metadata catalog).
Connect to AWS Glue Data Source
Supported Big Data Data Types
For specific data formats supported by AWS Glue, please refer to Built-in Classifiers in AWS Glue.
The solution also supports Glue Hudi tables.
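To check which formats a Glue database actually contains before connecting it, one option is to read the `classification` value that Glue crawlers typically record in each table's parameters. The following is a minimal sketch, not part of the solution itself: the function names and the `example_db` database name are illustrative assumptions, and tables created without a crawler may carry no classification at all.

```python
from collections import Counter
from typing import Any, Dict, Iterable, Iterator


def table_classification(table: Dict[str, Any]) -> str:
    """Return the classification stored on a Glue table, or 'unknown' if absent."""
    params = table.get("Parameters") or {}
    return params.get("classification", "unknown")


def summarize_classifications(tables: Iterable[Dict[str, Any]]) -> Counter:
    """Count tables per classification (for example 'csv' or 'parquet')."""
    return Counter(table_classification(t) for t in tables)


def iter_glue_tables(glue_client: Any, database_name: str) -> Iterator[Dict[str, Any]]:
    """Yield every table in a Glue database using the get_tables paginator."""
    paginator = glue_client.get_paginator("get_tables")
    for page in paginator.paginate(DatabaseName=database_name):
        yield from page.get("TableList", [])


def summarize_database(database_name: str) -> Counter:
    """Summarize one database in the current AWS account and region.

    Assumes boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported lazily so the helpers above work without boto3

    glue = boto3.client("glue")
    return summarize_classifications(iter_glue_tables(glue, database_name))
```

Calling `summarize_database("example_db")` (a hypothetical database name) returns a count per classification, which can then be compared against the Built-in Classifiers list.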
- On the Connect Data Sources page, click an account to open its details page.
- On the Glue Data Catalogs tab, select a Glue connection, then choose Sync to Data Catalog.
- The catalog status turns to gray PENDING, indicating that the connection is starting (about 3 minutes).
- When the catalog status turns to green ACTIVE, the Glue Data Catalog has been synchronized to the SDP platform's data catalog.
At this point, you have successfully connected to the Glue Data Catalog and can proceed to the next steps.
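If the synchronization needs to be tracked from a script rather than the console, the PENDING to ACTIVE transition described above can be modeled as a simple polling loop. This is only a sketch under stated assumptions: `get_status` is a hypothetical callable that you would supply to return the current catalog status as a string, since this page does not document an API for reading it, and the timeout and interval values are arbitrary defaults.

```python
import time
from typing import Callable


def wait_until_synced(
    get_status: Callable[[], str],
    timeout_s: float = 600.0,
    interval_s: float = 15.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until the catalog status is no longer PENDING.

    get_status is a hypothetical, caller-supplied function. The first status
    other than "PENDING" is returned (normally "ACTIVE"); the caller decides
    how to treat any other value. Raises TimeoutError if the status is still
    PENDING once timeout_s seconds have elapsed.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status()
        if status != "PENDING":
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError("catalog status is still PENDING")
        sleep(interval_s)
```

Because the connection takes about 3 minutes to start, a timeout well above that figure leaves room for slower synchronizations.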