duckdb/duckdb_iceberg

Support AWS Glue Catalog

rustyconover opened this issue · 4 comments

I'd like to add support for "schemes" that prefix the location supplied for Iceberg tables.

For example I'd like to support AWS Glue with a scheme like:

iceberg_scan('aws-glue://[catalog id].[database_name].[table_name]').

The use of this scheme would allow DuckDB to access Iceberg tables where the current snapshot version is stored in a Glue data catalog. The Iceberg extension would call Glue's describe_table() API method, get the current metadata_url and parse the JSON file.

I'd also like to add a companion function called iceberg_scan_tables('aws-glue://[catalog id]') that would scan all databases and tables under the schema and create views that are backed by iceberg_scan(), so this can expose the databases and tables in glue to the end user.

Might be relevant here too :
#22

Would be great to see glue supported here as an option - especially with Lake Formation as well

I have a pending PR for this functionality.

#51 should close this issue.