Get tables from AWS Glue using boto3

Question

I need to harvest table and column names from the AWS Glue crawler metadata catalogue. I used boto3, but I constantly get only 100 tables even though there are more. Setting NextToken doesn't help. Please help if possible. The desired result is a list as follows: lst = [table_one.col_one, table_one.col_two, table_two.col_one, ..., table_n.col_n]

Update: with the updated code, I still need each entry in tablename.columnname form.

Accepted Answer

Adding a sub-loop did the trick to get the table + column result.

    # harvest AWS crawler metadata
    import boto3

    next_token = ""
    client = boto3.client('glue', region_name='us-east-1')
    crawler_tables = []

    while True:
        # DatabaseName was left blank in the post; supply your own database name
        response = client.get_tables(DatabaseName='', NextToken=next_token)
        for tables in response['TableList']:
            # sub-loop: one entry per column, prefixed with the table name
            for columns in tables['StorageDescriptor']['Columns']:
                crawler_tables.append(tables['Name'] + '.' + columns['Name'])
        # NextToken is absent from the last page, which ends the loop
        next_token = response.get('NextToken')
        if next_token is None:
            break

    print(crawler_tables)
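As an alternative to tracking NextToken by hand, boto3 also offers a paginator for get_tables. This is not the approach from the answer, only a minimal sketch of the same table + column harvest. The function name list_table_columns and the database name in the usage comment are made up for illustration, and the `.get` fallbacks assume some tables may carry no StorageDescriptor or Columns.

```python
def list_table_columns(glue_client, database_name):
    """Return 'table.column' strings for every table in a Glue database."""
    # The paginator follows NextToken internally, so no manual loop is needed.
    paginator = glue_client.get_paginator("get_tables")
    qualified = []
    for page in paginator.paginate(DatabaseName=database_name):
        for table in page["TableList"]:
            # Assumption: fall back to no columns if the descriptor is missing.
            columns = table.get("StorageDescriptor", {}).get("Columns", [])
            for column in columns:
                qualified.append(table["Name"] + "." + column["Name"])
    return qualified


# Hypothetical usage ("my_database" is a placeholder):
#   import boto3
#   client = boto3.client("glue", region_name="us-east-1")
#   print(list_table_columns(client, "my_database"))
```

Passing the client in as an argument keeps the function easy to try out with a stub object before pointing it at a real account.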
