Skip to content
Advertisement

Pyathena is super slow compared to querying from Athena

I run a query from AWS Athena console and takes 10s. The same query run from Sagemaker using PyAthena takes 155s. Is PyAthena slowing it down or is the data transfer from Athena to sagemaker so time consuming?

What could I do to speed this up?

Advertisement

Answer

Just figure out a way of boosting the queries:

Before I was trying:

JavaScript

Figured out that using a PandasCursor instead of a connection is way faster

JavaScript

Ref: https://github.com/laughingman7743/PyAthena/issues/46

User contributions licensed under: CC BY-SA
2 People found this is helpful
Advertisement