Skip to content
Advertisement

How to Read data from Jdbc and write to bigquery using Apache Beam Python Sdk

I am trying to write a Pipeline which will Read Data From JDBC(oracle,mssql) , do something and write to bigquery.

I am Struggling in the ReadFromJdbc steps where it was not able to convert it correct schema type.

My Code:

JavaScript

My data has three columns two of which are Varchar and one is timestamp. Here is how my data looks like

Error which i am facing while running from dataflow as well as direct runner

JavaScript

Any Help regarding this would be greatly appreciated.

Advertisement

Answer

JdbcIO appears to rely on Java-only logical types, so Python cannot deserialize them. This is tracked in https://issues.apache.org/jira/browse/BEAM-13717

User contributions licensed under: CC BY-SA
10 People found this is helpful
Advertisement