In pandas, if we have a time series and need to group it by a certain frequency (say, every two weeks), it’s possible to use the Grouper class, like this: Is there any equivalent in Spark (more specifically, using Scala) for this feature? Answer You can use the SQL function window. First, you create the timestamp column, if you don’t
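For readers unfamiliar with the pandas side of the comparison, a minimal sketch of two-week grouping with Grouper; the column name `ts` and the sample data are hypothetical, and the Spark analogue is shown only as a comment:

```python
import pandas as pd

# Hypothetical daily time series spanning six weeks.
df = pd.DataFrame({
    "ts": pd.date_range("2024-01-01", periods=42, freq="D"),
    "value": range(42),
})

# Group into two-week buckets with Grouper.
grouped = df.groupby(pd.Grouper(key="ts", freq="2W")).sum(numeric_only=True)

# The rough Spark (Scala) analogue uses the SQL window function:
#   import org.apache.spark.sql.functions.window
#   df.groupBy(window($"ts", "2 weeks")).sum("value")
```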
Tag: scala
Uploading files from Azure Blob Storage to SFTP location using Databricks?
I have a scenario where I need to copy files from Azure Blob Storage to an SFTP location in Databricks. Is there a way to achieve this using PySpark or Scala? Answer Regarding the issue, please refer to the following steps (I use Scala): Mount Azure Blob Storage containers to DBFS. Copy these files to the cluster's local file system. Code.
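The steps the answer outlines can be sketched as below; the storage account, container, key, paths, and SFTP host are all hypothetical placeholders, and the dbutils/paramiko calls (shown as comments) only run inside a Databricks cluster:

```python
# Hypothetical names -- replace with your own storage account, container and key.
account = "mystorageaccount"
container = "mycontainer"
source = f"wasbs://{container}@{account}.blob.core.windows.net"
mount_point = f"/mnt/{container}"
conf_key = f"fs.azure.account.key.{account}.blob.core.windows.net"

# 1) Mount the Blob Storage container to DBFS (Databricks only):
#   dbutils.fs.mount(source=source, mount_point=mount_point,
#                    extra_configs={conf_key: "<storage-account-key>"})
# 2) Copy a file from DBFS to the cluster's local file system:
#   dbutils.fs.cp(f"dbfs:{mount_point}/data.csv", "file:/tmp/data.csv")
# 3) Upload the local file over SFTP (e.g. with the paramiko library):
#   transport = paramiko.Transport(("sftp.example.com", 22))
#   transport.connect(username="user", password="secret")
#   sftp = paramiko.SFTPClient.from_transport(transport)
#   sftp.put("/tmp/data.csv", "/upload/data.csv")
```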
How to use JDBC source to write and read data in (Py)Spark?
The goal of this question is to document: the steps required to read and write data using JDBC connections in PySpark, and possible issues with JDBC sources and known solutions. With small changes these methods should work with other supported languages, including Scala and R. Answer Writing data Include the applicable JDBC driver when you submit the application or start the shell. You can
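A compact sketch of the read/write pattern the answer describes, in PySpark; the host, database, tables, and credentials are hypothetical, and the spark.read/write calls (shown as comments) require an active SparkSession with the JDBC driver on the classpath:

```python
# Hypothetical PostgreSQL connection details.
url = "jdbc:postgresql://localhost:5432/mydb"
properties = {
    "user": "username",
    "password": "password",
    "driver": "org.postgresql.Driver",
}

# Submit with the driver on the classpath, e.g.:
#   spark-submit --packages org.postgresql:postgresql:42.7.3 app.py
# Then, with an active SparkSession `spark`:
#   df = spark.read.jdbc(url=url, table="schema.source_table",
#                        properties=properties)
#   df.write.jdbc(url=url, table="schema.target_table", mode="append",
#                 properties=properties)
```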
Is there a Python equivalent for Scala’s Option or Either?
I really enjoy using the Option and Either monads in Scala. Are there any equivalents for these in Python? If there aren’t, what is the pythonic way of handling errors or “absence of value” without throwing exceptions? Answer The pythonic way for a function to say “I am not defined at this point” is to raise an exception.
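The answer's point stands: raising exceptions is the idiomatic Python choice. For those who still want Option/Either-style values, a minimal sketch; the Ok/Err names and the first_even/parse_int helpers are illustrative, not a standard-library API:

```python
from dataclasses import dataclass
from typing import Generic, Optional, TypeVar, Union

T = TypeVar("T")
E = TypeVar("E")

# Option-like: Optional[T] is the conventional "value or None".
def first_even(xs: list) -> Optional[int]:
    return next((x for x in xs if x % 2 == 0), None)

# Either-like: a tiny Result type with an Ok and an Err case.
@dataclass
class Ok(Generic[T]):
    value: T

@dataclass
class Err(Generic[E]):
    error: E

Result = Union[Ok[T], Err[E]]

def parse_int(s: str) -> "Result[int, str]":
    try:
        return Ok(int(s))
    except ValueError:
        return Err(f"not an integer: {s!r}")

print(first_even([1, 3, 4]))  # 4
print(parse_int("42"))        # Ok(value=42)
```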