Currently I have a Spark job that reads a file, creates a dataframe, does some transformations, and then moves those records into a "year/month/date" layout. I am achieving this by:
df.write.option("delimiter", "\t").option("header", False).mode(
    "append"
).partitionBy("year", "month", "day").option("compression", "gzip").csv(
    config["destination"]
)
I want to achieve the same in a Pythonic way. In the end, it should look like:
data/2022/04/14
data/2022/04/15
Answer
Based on your question, instead of using partitionBy you can also modify your config['destination'] directly, as S3 will take care of creating the necessary folders underneath that path:
>>> from datetime import datetime
>>> s3_dump_path = config["destination"]  ### 's3:/test-path/'
>>> curr_date = datetime.now().date()
>>> year, month, day = curr_date.strftime('%Y'), curr_date.strftime('%m'), curr_date.strftime('%d')
>>> s3_new_path = '/'.join([s3_dump_path, year, month, day])
>>> s3_new_path
's3:/test-path//2022/04/14'
>>> config["destination"] = s3_new_path

df.write.option("delimiter", "\t").option("header", False).mode(
    "append"
).option("compression", "gzip").csv(
    config["destination"]
)
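Note that because the example destination already ends with a slash, the plain '/'.join produces a double slash in the output path ('s3:/test-path//2022/04/14'). If you want to avoid that, the same idea can be wrapped in a small helper; the sketch below is only illustrative (the function name, parameters, and the way the base path is stripped are my own, not part of the original answer):

from datetime import datetime

from pyspark.sql import DataFrame


def write_dated_csv(df: DataFrame, base_path: str, run_date=None) -> str:
    """Write df as gzipped, tab-delimited CSV under <base_path>/YYYY/MM/DD.

    rstrip('/') keeps the joined path free of double slashes whether or not
    base_path ends with a slash.
    """
    run_date = run_date or datetime.now().date()
    dated_path = "/".join(
        [
            base_path.rstrip("/"),
            run_date.strftime("%Y"),
            run_date.strftime("%m"),
            run_date.strftime("%d"),
        ]
    )
    (
        df.write.option("delimiter", "\t")
        .option("header", False)
        .mode("append")
        .option("compression", "gzip")
        .csv(dated_path)
    )
    return dated_path


# Hypothetical usage:
# dest = write_dated_csv(df, config["destination"])  # e.g. 's3://test-path/2022/04/14'

Either way, the key point is the same as in the answer: build the dated path in Python and pass it to csv(), rather than relying on partitionBy to create the directory structure.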