I’m trying to use the pandas_profiling package to automagically describe some data frames from inside Apaceh Zeppelin. The code I’m running is: My result is: Any way to work around this? Any hope of working around it from inside Zeppelin? Answer The NotImplementedError is being raised from check_dataframe: https://github.com/ydataai/pandas-profiling/blob/v3.1.0/src/pandas_profiling/model/dataframe.py#L10. check_dataframe uses multimethod for enabling multiple argument dispatching to functions, which
Tag: apache-zeppelin
installed geolocator but ImportError: cannot import name ‘DummyLocator’
I Have installed geolocator and when I use pip install geolocator it returns: Requirement already satisfied: geolocator in /opt/anaconda3/lib/python3.6/site-packages (0.1.1) but when I try to import it with import geolocator raises this error: How can I resolve this error? Answer Problem solved by installing packages with this order:
How to pass a variable into an Pyspark sequence to generate time series?
I want to generate a time series, from 2021-12-01 to 2021-12-31, but I want to pass the values with variables into de function secuence. This is my code: I want the values 2021-12-01 and 2021-12-31 inside variables. Something like: And get this result: But instead I’m reciving: cannot resolve ‘eldia1’ given input columns: [MES, NEGOCIO]; Answer Easiest would be to
pyspark matplotlib integration with Zeppelin
I’m trying to draw histogram using pyspark in Zeppelin notebook. Here is what I have tried so far, This code run without no errors but this does not give the expected plot. So I googled and found this documantation, According to this, I tried to enable angular flag as follows, But now I’m getting an error called No module named