Closed
Description
Hey folks,
I posted this on SO and was asked to file a report here as well.
I'm trying to export pandas.DataFrame.describe()
to LaTex
using the to_latex()
-method. This works all fine as long as I don't apply the groupby()
-method beforehand. With a grouped DataFrame, the first row has no values, even though its label is count
. Note that the first row of a grouped dataframe is used to mark down the variable used for grouping in iPython notebook.
I'm using pandas 0.16.2, python 3.
Is this a bug or am I doing something wrong?
Cheers,
Jakob
Here some examples:
Without groupby
:
\begin{tabular}{lr}
\toprule
{} & IS\_FEMALE \\
\midrule
count & 2267.000000 \\
mean & 0.384649 \\
...
...
75\% & 1.000000 \\
max & 1.000000 \\
\bottomrule
\end{tabular}
With groupby
:
\begin{tabular}{llr}
\toprule
& & IS\_FEMALE \\
\midrule
0 & count & \\ % <-- note missing value here
& mean & 1134.000000 \\
& std & 0.554674 \\
...
...
& 75\% & 0.000000 \\
& max & 0.000000 \\
\bottomrule
\end{tabular}
Output in the notebook:
Activity
jorisvandenbossche commentedon Jul 23, 2015
Thanks for the report! Can you:
to_latex
. For example, if you create a similar dataframe comparable to the output of the groupby by hand, and then export it to latex, do you experience the same error?jakbaum commentedon Jul 23, 2015
Sure. This snippet re-creates the issue. Sorry for the messy DataFrame-construction. First time I create one with numpy.
I don't really know how to test your second point, to be honest. The first 'blank' row of a groupby is just visualization, I reckon?
jorisvandenbossche commentedon Jul 23, 2015
Thanks for the reproducible example! That indeeds triggers the error for me as well.
Here is an example of just a small dataframe that also shows the error (as it has as such nothing to do with the
groupby
, it is just that it creates a multi-index thatto_latex
handles incorrectly):It seems that all values are shifted one line below.
jorisvandenbossche commentedon Jul 23, 2015
It seems this has something to do with the index level names:
And possibly related: #9908
jreback commentedon Jul 23, 2015
dupe if #2942 ?
[-].groupby().to_latex() output broken[/-][+]BUG: to_latex() output broken when the index has a name[/+]jorisvandenbossche commentedon Jul 23, 2015
No, I don't think so, as this one not only applies to multi-index:
So it is something with the index name.
jakbaum commentedon Jul 23, 2015
Is the proposed fix of #9908 implemented in 0.16.2?
jorisvandenbossche commentedon Jul 23, 2015
@jakbaum yes, it is already in 0.16.1. But it does not fix this one, it possibly fixed a related issue, but should look into more detail into that.
And very welcome to look into the problem if you want! It shouldn't be that hard I think.
jreback commentedon Jul 23, 2015
also #8336
jakbaum commentedon Jul 23, 2015
@jorisvandenbossche Your belief in my coding qualities honor me, but quite honestly: I don't think I'm capable of fixing this. I wouldn't even know how to start and I don't want to mess things up. Actually, I'm more of a copy-paste coder than anything else. :)
jorisvandenbossche commentedon Jul 23, 2015
@jakbaum no problem, thanks for reporting it anyway!
10 remaining items