I have a dataFrame from a large questionnaire, I’m generating summaries by aggregating the

Question

Asked: June 10, 2026 (2026-06-10T16:34:28+00:00)

I have a DataFrame from a large questionnaire, and I'm generating summaries by aggregating the data along different axes, like this:

df.groupby(group_name).agg([np.mean, np.std, np.count_nonzero])

This generates mean, std, and count columns for each question in my questionnaire. Each column name in the grouped DataFrame is a tuple (original_column_name, function_applied).

The problem is that when I write the result to CSV (using to_csv()), the column names are output as tuples, i.e. ('gender', 'mean'), ('gender', 'std'), whereas ideally I would like something like gender_mean and gender_std.

How can I process these column names before output to CSV?


1 Answer

Editorial Team · Answer 1 · 2026-06-10T16:34:29+00:00


In pandas 0.8.1, try this:

group_df = df.groupby(group_name).agg([np.mean, np.std, np.count_nonzero])
# Same call with the arguments named: rename the columns in place, without copying
group_df.rename(columns=lambda coltuple: '_'.join(coltuple), copy=False, inplace=True)

See the DataFrame documentation for more details.
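
The answer above targets pandas 0.8.1. On more recent pandas versions, a common alternative is to join each column tuple yourself and assign the result back to the columns. The following is only a minimal sketch under some assumptions: the sample data and the names "group", "gender", and "age" are made up for illustration, and the string aggregation "count" is used in place of np.count_nonzero (it counts non-missing values, not non-zero values, so swap in your own function if that distinction matters).

```python
import pandas as pd

# Hypothetical questionnaire data, invented for this illustration.
df = pd.DataFrame({
    "group": ["a", "a", "b", "b"],
    "gender": [1, 0, 1, 1],
    "age": [30, 40, 50, 60],
})

# Aggregating with a list of functions yields two-level column labels,
# i.e. tuples such as ("gender", "mean").
group_df = df.groupby("group").agg(["mean", "std", "count"])

# Join each (column, function) tuple into a single string label.
flat_df = group_df.copy()
flat_df.columns = ["_".join(col) for col in group_df.columns]

# With no path argument, to_csv() returns the CSV text as a string.
csv_text = flat_df.to_csv()
print(csv_text)
```

Assigning to a copy keeps the original grouped frame and its two-level columns available in case you still need them for further selection.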
