How can I separate one row from a data set but repeat in each line some of the variables?

Question

I have a dataset where each row contains information that needs to be separated and printed in different rows, but I need to keep the name of the company on each newly printed row.

These are the headers:

These are 2 rows of data:

I need to separate one line into as many as I need. Some companies

Accepted Answer

Given a text file that looks like this (below its header row, which is shown in full in the test snippet further down):

    Law Office | 450,000 | 150,000 | 300,000 | 100,000 | 200,000 | 50,000
    Restaurant | 30,000  | 7,000   | null    | null    | 25,000  | 10,000

We can do:

    import pandas as pd

    # With engine='python' the separator is a regex, so the pipe must be escaped;
    # \s* also absorbs the padding around each value.
    df = pd.read_csv('file.txt', sep=r'\s*\|\s*', engine='python')

    # Reverse the column names on '_', e.g. 'marketing_budget' -> 'budget_marketing'.
    df.columns = ['_'.join(reversed(x.split('_'))) for x in df.columns]

    # Use pd.wide_to_long; the suffix regex \w+ matches the department names.
    df = pd.wide_to_long(df, ['budget', 'remaining'], i='company', j='department', sep='_', suffix=r'\w+').sort_index()
    df = df.reset_index().dropna()
    print(df)

Output:

          company department   budget remaining
    0  Law Office    finance  300,000   100,000
    1  Law Office  marketing  450,000   150,000
    2  Law Office      sales  200,000    50,000
    4  Restaurant  marketing   30,000     7,000
    5  Restaurant      sales   25,000    10,000

Testing, and how I'd make the values numeric for future calculations:

    import pandas as pd
    from io import StringIO

    d = '''company | marketing_budget | marketing_remaining | finance_budget | finance_remaining | sales_budget | sales_remaining
    Law Office | 450,000 | 150,000 | 300,000 | 100,000 | 200,000 | 50,000
    Restaurant | 30,000 | 7,000 | null | null | 25,000 | 10,000'''

    df = pd.read_csv(StringIO(d), sep=r'\s*\|\s*', engine='python')

    # Strip the thousands separators, then convert whatever parses as a number.
    # Note: applymap is deprecated since pandas 2.1 (DataFrame.map replaces it),
    # and errors='ignore' is deprecated since pandas 2.2.
    df = df.fillna('').applymap(lambda x: x.replace(',', ''))
    for col in df.columns:
        df[col] = pd.to_numeric(df[col], errors='ignore')

    df.columns = ['_'.join(reversed(x.split('_'))) for x in df.columns]
    df = pd.wide_to_long(df, ['budget', 'remaining'], i='company', j='department', sep='_', suffix=r'\w+').sort_index()
    df = df.reset_index().dropna()
    print(df)

Output:

          company department    budget  remaining
    0  Law Office    finance  300000.0   100000.0
    1  Law Office  marketing  450000.0   150000.0
    2  Law Office      sales  200000.0    50000.0
    4  Restaurant  marketing   30000.0     7000.0
    5  Restaurant      sales   25000.0    10000.0
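For comparison, here is a minimal sketch of the same reshape done with `melt` and `pivot` instead of `pd.wide_to_long`. This is a swapped-in technique, not the accepted answer's method; the names `raw`, `long_df`, `kind`, `amount` and `result` are illustrative only, and the sketch assumes every value column is named `<department>_<budget|remaining>` as in the sample data.

```python
from io import StringIO

import pandas as pd

# Sample data copied from the answer above; "null" is read as NaN by default.
raw = """\
company | marketing_budget | marketing_remaining | finance_budget | finance_remaining | sales_budget | sales_remaining
Law Office | 450,000 | 150,000 | 300,000 | 100,000 | 200,000 | 50,000
Restaurant | 30,000  | 7,000   | null    | null    | 25,000  | 10,000
"""

# The separator is a regex: an escaped pipe with optional surrounding whitespace.
df = pd.read_csv(StringIO(raw), sep=r"\s*\|\s*", engine="python")

# One row per (company, original column name).
long_df = df.melt(id_vars="company", var_name="column", value_name="amount")

# "marketing_budget" -> department "marketing", kind "budget".
long_df[["department", "kind"]] = long_df["column"].str.split("_", n=1, expand=True)

# Drop thousands separators and convert to numbers; missing cells become NaN.
long_df["amount"] = pd.to_numeric(
    long_df["amount"].astype(str).str.replace(",", "", regex=False),
    errors="coerce",
)

# One row per (company, department), with budget and remaining side by side.
result = (
    long_df.pivot(index=["company", "department"], columns="kind", values="amount")
    .dropna()          # removes departments a company does not have
    .astype("int64")   # safe once the NaN rows are gone
    .reset_index()
)
result.columns.name = None
print(result)
```

Under these assumptions the company name is repeated on every department row, and the budget and remaining columns end up as integers rather than floats, which may be more convenient for later calculations.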

Answer