Skip to content
Advertisement

Appending data with unequal data frame dimensions

What is the best way to append data using matching column names from two different data frames with differing dimensions?

Scenario:

Df1 = 350(rows)x2778(columns) Df2 = 321×2910

Df1 has <2778 columns with the exact same name as <2910 columns in Df2. -It could be 500 columns in each data frame as an example that have equivalent names

What I want to do:

Append data from df2 to df1 where the column names match. E.x.: df1’s data is present in matching column and has df2’s data appended to the end of the column, put underneath it so to say.

If the col names don’t match, the data frame that lacks the matching name should have the name attached as a new column with NA’s filling the missing space. E.x.: df1 doesn’t have a column df2 has, so the column is added while maintaining order of processing.

I’ve tried to do this using Pandas in Python but got Index duplication errors (probably the columns). I’m looking at R now but I want to know if anyone has a simple solution.

Thanks!

Advertisement

Answer

Check out merge() from base r or bind_rows() from dplyr.

User contributions licensed under: CC BY-SA
1 People found this is helpful
Advertisement