Dear Members,
I have a problem that pertains to combining two datasets that do not have a unique identifier. My datasets contains information on around 250 firms, their competitors and the competitors’ competitors for a time-period of 34 years.
Dataset 1 has information on firm-competitor dyads, market and year. A simple representative form of the dataset is:
Dataset 2 has information on competitor- competitor’s competitor dyads, market and year for the same time frame. A simple representative form of this dataset is:
For analyses, I seek a dataset that links the Sample Firm to their Competitors and the Competitors’ Competitors. The final dataset would ideally be in the following form:
This would require merging Dataset 1 and Dataset 2. However, the issue is that there is no unique identifier between the two datasets. I tried a m:m merge using CompetitorFirm and Year. But the matching was not correct. Could any of the members please suggest another way through which the output in the final dataset could be obtained?
Thanks,
Archita
I have a problem that pertains to combining two datasets that do not have a unique identifier. My datasets contains information on around 250 firms, their competitors and the competitors’ competitors for a time-period of 34 years.
Dataset 1 has information on firm-competitor dyads, market and year. A simple representative form of the dataset is:
SampleFirm | Market1 | Year | CompetitorFirm |
Firm A | M1 | t1 | Firm C |
Firm A | M1 | t1 | Firm D |
Dataset 2 has information on competitor- competitor’s competitor dyads, market and year for the same time frame. A simple representative form of this dataset is:
CompetitorFirm | Market2 | Year | Competitor’sCompetitorFirm |
Firm C | M3 | t1 | Firm L |
Firm C | M4 | t1 | Firm M |
Firm D | M5 | t1 | Firm N |
Firm D | M6 | t1 | Firm O |
SampleFirm | Market1 | Year | CompetitorFirm | Competitor’sCompetitorFirm | Market2 |
Firm A | M1 | t1 | Firm C | Firm L | M3 |
Firm A | M1 | t1 | Firm C | Firm M | M4 |
Firm A | M1 | t1 | Firm D | Firm N | M5 |
Firm A | M1 | t1 | Firm D | Firm O | M6 |
This would require merging Dataset 1 and Dataset 2. However, the issue is that there is no unique identifier between the two datasets. I tried a m:m merge using CompetitorFirm and Year. But the matching was not correct. Could any of the members please suggest another way through which the output in the final dataset could be obtained?
Thanks,
Archita
Comment