Dear community members,
Please help me to address the following issue. I will keep my example simple:
1. DV - rating score (0-100) - data for which was collected in March 2014, the rating itself was published in May 2014.
2. IV - number of Twitter followers (0 - to infinity) - data collected in the end of July 2014.
I am afraid that the reviews may point out that IV data was collected 3 months after DV data was published, and thus the results are biased.
Please suggest a proper way to justify this, or test somehow.
My initial idea was to: (A) re-collect IV data (which I did today for 20% randomly selected entities), and (B) run a paired t-test to see if there any significant increase (or decrease) over time. The null was rejected, that is increase in mean over time is significant. However, I am not sure that this is the correct approach, because really there are many possible factors that could impact this significant increase.
Thank you in advance,
Anton
Please help me to address the following issue. I will keep my example simple:
1. DV - rating score (0-100) - data for which was collected in March 2014, the rating itself was published in May 2014.
2. IV - number of Twitter followers (0 - to infinity) - data collected in the end of July 2014.
I am afraid that the reviews may point out that IV data was collected 3 months after DV data was published, and thus the results are biased.
Please suggest a proper way to justify this, or test somehow.
My initial idea was to: (A) re-collect IV data (which I did today for 20% randomly selected entities), and (B) run a paired t-test to see if there any significant increase (or decrease) over time. The null was rejected, that is increase in mean over time is significant. However, I am not sure that this is the correct approach, because really there are many possible factors that could impact this significant increase.
Thank you in advance,
Anton
Comment