For my dissertation, I am using words and their respective data I have pulled from Google trends to see which have the most significant effect on market returns, so I am using Fama French data to explore this.
I have a set of words I want to regress on 'rmf', and then use this regression to spot the t-statistic and then average my observations on each day for each words, essentially putting their data together to form a single index. I have data from 2004-2022 and want to regress this every sixth months and gain my average observations for each day within the 6 months. I am using 6-month rolling regressions because different words will be searched more/less in different years.
For the words I want to average, I want to use those that have a negative t-statistic in the time period, then use only these words to collate an average of their respective observations on that day.
Is there a shortcut that will allow me to regress in different time periods, then put all this together so I have one variable, rather than 36 different variables for each 6 months in the period?
I have a set of words I want to regress on 'rmf', and then use this regression to spot the t-statistic and then average my observations on each day for each words, essentially putting their data together to form a single index. I have data from 2004-2022 and want to regress this every sixth months and gain my average observations for each day within the 6 months. I am using 6-month rolling regressions because different words will be searched more/less in different years.
For the words I want to average, I want to use those that have a negative t-statistic in the time period, then use only these words to collate an average of their respective observations on that day.
Is there a shortcut that will allow me to regress in different time periods, then put all this together so I have one variable, rather than 36 different variables for each 6 months in the period?
Comment