I'll try to explain this as best as possible.
I'm using a database of information for movies scraped from IMDb. It's currently as a csv but I'll be importing it into Stata.
Each film has a list of genres. They are listed alphabetically so I can't simply use the first genre of each list. I was wondering if there was a way to create the genre variable in Stata so that the genre variable for each film will have each of its list 'attached' to it, and then when I use a series of dummy variables in my regression, every film with that genre attached to it will be accounted for.
i.e. y = a + b1action + e
where the action dummy would pick any film with at least 'action' as one of its genres.
I hope this makes sense. Not sure it's possible but thought I'd ask!
Thanks
I'm using a database of information for movies scraped from IMDb. It's currently as a csv but I'll be importing it into Stata.
Each film has a list of genres. They are listed alphabetically so I can't simply use the first genre of each list. I was wondering if there was a way to create the genre variable in Stata so that the genre variable for each film will have each of its list 'attached' to it, and then when I use a series of dummy variables in my regression, every film with that genre attached to it will be accounted for.
i.e. y = a + b1action + e
where the action dummy would pick any film with at least 'action' as one of its genres.
I hope this makes sense. Not sure it's possible but thought I'd ask!
Thanks
Comment