Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • text analysis and topic modeling

    Hello dear all,

    I am wondering if we can do a text analysis with STATA or not?
    I have PDF files of the report of the country and during reading, I want to extract the activity that the government did in terms of increasing/decreasing the taxes and expenditure. My reports are quarterly data and total they are around 8000 pages.

    unfortunately, I could not find any films and specific courses in terms of STATA.

    I appreciate any help and assistance you can provide.

    Best regards,
    Khati

  • #2
    there are some community-contributed commands (e.g., txttool from SJ 14:4) but they only work on Stata dta files so you need to get your pdf files into Stata somehow; given the number of pages, I have no idea what would be the best way to do that

    Comment


    • #3
      @Rich Goldstein thanks for your reply.
      actually, my reports are quarterly and each has a maximum of 35 pages in pdf format.
      total they are 8000 pages.
      So, basically, I can do each report separately. but I could not find a specific link to learn.

      I appreciate your assistance.

      Comment

      Working...
      X