
Generate Document Keywords

Complement your search strategy with document keywords.

Lost information is no use to anybody, and the difference between lost and found is a good collection search strategy. Keywords can play a valuable role in your strategy by giving you insight into a document’s topics. Of course, a document’s headings, listed in its Table of Contents, provide an outline of its topics. Keywords are different. Derived from the document’s full text, they fill in the gaps between the formal, outlined topics and their actual treatments. This hack explains how to find a PDF’s keywords using our kw_catcher program.

How the kw_catcher Keyword Generator Works

Finding keywords automatically is a hard problem. To simplify the problem, we are going to make a couple of assumptions. First, the document in question is large—50 pages or longer. Second, the document title is known—i.e., we aren’t trying to discover the document’s global topic, represented by its title. Rather, we are trying to discover subtopics that emerge throughout the document.
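The two assumptions above can be illustrated with a small sketch. This is not kw_catcher's method, which is not shown in this excerpt. It is a naive term-frequency ranking in Python that takes the page texts as already-extracted strings, ignores words from the known title, and keeps only words that recur on more than one page. The names naive_keywords, STOP_WORDS, top_n, and min_pages are hypothetical and chosen only for this illustration.

```python
import re
from collections import Counter

# A tiny illustrative stop-word list (an assumption; a real tool would use a larger one).
STOP_WORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "is", "for",
    "on", "with", "that", "this", "it", "as", "are", "be", "by",
}


def naive_keywords(pages, title, top_n=5, min_pages=2):
    """Rank words by total frequency across pages.

    Words from the title are skipped (the global topic is assumed known),
    and a word must appear on at least `min_pages` pages to count as a
    subtopic that emerges throughout the document.
    """
    title_words = set(re.findall(r"[a-z]+", title.lower()))
    counts = Counter()     # total occurrences of each word
    page_hits = Counter()  # number of pages each word appears on

    for text in pages:
        words = [
            w for w in re.findall(r"[a-z]+", text.lower())
            if len(w) > 2 and w not in STOP_WORDS and w not in title_words
        ]
        counts.update(words)
        page_hits.update(set(words))

    ranked = [w for w, _ in counts.most_common() if page_hits[w] >= min_pages]
    return ranked[:top_n]


if __name__ == "__main__":
    sample_pages = [
        "Fonts and encryption in PDF files",
        "Encryption keys protect fonts",
        "Bookmarks and fonts",
    ]
    print(naive_keywords(sample_pages, title="PDF Files"))
```

On a real 50-page document, the per-page text would first have to be extracted from the PDF, a step this sketch leaves out.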

