gawk-diffs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[gawk-diffs] Request for a script for indexing


From: Sumathi LC
Subject: [gawk-diffs] Request for a script for indexing
Date: Sun, 10 Apr 2011 17:15:29 -0400


Dear All
 
I am trying to begin writing a script for generating book index by reading two inputs:
1, The complete book in PDF or DOC format.
2. A list of words/phrases in Ascii txt/Excel to look for in the book.
Using the word list from 2, the program needs to search the whole document 1 and return a text file that enlists all the words with the page numbers separated by comma. Perhaps we can make it read the page numbers by retrieving information from the footer or using a loop to count the pages.
 
I would be grateful if someone can share with me such a script if available. Or some related scripts/ideas to implement this would also be very helpful.
 
Thanks and regards
Sumathi
 

reply via email to

[Prev in Thread] Current Thread [Next in Thread]