[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[elpa] externals/doc-toc 4c98932c65 05/84: create first version README
From: |
ELPA Syncer |
Subject: |
[elpa] externals/doc-toc 4c98932c65 05/84: create first version README |
Date: |
Mon, 26 Sep 2022 13:58:34 -0400 (EDT) |
branch: externals/doc-toc
commit 4c98932c65187b285d29fc1311466c9765c88be3
Author: Daniel Nicolai <dalanicolai@gmail.com>
Commit: Daniel Nicolai <dalanicolai@gmail.com>
create first version README
---
README.org | 86 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
1 file changed, 86 insertions(+)
diff --git a/README.org b/README.org
new file mode 100644
index 0000000000..05470ad73e
--- /dev/null
+++ b/README.org
@@ -0,0 +1,86 @@
+* toc-mode
+Create, cleanup, add and manage Table Of Contents (TOC) of pdf and djvu
documents with Emacs
+
+* Introduction
+TOC-mode is a package for creating, cleaning, adding and managing the
+Table Of Contents (TOC) of pdf and djvu documents.
+
+* Important
+Currently only files with a text layer are supported. A feature to extract the
+TOC via OCR will probably be added soon. Until then, the python
+[[https://pypi.org/project/document-contents-extractor/][documents-contents-extractor]]
package is recommended for extraction of TOC via
+OCR.
+
+* Requirements
+Currently the package requires the ~pdftotext~ (part of poppler-utils),
~pdfoutline~
+(part of
[[https://launchpad.net/ubuntu/bionic/+package/fntsample][fntsample]]) and
~djvused~ (part of
[[http://djvu.sourceforge.net/][http://djvu.sourceforge.net/]]) command
+line utilities to be available.
+
+* Usage
+Extraction and adding contents to the document is done in 4 steps:
+1. extraction
+2. cleanup
+3. adjust/correct pagenumbers
+4. add TOC to document
+
+** 1. Extraction
+Open some pdf or djvu file in Emacs (pdf-tools and djvu package recommended).
+Find the pagenumbers for the TOC. Then type =M-x toc-extract-pages= and answer
the
+subsequent prompts by entering the pagenumbers for the first and the last page
+each followed by =RET=.
+
+A buffer with the, somewhat cleaned up, extracted text will open in TOC-cleanup
+mode.
+
+** 2. TOC-Cleanup
+In this mode you can further cleanup the contents to create a list where
+each line has the structure:
+
+TITLE (SOME) PAGENUMBER
+
+There can be any number of spaces between TITLE and PAGE. The correct
+pagenumbers can be edited in the next step. A document outline supports
+different levels and levels are automatically assigned in order of increasing
+number of preceding spaces, i.e. the lines with the least amount of preceding
+spaces are assigned level 0 etc., and lines with equal number of spaces get
+assigned the same levels.
+#+BEGIN_SRC
+Contents 1
+Chapter 1 2
+ Section 1 3
+ Section 1.1 4
+Chapter 2 5
+#+END_SRC
+There are some handy functions to assist in the cleanup. =C-c C-j= jumps
+automatically to the next line not ending with a number and joins it with the
+next line. If the indentation structure of the different lines does not
+correspond with the levels, then the levels can be set automatically from the
+number of seperators in the indices with =M-x toc-cleanup-set-level-by-index=.
The
+default seperator is a ~.~ but a different seperator can be entered by
preceding
+the function invocation with the universal argument (=C-u=). Some documents
+contain a structure like
+#+BEGIN_SRC
+1 Chapter 1 1
+Section 1 2
+#+END_SRC
+Here the indentation can be set with ~M-x replace-regexp~ ~^[^0-9]~ -> ~\&~
(where
+there is a space character before the ~\&~ represents a space).
+
+Type =C-c C-c= when finished
+
+** 3 TOC-tabular (adjust pagenumbers)
+This mode provides the functionality for easy adjustment of pagenmumbers. The
+buffer can be navigated with the arrow =up/down= keys. The =left= and =right=
+arrow keys will shift =down/up= all the page numbers from the current line and
+below (combine with =SHIFT= for setting individual pagenumbers).
+
+Type =C-c C-c= when done
+
+** TOC-mode (add outline to document)
+The text of this buffer should have the right structure for adding the contents
+to (for pdf's a copy of) the original document. Final adjusments can be done
but
+should not be necessary. Just type =C-c C-c= for adding the contents to the
+document.
+
+For pdf the a file =pdfwithtoc.pdf= is created in the same folder as the
original
+pdf. For djvu, the TOC is simply added to the original djvu file.
- [elpa] branch externals/doc-toc created (now b4bb748aa3), ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc 7c4eef518e 02/84: initial upload, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc fdb66ff587 04/84: delete old file(name) document-outliner, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc 4c98932c65 05/84: create first version README,
ELPA Syncer <=
- [elpa] externals/doc-toc 32803571a6 06/84: delete README.md, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc bcf2348d28 03/84: rename file to toc-mode, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc 34d16c6c06 07/84: implement pdf navigation from TOC-tabular-mode, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc 56218310ba 08/84: fix funcitons and update docs for djvu, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc b869d9c888 14/84: Update README with tesseract ocr info, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc f7aacf7e77 25/84: Set encoding and cleanup linefeeds, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc d1cc2e950c 26/84: UPDATE README add extract screencast, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc b22f6f0a2d 28/84: add golden-ratio-mode to toc-tablist, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc caa066b67d 29/84: Add spell check and correct start pagenumber, ELPA Syncer, 2022/09/26
- [elpa] externals/doc-toc 81f58d0871 35/84: Fix double-dash function names, ELPA Syncer, 2022/09/26