package documentation
Code that is not considered core functionality and is less well supported, but which you may nonetheless find useful.
Module | gerechtcodes | Some information about the gerechtcodes that are used in ECLIs. |
Module | lawref | This code attempts to make it easier to deal with human variation in references to laws. |
Module | ocr | Extract text from images, mainly aimed at PDFs that contain _pictures_ of documents, rather than text directly. |
Module | pdf | Query PDFs about the text objects that they contain (which is not always clean, structured, correct, or present at all) |
Module | pdfhocr | This is an experiment in allowing per-page decisions to extract embedded text or do OCR. |
Module | word | A module to create wordcloud images. |
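
For context on the gerechtcodes entry above: an ECLI has the general shape `ECLI:country:court:year:ordinal`, and for Dutch ECLIs the court field is the gerechtcode. The following is a minimal sketch of pulling that field out of an identifier. It does not use this package's API; `EcliParts` and `split_ecli` are hypothetical names introduced only for illustration, and the identifier in the usage line is just an example of the format.

```python
from typing import NamedTuple


class EcliParts(NamedTuple):
    """The four fields that follow the literal 'ECLI' prefix."""
    country: str
    court: str    # for Dutch ECLIs, this is the gerechtcode
    year: str
    ordinal: str


def split_ecli(ecli: str) -> EcliParts:
    """Split an ECLI string on colons (hypothetical helper, not part of this package)."""
    parts = ecli.strip().split(":")
    # A well-formed ECLI has exactly five colon-separated fields, the first being 'ECLI'.
    if len(parts) != 5 or parts[0].upper() != "ECLI":
        raise ValueError(f"not an ECLI: {ecli!r}")
    _, country, court, year, ordinal = parts
    # Country and court codes are normalised to upper case for easier lookup.
    return EcliParts(country.upper(), court.upper(), year, ordinal)


parts = split_ecli("ECLI:NL:HR:2014:952")
print(parts.court)
```

The extracted court code is the kind of value one would then look up in a table of gerechtcodes; how the gerechtcodes module itself exposes that information is not shown here.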