package documentation
code that is not considered core functionality, and not as supported, yet which you may find use for nonetheless
| Module | gerechtcodes |
Some information about the gerechtcodes that are used in ECLIs. |
| Module | lawref |
This code attempts to make it easier to deal with human variation in references to laws. |
| Module | ocr |
Extract text from images, mainly aimed at PDFs that contain _pictures_ of documents, rather than text directly. |
| Module | pdf |
Query PDFs about the text objects that they contain (which is not always clean, structured, correct, or present at all) |
| Module | pdfhocr |
This is an experiment in allowing per-page decisions to extract embedded text or do OCR. |
| Module | word |
A module to create wordcloud images. |