package documentation

Code that is not considered core functionality and is less well supported, but which you may nonetheless find useful.

Module gerechtcodes Information about the gerechtcodes (court codes) that are used in ECLIs.
Module lawref This code attempts to make it easier to deal with human variation in references to laws.
Module ocr Extract text from images, mainly aimed at PDFs that contain _pictures_ of documents, rather than text directly.
Module pdf Query PDFs for the text objects that they contain (which are not always clean, structured, correct, or present at all).
Module pdfhocr This is an experiment in allowing per-page decisions to extract embedded text or do OCR.
Module word_cloud Create word cloud images.
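
To illustrate what a gerechtcode is, the following is a minimal sketch of where it sits inside an ECLI. It relies only on the general ECLI layout of five colon-separated fields (the literal `ECLI`, a country code, a court code, a year, and an ordinal). The names `EcliParts` and `split_ecli` are hypothetical and are not taken from the gerechtcodes module, and the identifier in the usage comment is made up for illustration.

```python
from typing import NamedTuple


class EcliParts(NamedTuple):
    """The four variable fields of an ECLI (hypothetical helper type)."""
    country: str
    court_code: str   # for Dutch ECLIs this is the gerechtcode
    year: int
    ordinal: str


def split_ecli(ecli: str) -> EcliParts:
    """Split an ECLI string into its fields; raise ValueError if malformed."""
    parts = ecli.strip().split(":")
    if len(parts) != 5 or parts[0].upper() != "ECLI":
        raise ValueError(f"not an ECLI: {ecli!r}")
    _, country, court_code, year, ordinal = parts
    # int() also raises ValueError when the year field is not numeric
    return EcliParts(country.upper(), court_code.upper(), int(year), ordinal)


# Usage (made-up identifier):
# split_ecli("ECLI:NL:HR:2020:1234").court_code gives "HR"
```

This only separates the fields; whether a given court code actually exists is a lookup question, which is the kind of information the gerechtcodes module describes.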