package documentation
A collection of small singular tools, useful when composing more complex tasks
Module | akn |
Lookup of AKN |
Module | collocation |
Quick and dirty version of some collocation code. |
Module | date |
Try to deal with varied forms of dates and times, and ease things like "I would like to specify a range of days in a particular format" (e.g. for bulk fetching), and such. |
Module | escape |
Make it easier to safely insert text into URLs, and HTML and XML data. |
Module | etree |
Helpers to deal with XML data, largely a wrapper around lxml and its ElementTree interface. |
Module | format |
Formatting varied types of values into text, (and sometimes parsing the same), mostly for readability |
Module | koop |
Data and metadata parsing that is probably specific to KOOP's SRU repositories. |
Module | lazy |
Various functions that allow you to be (a little too) lazy - less typing and/or less thinking. |
Module | localdata |
This is intended to store store collections of data on disk, relatively unobtrusive to use (better than e.g. lots of files), and with quick random access (better than e.g. JSONL). |
Module | meta |
Things that parse metadata. |
Module | net |
network related helper functions, such as fetching from URLs |
Module | notebook |
Tools for jupyter/ipython-style notebooks, and detection that you are, or are _not_, using one right now. |
Module | patterns |
Extracting specific patterns of text. |
Module | shellcolor |
(should arguably be in extras) Eases production of colors in the terminal, mostly for a few command line debug tools. |
Module | spacy |
helper functions related to spacy natural language parsing. |
Module | split |
This module tries to wrangle distinct types of documents for you, from HTML to PDF, from varied specific sources, into plain text, so that you can consume it more easily. |
Module | strings |
mostly-basic string helper functions |
Module | util |
General utility functions, like "give me a path to where wetsuite can store data" and debug tools to the end of inspecting data. |