wetsuite API Documentation Modules Classes Names
Clear Help

For more information on the search, visit the help page.

Class Hierarchy

  • dateutil.parser.parserinfo
    • wetsuite.helpers.date.DutchParserInfo - specific configuration for dateutil for dutch month and week names
  • wetsuite.datacollect.koop_frbr.FRBRFetcher - Helper class to fetch data from an area of https://repository.overheid.nl/frbr/
  • wetsuite.datacollect.sru.SRUBase - Very minimal SRU implementation - just enough to access the KOOP repositories.
    • wetsuite.datacollect.koop_sru.BWB - SRU endpoint for the Basis Wetten Bestand repository
    • wetsuite.datacollect.koop_sru.CVDR - SRU endpoint for the CVDR (Centrale Voorziening Decentrale Regelgeving) repository
    • wetsuite.datacollect.koop_sru.EuropeseRichtlijnen - Note: Broken/untested
    • wetsuite.datacollect.koop_sru.LokaleBekendmakingen - SRU endpoint for bekendmakingen repository
    • wetsuite.datacollect.koop_sru.OfficielePublicaties - SRU endpoint for the OfficielePublicaties repository
    • wetsuite.datacollect.koop_sru.PLOOI - SRU endpoint for the Platform Open Overheidsinformatie repository
    • wetsuite.datacollect.koop_sru.PUCOpenData - Publicatieplatform UitvoeringsContent https://puc.overheid.nl/
    • wetsuite.datacollect.koop_sru.SamenwerkendeCatalogi - SRU endpoint for the Samenwerkende Catalogi repository
    • wetsuite.datacollect.koop_sru.StatenGeneraalDigitaal - SRU endpoint for Staten-Generaal Digitaal repository
    • wetsuite.datacollect.koop_sru.TuchtRecht - SRU endpoint for the TuchtRecht repository
    • wetsuite.datacollect.koop_sru.WetgevingsKalender - SRU endpoint for wetgevingskalender, see e.g. https://wetgevingskalender.overheid.nl/
  • wetsuite.datasets.Dataset - If you're looking for details about the specific dataset, look at the .description
  • wetsuite.extras.pdfhocr.PDFAugmenter - Tries to combine embedded-text extraction where present and sensible, and OCR where necessary.
  • wetsuite.helpers.collocation.Collocation - A basic collocation calculator class.
  • wetsuite.helpers.etree.debug_color - Takes XML, parses, reindents, strip_namespaces, returns a class that will render it in color in a jupyter notebook (using pygments).
  • wetsuite.helpers.localdata.LocalKV - A key-value store backed by a local filesystem - it's a wrapper around sqlite3.
    • wetsuite.helpers.localdata.MsgpackKV - Like localKV but the value can be a nested python type (serialized via msgpack)
  • wetsuite.helpers.notebook.etree_visualize_selection - Produces a colorized representation of selection within an XML document. (works only within IPython/jupyter style notebooks, via a HTML representation.)
  • wetsuite.helpers.notebook.ProgressBar - A sequence-iterating progress bar (like tqdm) that supports both notebooks and console, and prefers notebook over console style in notebooks.
  • wetsuite.helpers.spacy.notebook_content_visualisation - Python notebook visualisation to give some visual idea of contents: marks out-of-vocabulary tokens red, and highlight the more interesting words (by POS).
  • wetsuite.helpers.split.Fragments - Abstractish base class explaining the purpose of implementing this
    • wetsuite.helpers.split.Fragments_HTML_CVDR - Turn CVDR in HTML form into fragments
    • wetsuite.helpers.split.Fragments_HTML_Fallback - Extract text from HTML from non-specific source into fragments
    • wetsuite.helpers.split.Fragments_HTML_Geschillencommissie - Turn HTML pages from degeschillencommissie.nl into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Bgr - Turn blad gemeenschappelijke regeling in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Gmb - Turn gemeenteblad in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Kamer - Turn kamer-related HTMLs (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Prb - Turn provincieblad in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Stb - Turn staatsblad in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Stcrt - Turn staatscourat in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Trb - Turn tractatenblad in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_OP_Wsb - Turn waterschapsblad in HTML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_HTML_Tuchtrecht - Turn HTML pages from into fragments
    • wetsuite.helpers.split.Fragments_PDF_Fallback - Extract text from PDF from non-specific source into fragments
    • wetsuite.helpers.split.Fragments_XML_BWB - Turn BWB in XML form into fragments
    • wetsuite.helpers.split.Fragments_XML_CVDR - Turn CVDR in XML form into fragments
    • wetsuite.helpers.split.Fragments_XML_Fallback - Extract text from XML from non-specific source into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Bgr - Turn blad gemeenschappelijke regeling in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Gmb - Turn gemeenteblad in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Handelingen - Turn handelingen in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Kamer - Turn other kamer XMLs (from KOOP's BUS) into fragments (TODO: re-check which these are)
    • wetsuite.helpers.split.Fragments_XML_OP_Prb - Turn provincieblad in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Stb - Turn sstaatsblad in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Stcrt - Turn staatscourant in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Trb - Turn tractatenblad in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_OP_Wsb - Turn waterschapsblad in XML form (from KOOP's BUS) into fragments
    • wetsuite.helpers.split.Fragments_XML_Rechtspraak - turn rechtspraak.nl's open-rechtspraak XML form into fragments
  • wetsuite.helpers.split.SplitDebug - A notebook-style formatter that does little more than take a list of tuple of three things (meant for the output of fragments()), and print them in a table.

API Documentation for wetsuite, generated by pydoctor 24.11.2 at 2025-12-09 13:59:45.