This is a web service for Textricator, a library for parsing data from computer-generated PDF files.
The web UI for each PDF provides a link to download the PDF and a button to show its raw text content. It also lets you type in a Textricator configuration, then click a button to parse the PDF and show the results.
/files/ has links to the web UI pages for the available PDFs.
Get a list of available files. Produces text/html or application/json.
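As a rough illustration of the two response types, the sketch below builds a request that asks for the file list as JSON rather than HTML. It assumes the list is served at /files/ and that the service chooses the representation from the Accept header; neither is confirmed on this page. The base URL is a placeholder, and the function name is hypothetical. The structure of the JSON response is not documented here, so the sketch does not try to parse it.

```python
import urllib.request

# Placeholder base URL; replace it with wherever the service is deployed.
BASE_URL = "http://localhost:8080"


def build_file_list_request(base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a GET request for the file list as JSON."""
    # Assumption: the list lives at /files/.
    url = base_url.rstrip("/") + "/files/"
    # Assumption: the service selects text/html or application/json
    # based on the Accept header.
    return urllib.request.Request(
        url,
        headers={"Accept": "application/json"},
        method="GET",
    )


if __name__ == "__main__":
    request = build_file_list_request()
    print(request.get_method(), request.full_url)
    # To send it against a running instance:
    #     with urllib.request.urlopen(request) as response:
    #         body = response.read()
```

Building the request separately from sending it makes it easy to inspect the URL and headers before pointing the code at a real deployment.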
Download the PDF for the specified ID. Produces application/pdf.
Extract the raw text content of the PDF, as JSON. Produces application/json.
Parse the PDF data, returning the result in the specified format.
Request body: YAML configuration for the finite-state machine (FSM) parser.
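A minimal sketch of sending a configuration as the request body follows. This page does not show the parse endpoint's path, HTTP method, or how the output format is selected, so the caller supplies the full URL, and both the POST method and the application/yaml content type are assumptions. The function name is hypothetical, and the configuration string is a stand-in rather than a working Textricator configuration.

```python
import urllib.request


def build_parse_request(parse_url: str, yaml_config: str) -> urllib.request.Request:
    """Build (but do not send) a request carrying a YAML configuration body.

    parse_url must be the complete URL of the parse endpoint for one PDF,
    including whatever selects the output format; it is not guessed here.
    """
    body = yaml_config.encode("utf-8")
    return urllib.request.Request(
        parse_url,
        data=body,
        # Assumption: the endpoint accepts POST with a YAML body.
        method="POST",
        # Assumption: the service may expect a different content type.
        headers={"Content-Type": "application/yaml"},
    )


if __name__ == "__main__":
    # Stand-in text; see the Textricator documentation for real configurations.
    config = "# Textricator configuration goes here\n"
    request = build_parse_request("http://localhost:8080/REPLACE-WITH-PARSE-PATH", config)
    print(request.get_method(), request.full_url, len(request.data), "bytes")
```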
Load the web UI for the specified PDF.
See the Textricator GitHub page for documentation on writing the parser configuration.
TextricatorWeb is licensed under the AGPLv3 (as is Textricator).
Source code for TextricatorWeb 9.0.20 is available at https://github.com/measuresforjustice/textricator-web
Source code for Textricator 9.0.46 is available at https://github.com/measuresforjustice/textricator
This service uses the following libraries: