September 2022

In the last month, we launched several exciting new features, including the ability to convert document extractions to Excel spreadsheets and a beta Zapier integration.

New feature: Extract document data to spreadsheets

You can now download any document extraction as an Excel spreadsheet. This feature lets you easily convert tables, rows, labels, checkboxes, and other document primitives into well structured spreadsheets, unlike many tools that map PDF formatting onto a spreadsheet with no meaningful relationship to the underlying cells.

For example, the document in the Getting started guide converts to the following spreadsheet:

You can download spreadsheets either through the Sensible app or the Sensible API.

For more information, see Quickstart PDF to Excel and SenseML to spreadsheet reference.

Beta feature: Zapier integration

Connect Sensible with your favorite tools -- Google Drive, Dropbox, and more -- through our new Zapier integration.

UX improvement: Autocomplete SenseML in Sensible app

You can now author SenseML faster in the Sensible app by using autocomplete suggestions.

For example, if you type field:

Click to enlarge

Sensible autocompletes field as:

      "id": "",
      "method": {

Improvement: Faster checkbox recognition

Sensible can now use embedded PDF metadata to extract a checkbox's selection status. If the PDF lacks metadata, Sensible falls back to the slower, existing method of pixel recognition. For more information, see Checkbox.

Improvement: Address type in paragraphs

Sensible now recognizes addresses in paragraphs in addition to addresses in block format. Use "block_format":"false" to recognize addresses such as the following example:

Click to enlarge

For more information, see the Address type.

Improvement: Accounting Currency type recognizes trailing negative signs

Sensible now recognizes accounting currencies when formatted with a trailing negative sign, for example, $527.01-.

Improvement: Zip sections

You can now combine all the fields in multiple section groups into a new section group using the Zip computed field method. For an example, see Advanced: Zip sections.

Improvement: Configure SenseML execution order

For advanced use cases, for example to suppress source section groups so you can have clean output for a zipped section group, you can now configure the order in which Sensible executes fields, computed fields, and sections.

For more information, see Field extraction order.

Improvement: Configure the Remove Header and Remove Footer preprocessors with text matches

To recognize footer or header text that varies slightly or isn't present on all pages, you can now bypass automatic header or footer recognition by configuring a text match. Sensible removes all text below the bottom or top boundary of the matched text. For more information, see the Remove header and Remove footer preprocessors.

Improvement: Sections verbosity

When you configure a higher verbosity, Sensible now returns metadata for sections.