September 2022
In the last month, we launched several exciting new features, including the ability to convert document extractions to Excel spreadsheets and a beta Zapier integration.
New feature: Extract document data to spreadsheets
You can now download any document extraction as an Excel spreadsheet. This feature lets you easily convert tables, rows, labels, checkboxes, and other document primitives into well structured spreadsheets, unlike many tools that map PDF formatting onto a spreadsheet with no meaningful relationship to the underlying cells.
For example, the document in the Getting started guide converts to the following spreadsheet:
You can download spreadsheets either through the Sensible app or the Sensible API.
For more information, see Quickstart PDF to Excel and SenseML to spreadsheet reference.
Beta feature: Zapier integration
Connect Sensible with your favorite tools -- Google Drive, Dropbox, and more -- through our new Zapier integration.
UX improvement: Autocomplete SenseML in Sensible app
You can now author SenseML faster in the Sensible app by using autocomplete suggestions.
For example, if you type field
:
Sensible autocompletes field
as:
{
"id": "",
"anchor":,
"type":,
"match":,
"method": {
"id":
}
}
Improvement: Faster checkbox recognition
Sensible can now use embedded PDF metadata to extract a checkbox's selection status. If the PDF lacks metadata, Sensible falls back to the slower, existing method of pixel recognition. For more information, see Checkbox.
Improvement: Address type in paragraphs
Sensible now recognizes addresses in paragraphs in addition to addresses in block format. Use "block_format":"false"
to recognize addresses such as the following example:
For more information, see the Address type.
Improvement: Accounting Currency type recognizes trailing negative signs
Sensible now recognizes accounting currencies when formatted with a trailing negative sign, for example, $527.01-
.
Improvement: Zip sections
You can now combine all the fields in multiple section groups into a new section group using the Zip computed field method. For an example, see Advanced: Zip sections.
Improvement: Configure SenseML execution order
For advanced use cases, for example to suppress source section groups so you can have clean output for a zipped section group, you can now configure the order in which Sensible executes fields, computed fields, and sections.
For more information, see Field extraction order.
Improvement: Configure the Remove Header and Remove Footer preprocessors with text matches
To recognize footer or header text that varies slightly or isn't present on all pages, you can now bypass automatic header or footer recognition by configuring a text match. Sensible removes all text below the bottom or top boundary of the matched text. For more information, see the Remove header and Remove footer preprocessors.
Improvement: Sections verbosity
When you configure a higher verbosity, Sensible now returns metadata for sections.