September 2024
In the last, month we made both an improvement and a breaking change to our support for extracting from Excel documents. We also updated our large language model (LLM) support for the List method and Query Group method in response to end-of-life GPT versions.
Improvement: Standardization for Excel extractions
To improve efficiency and consistency, Sensible introduced breaking changes to our support for extracting from XLSX documents.
Now, Sensible standardizes the size and formatting of the text and cell grid in the spreadsheet, then extracts the text directly. Sensible no longer converts the Excel document to a PDF to perform the extraction. Methods that are less commonly used for spreadsheets (pixel- and OCR-based methods) are no longer supported. If you currently extract data from Excel sheets, check your configs for regressions. For more information, see File types.
Improvement: update from GPT-3.5 Turbo to GPT-4o mini
Sensible updated its default large language model (LLM) engine for the Query Group method and List method from an end-of-life version (GPT-3.5 Turbo) to GPT-4o mini. As part of this change, query groups that contain a single field no longer route to the deprecated Query method, which was formerly a strategy for maintaining backward compatibility with this deprecated method. Test your existing configs for changed output if they use the Query Group method or List method.
New feature: SSO sign-in support
Sensible now offers single sign-on (SSO) implementation for logging into your Sensible account using your company's account. Contact Sensible if you're interested in this feature.