May 2022
In the last month, we released several feature improvements, including the ability to match the most frequently repeated value for a repeated anchor and fuzzy matching for the Currency type.
New feature: Match most frequent
The new "match": "mostFrequent"
enum on the Field query object returns the most frequently matched value. This is useful for OCR text, like poor-quality scans or photographs. For example, if a scanned document contains repeated data for a field anchored on "1 Wages", but due to OCR errors the matched values are 21050.20
, 21850.20
, 21850.20
and 21850.58
, this option returns the most frequent, and therefore the mostly likely correct value, 21850.20
.
Improvement: Name type recognizes lists of names
The Name type now recognizes lists of names, for example Argos Fullington, Jax Odenson, Ollie Longstreet
.
Improvement: Address type
We made several improvements to Sensible's ability to recognize addresses.
Improvement: Currency type recognizes fuzzy OCR output
Use the new Relaxed With Cents parameter with the Currency type when poor-quality scans or photographed documents result in erroneous OCR output for the decimal separator or thousands separator.
If you set this parameter, Sensible overrides all other Currency type parameters and outputs USD currency.