List extraction tips
This Sensible Instruct method extracts repeating data in a document based on your description of the list’s overall contents and each individual item. Data such as the work history or skills on a resume, the vehicles on an auto insurance policy, or the line items on an invoice are best suited for this method.
This method is an alternative to the Table method, when the data you want can appear either as a table or as another layout. The List method can find data in paragraphs of free text or in more structured layouts, such as key/value pairs or tables.
- The list description describes the overall contents for the list, while each property is a single, short description of an item in the list.
- You can use location hints to describe the target list's position in the document. For examples of location hints, see Query extraction tips.
- For more information about how to write descriptions, or "prompts", see Query extraction tips.
The following example shows using the List method to extract data from a restaurant menu:
To try out this example in the Sensible app, take the following steps:
Download the following example PDF:
Example PDF Download link
Create a test document type in the Sensible app, then click the document type you created to edit it. In the document type's Reference documents tab, upload the example PDF you downloaded in a previous step.
Click the document type's Configurations tab, create a new test configuration, and click the configuration you created to edit it.
Click Sensible Instruct and create fields to extract data using the following table:
|Field name||Method||Overall list description||Property ids and descriptions|
|dinners||List||"dinner special menu items"||dinner description - "entree description"|
price - "dinner price"
|desserts||List||"dessert special menu items"||dessert description - "dessert description"|
price - "dessert price"
|wines||List||"red wines and white wines (not other drinks such as beers or liquors)"||wine_name - "wine brand name"|
wine_type - "wine varietal name (not brand), for example, return 'Red:cabernet savignon' or 'white:varietal not found'"
wine_description - "wine description"
smallest_serving_price - "smallest wine serving size and its dollar price, formatted like '6 oz: $11'"
second_smallest_serving_price - "second-smallest wine serving size and its dollar price, formatted like '6 oz: $11'"
bottle_price - "price per bottle, in dollars"
For example, use the following screenshot as a guide for configuring the
For the full reference for this method in SenseML, see List method.
Updated 1 day ago