Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 9255815
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 18, 20262026-06-18T11:47:41+00:00 2026-06-18T11:47:41+00:00

Since there is virtually no documentation or code snippets on programming inside OpenText Capture

  • 0

Since there is virtually no documentation or code snippets on programming inside OpenText Capture Center. I need some input from someone with experience.

Here is the crux of what I need…
In the Scripting Manager, I need to be able to access all of the Phrase objects that the OCR identified in the document, regardless of the Fields matched or identified during extraction.

As long as I have access to the OCR phrases, I can do two things that will greatly increase our matching percentage on any field.

  1. Perform sanitations and transformations of the invoice phrases as a type of pre-processing before matching occurs (I.E. turn Corporation into CORP, remove apostrophes, etc..)
  2. Write a custom matching function that is more understanding of our data than the native Generic SnapMatch.

Thanks!

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-18T11:47:42+00:00Added an answer on June 18, 2026 at 11:47 am

    Ok, ultimately there is no way to do this via the Scripting Manager entry points. The reason for this is that all the image data is parsed and extracted prior to entry into the scripting manager. By the time you get to the extraction phase of the manager, you have an XML Runtime document which represents the meta structure of the output document with data that the extraction “thought might be useful” before entry. All other possible “phrases” and other data types extracted that did not fit a field directly or an alternative is “discarded”. Meaning that the Vendor Name or something similar which DoKuStar didn’t find interesting, is still not searchable with any code mechanism.

    The problem I needed to solve was very specific to my particular domain, and was caused indirectly by policy of the Oracle group. The names of vendors was stripped of special characters and concatenated. Basically, they just did not match what was on the invoice, and therefore snapmatch was virtually useless.

    I created an intermediate solution whereby the local SnapMatch database could be updated by users directly, “Rename Vendor” so to speak. And therefore our local SnapMatch database will match what was on the invoices as we make corrections, even if the Oracle database doesn’t. All in all, not a specific solution to the coding side, but it turned out to be an effective solution to the domain issue.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I am trying to avoid using Net::SSH::Perl library since there is some problems in
Since there is no way to join tables using Google App Engine datastore, I
Since there are no C# source codes for Wait, Pulse, PulseAll methods. Does anyone
Since there is no way that you can make the flash object transparent, there
Since there is no type in ruby, how do Ruby programmers make sure a
Edit Since there were many downvotes and people who didn't understand what I'm asking
I keep coming back to this problem, since there doesn't seem to be a
So my issue is pretty straight forward, since there is seemingly no callback for
When using ModelForm Within forms.py it would save a lot of time, since there
I'm guessing it's not a Perl Compatible Regular Expression, since there's a special kind

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.