I am incorporating OCR into my iPhone app using the open source library “tesseract”. I have seen some apps on the App Store doing this. I am wondering how I can segregate the data that it reads from a business card, i.e., how to distinguish between, say, the first name and the last name, or between the company name and the person’s name, and how to tell whether a number is a phone number or a fax number.
I wish there were a standard format for business cards, but I think it would be unrealistic to assume one. Any pointers on this would help a lot.
I am posting this here to gather thoughts from other developers so that I can come up with good logic for classifying, as accurately as possible, the data we get from the OCR engine.
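To make the question concrete, here is a minimal rule-based sketch of the kind of classification logic I have in mind. It is only a starting point under stated assumptions: the OCR engine returns one card field per line, fax numbers carry a "Fax" or "F" label, company names contain a legal suffix such as "Inc" or "Ltd", and a person's name is two or more capitalized words. The function names (`classify_line`, `parse_card`) and all patterns are hypothetical, and it is written in Python for brevity rather than as iPhone code; nothing here comes from tesseract itself.

```python
import re

# All patterns below are illustrative assumptions, not a standard.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")
FAX_LABEL_RE = re.compile(r"\b(?:fax|f)\b", re.IGNORECASE)
COMPANY_RE = re.compile(r"\b(?:inc|ltd|llc|corp|co|gmbh|pvt)\b", re.IGNORECASE)
NAME_RE = re.compile(r"^[A-Z][a-z]+(?: [A-Z]\.?)?(?: [A-Z][a-z]+)+$")


def classify_line(line):
    """Guess which card field a single OCR line belongs to."""
    text = line.strip()
    if EMAIL_RE.search(text):
        return "email"
    if PHONE_RE.search(text):
        # A number is treated as a fax only when the line is labelled as one.
        return "fax" if FAX_LABEL_RE.search(text) else "phone"
    if COMPANY_RE.search(text):
        return "company"
    if NAME_RE.match(text):
        return "name"
    return "unknown"


def parse_card(ocr_text):
    """Collect the first value found for each field into a dict."""
    fields = {}
    for raw in ocr_text.splitlines():
        text = raw.strip()
        if not text:
            continue
        label = classify_line(text)
        if label == "name":
            parts = text.split()
            fields.setdefault("first_name", parts[0])
            fields.setdefault("last_name", parts[-1])
        elif label in ("phone", "fax"):
            fields.setdefault(label, PHONE_RE.search(text).group())
        elif label == "email":
            fields.setdefault(label, EMAIL_RE.search(text).group())
        elif label == "company":
            fields.setdefault(label, text)
    return fields


if __name__ == "__main__":
    sample = (
        "John Smith\n"
        "Acme Widgets Inc.\n"
        "Tel: +1 (555) 010-2000\n"
        "Fax: +1 (555) 010-2001\n"
        "john.smith@example.com\n"
    )
    print(parse_card(sample))
```

Rules like these break down quickly on unlabelled numbers, company names without a suffix, or names that OCR returns in all caps, which is exactly where I am looking for better ideas.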
1 Answer