I want to implement a program that performs “Vision-based Page Segmentation”. I need some guidance and pointers (practical information, not just academic material).
My preferred languages are JS (jQuery) and PHP.
I read the following article (VIPS: a Vision-based Page Segmentation Algorithm) and I think it can be a good framework for this purpose:
ftp://ftp.research.microsoft.com/pub/tr/tr-2003-79.pdf
Is there any open source implementation of “Vision-based Page Segmentation”?
No. Microsoft was granted a patent on Vision-Based Document Segmentation (VIPS). Try again in 2023. I am truly sorry.
I am not a patent lawyer, but the US patent 7,428,700 claims are quite straightforward:
Now, “a document described by a tree structure having a plurality of nodes” is our old friend, the DOM of a Web page.
Also note that the four inventors are the same four co-authors of the paper cited. I’ll be damned if that’s a sheer coincidence.
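To make the "tree structure having a plurality of nodes" point concrete, here is a minimal sketch of a page represented as a tree and walked depth-first. It is only an illustration of the data structure: it is not the VIPS algorithm and not the method claimed in the patent. The `page` object, its `tag` and `children` fields, and the `walk` and `describe` helpers are all hypothetical names made up for this example.

```javascript
// Hypothetical stand-in for a parsed page: plain objects shaped like DOM nodes.
const page = {
  tag: "body",
  children: [
    { tag: "div", children: [{ tag: "h1", children: [] }, { tag: "p", children: [] }] },
    { tag: "div", children: [{ tag: "ul", children: [{ tag: "li", children: [] }] }] },
  ],
};

// Depth-first walk: call visit(node, depth) on a node, then on each of its children.
function walk(node, visit, depth = 0) {
  visit(node, depth);
  for (const child of node.children) {
    walk(child, visit, depth + 1);
  }
}

// Flatten the tree into "tag@depth" labels, in document order.
function describe(root) {
  const labels = [];
  walk(root, (node, depth) => labels.push(`${node.tag}@${depth}`));
  return labels;
}

console.log(describe(page).join(" "));
```

In a browser, the same kind of walk should work over `document.body`, reading `tagName` instead of the hypothetical `tag` field, since every element exposes its child elements through `children`.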