Recursive X-Y cut

From HandWiki

The recursive X-Y cut is a top-down page segmentation technique that decomposes a document image recursively into a set of rectangular blocks. The algorithm works by projecting the document bitmap (i.e. summing up all the pixels in a line) to the sides of the document page. By this method, a white space density graph is produced, with peaks for vertical or horizontal whitespace lines. These peaks define the cuts of the document and are used top-down to segment the document into smaller pieces.

References