IBM provides a linguistic approach to identifying the main body text of a page, and they present that approach as an improvement upon methods such as a VIPS or a Visual Gap Segmentation process. There are a number of new patent applications from...
↧