public class PDFStructuredTextExtractor
extends Object
This class takes a PDF File as input and extracts the text of it in an
HTML-like hierarchical
object structure (see the package "structure" for the classes itself).
Author:
Benjamin Paassen - bpaassen@techfak.uni-bielefeld.de
Copyright (C) 2013, 2014 Raphael Dickfelder, Jan Göpfert, Benjamin Paassen, Andreas Stöckel, licensed under the AGPL v. 3: http://openresearch.cit-ec.de/projects/scie