public class PreTextBlock extends Object
Modifier and Type | Field and Description |
---|---|
Histogramm<Float> | lengthHisto |
ArrayList<PreTextLine> | lines |
static int | MINIMUMBLOCKSIZE |
Histogramm<Float> | yDistHisto |
Constructor and Description |
---|
PreTextBlock() |
Modifier and Type | Method and Description |
---|---|
void | addLine(PreTextLine line) |
void | addTextPosition(org.apache.pdfbox.util.TextPosition text) |
int | getSize() |
ArrayList<PreTextBlock> | split() Intended to split a TextBlock representing a whole page into separate blocks that might represent, for example, the columns of a two-column text, headings, footnotes, tables and figures, or the document abstract. |
public final ArrayList<PreTextLine> lines
public final Histogramm<Float> yDistHisto
public final Histogramm<Float> lengthHisto
public static final int MINIMUMBLOCKSIZE
public int getSize()
public void addTextPosition(org.apache.pdfbox.util.TextPosition text)
public void addLine(PreTextLine line)
public ArrayList<PreTextBlock> split()
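
The summary for split() describes dividing a page-level block into smaller blocks, but this page does not document how the division is made. The following is only a minimal sketch of one plausible approach, assuming that blocks are separated by unusually large vertical gaps between lines and that the typical gap plays the role of a peak in a histogram such as yDistHisto. The class GapSplitSketch, the constant GAP_FACTOR, and both methods are hypothetical names introduced for illustration; they are not part of PreTextBlock, and the real implementation may work quite differently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch; not part of the documented API. */
class GapSplitSketch {

    /** Assumed threshold: a gap larger than this multiple of the typical gap starts a new block. */
    static final float GAP_FACTOR = 1.5f;

    /**
     * Returns the most frequent gap between consecutive y positions,
     * rounded to whole units (a crude stand-in for a histogram peak).
     */
    static int typicalGap(List<Float> ys) {
        Map<Integer, Integer> counts = new HashMap<>();
        int best = 0;
        int bestCount = 0;
        for (int i = 1; i < ys.size(); i++) {
            int gap = Math.round(Math.abs(ys.get(i) - ys.get(i - 1)));
            int count = counts.merge(gap, 1, Integer::sum);
            if (count > bestCount) {
                bestCount = count;
                best = gap;
            }
        }
        return best;
    }

    /** Groups line y positions into blocks, breaking at unusually large gaps. */
    static List<List<Float>> split(List<Float> ys) {
        List<List<Float>> blocks = new ArrayList<>();
        if (ys.isEmpty()) {
            return blocks;
        }
        float limit = typicalGap(ys) * GAP_FACTOR;
        List<Float> current = new ArrayList<>();
        current.add(ys.get(0));
        for (int i = 1; i < ys.size(); i++) {
            float gap = Math.abs(ys.get(i) - ys.get(i - 1));
            if (gap > limit) {
                // Gap is well above the typical line spacing: close the current block.
                blocks.add(current);
                current = new ArrayList<>();
            }
            current.add(ys.get(i));
        }
        blocks.add(current);
        return blocks;
    }

    public static void main(String[] args) {
        // Three lines 12 units apart, a 40-unit gap, then two more lines.
        List<Float> ys = Arrays.asList(100f, 112f, 124f, 164f, 176f);
        // prints [[100.0, 112.0, 124.0], [164.0, 176.0]]
        System.out.println(split(ys));
    }
}
```

The sketch works on bare y coordinates so that it stays self-contained. A version operating on PreTextLine objects would need accessors that this page does not list, and it would presumably also have to consider line lengths (compare lengthHisto) and a minimum block size (compare MINIMUMBLOCKSIZE).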
Copyright (C) 2013, 2014 Raphael Dickfelder, Jan Göpfert, Benjamin Paassen, Andreas Stöckel, licensed under the AGPL v. 3: http://openresearch.cit-ec.de/projects/scie