org.schwiebert.abl4j.io
Class TigerCorpusReader
java.lang.Object
org.schwiebert.abl4j.io.TigerCorpusReader
- All Implemented Interfaces:
- IConfigurable, ITreebankReader
public class TigerCorpusReader
- extends Object
- implements ITreebankReader
An ITreebankReader that is able to read the Tiger Corpus format - see http://www.
ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/ for details about the
corpus. Existing annotations are converted to ABLs internal data structures.
By setting the property AblProperties.INPUT_IGNORE_ANNOTATIONS to
true, the structural information of the tiger treebank is ignored.
- Author:
- sschwieb
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
reader
private Reader reader
ignoreAnnotations
private boolean ignoreAnnotations
TigerCorpusReader
public TigerCorpusReader()
configure
public void configure(PropertiesMap pm)
throws InvalidConfigurationException
- Specified by:
configure in interface IConfigurable
- Throws:
InvalidConfigurationException
readTreebank
public void readTreebank(ITreeBank tb)
throws IOException
- Specified by:
readTreebank in interface ITreebankReader
- Throws:
IOException
buildConstituent
protected IConstituent buildConstituent(HashMap<String,Object> internalMapping,
ITree tree,
Element nt)
getWordIndex
private int getWordIndex(String id,
HashMap<String,Object> map,
boolean begin)
© 2007 Stephan Schwiebert