home edit page issue tracker

This page pertains to UD version 2.

UD Hebrew HTB

Language: Hebrew (code: he)
Family: Afro-Asiatic, Semitic

This treebank has been part of Universal Dependencies since the UD v1.1 release.

The following people have contributed to making this treebank part of UD: Yoav Goldberg, Reut Tsarfaty, Amir More, Shoval Sadde, Victoria Basmov, Yuval Pinter.

Repository: UD_Hebrew-HTB
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-NC-SA 4.0

Genre: news

Questions, comments? General annotation questions (either Hebrew-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [yoav • goldberg (æt) gmail • com, reut • tsarfaty (æt) gmail • com, habeanf (æt) gmail • com, shovatz (æt) gmail • com, vikasaeta (æt) gmail • com, uvp (æt) cs • bgu • ac • il]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually in non-UD style, automatically converted to UD
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually in non-UD style, automatically converted to UD

Description

A Universal Dependencies Corpus for Hebrew.

Universal Dependencies - Hebrew Dependency Treebank (v2) https://github.com/UniversalDependencies/UD_Hebrew

V1 for the the corpus was built by semi-automatic conversion of the Hebrew Constituency Treebank (v2). V2 is converted from V1, using a combination of automatic conversion when possible, and manual conversion and verification in other cases.

Acknowledgments

The Universal Dependencies Hebrew Treebank created by: (in alphabetic order):

The Universal Dependencies Hebrew Treebank is based on the Hebrew Constituency Treebank (v2) developed by MILA, The Knowledge Center for Processing Hebrew. (http://www.mila.cs.technion.ac.il/resources_treebank.html)

References

You are encouraged to cite these papers if you use the Hebrew Universal Dependencies Treebank:

@inproceedings{tsarfaty2013unified, title={A Unified Morpho-Syntactic Scheme of Stanford Dependencies}, author={Tsarfaty, Reut}, booktitle={Proc. of ACL}, year={2013} }

@inproceedings{mcdonald2013universal, title={Universal Dependency Annotation for Multilingual Parsing}, author={McDonald, Ryan T and Nivre, Joakim and Quirmbach-Brundage, Yvonne and Goldberg, Yoav and Das, Dipanjan and Ganchev, Kuzman and Hall, Keith B and Petrov, Slav and Zhang, Hao and T{"a}ckstr{"o}m, Oscar and others}, booktitle={Proc. of ACL}, year={2013} }

Note that these papers do not accurately reflect the current annotation in the Treebank. A more up-to-date publication is forthcoming.

Statistics of UD Hebrew HTB

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPRONPROPNPUNCTSCONJVERBX

Features

AbbrCaseDefiniteGenderHebBinyanHebExistentialMoodNumberPersonPolarityPrefixPronTypeReflexTenseVerbFormVerbTypeVoice

Relations

aclacl:relcladvcladvmodamodapposcasecase:acccase:genccccompcompound:affixcompound:smixutconjcopcsubjdepdetdiscoursedislocatedfixedflatflat:namemarkmark:qnmodnmod:possnsubjnsubj:copnsubj:outernummodobjoblparataxispunctrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview