Batch Submission Specifications for PanLem

Contributed by Jonathan Pool.

Revised 2011-05-16.

The “PanLem” UI permits users to submit lexical data as files.

There are 3 input file formats: simple text, full text, and XML. On this page is the syntax for the full text format.

The expansion formulae below use multi-character space-delimited tokens as atoms. Spaces are not significant. The operators are represented with standard regular-expression symbols and the following special symbols:

‘’text quotation
newline (Ux000a)
«»arbitrary reordering of repetitions of expansions of enclosed atoms
such that (expansion must comply with the following condition)
⊖ ()all atoms in all expansions of all instances of enclosed atom must be unique
$introducer of variable referenced in condition query

Varilingual Variant

The full text format has three variants: varilingual, centrilingual, and bilingual. The most general variant is the varilingual. Its specification is:

Centrilingual Variant

The centrilingual variant is identical to the varilingual variant except as follows:

Bilingual Variant

The bilingual variant is identical to the centrilingual variant except as follows: