author | title |
---|---|
Bernhard Fisseni | Normalization services for TEI XML files |
This is a web service for applying orthographic normalization, mainly to transcripts of spoken data in TEI.
It uses the functionality provided by the TEI Speech Tools library; see that library's documentation for the available services and parameters.
A running instance is provided by the Leibniz Institute for the German Language.
Besides the dependencies available via Maven, the service needs some utility functions. These can be installed locally with `mvn install`.
Build the project with `mvn clean install`, then deploy the WAR file from `target/` to your servlet container of choice.
The normalization service will be available at `<root>/normalize`. It expects data via `POST` and accepts and returns the following MIME types:
- `application/tei+xml;format-variant=tei-iso-spoken;tokenized=1`
- `application/tei+xml;format-variant=tei-dta;tokenized=1`
- `application/tei+xml`
- `application/xml`
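
A call to the endpoint described above might look like the following minimal sketch. It uses the standard `java.net.http` client (Java 11+). The class and method names are hypothetical, the root URL must be supplied by the caller, and the sketch assumes that the MIME types are passed in the `Content-Type` and `Accept` headers and that the document is sent as UTF-8. Any service parameters documented by the TEI Speech Tools library are left out.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

/** Hypothetical client sketch; not part of the service itself. Requires Java 11+. */
final class NormalizeClientSketch {

    // One of the MIME types listed above; the others can be substituted.
    static final String TEI_ISO_SPOKEN =
            "application/tei+xml;format-variant=tei-iso-spoken;tokenized=1";

    /** Builds (but does not send) a POST request to <root>/normalize. */
    static HttpRequest buildRequest(String root, String teiXml) {
        // Drop a trailing slash so the joined URL contains no double slash.
        String base = root.endsWith("/") ? root.substring(0, root.length() - 1) : root;
        return HttpRequest.newBuilder(URI.create(base + "/normalize"))
                // Assumption: input and output formats are selected via these headers.
                .header("Content-Type", TEI_ISO_SPOKEN)
                .header("Accept", TEI_ISO_SPOKEN)
                .POST(HttpRequest.BodyPublishers.ofString(teiXml, StandardCharsets.UTF_8))
                .build();
    }

    /** Sends the document and returns the response body. */
    static String normalize(String root, String teiXml) throws Exception {
        HttpResponse<String> response = HttpClient.newHttpClient().send(
                buildRequest(root, teiXml),
                HttpResponse.BodyHandlers.ofString(StandardCharsets.UTF_8));
        // Assumption: any 2xx status means the normalized document was returned.
        if (response.statusCode() / 100 != 2) {
            throw new IllegalStateException("Unexpected HTTP status " + response.statusCode());
        }
        return response.body();
    }
}
```

Building the request separately from sending it makes it possible to inspect the URL and headers without a running instance.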
The WADL file is at `SERVICE_NAME/services/application.wadl`, e.g. at IDS: https://clarin.ids-mannheim.de/teilicht/services/application.wadl.
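
To see which resources an instance exposes, the WADL file can be read with the JDK's built-in XML parser. The sketch below is only an illustration: the class and method names are hypothetical, and it assumes the WADL document contains `resource` elements with a `path` attribute, as WADL files normally do. Downloading the file is left to the caller.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.XMLConstants;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper sketch for inspecting a WADL description. */
final class WadlResourcesSketch {

    /** Returns the WADL location for a service base URL (SERVICE_NAME above). */
    static String wadlUrl(String serviceName) {
        String base = serviceName.endsWith("/")
                ? serviceName.substring(0, serviceName.length() - 1)
                : serviceName;
        return base + "/services/application.wadl";
    }

    /** Collects the path attribute of every resource element in a WADL document. */
    static List<String> resourcePaths(String wadlXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);
            Document doc = factory.newDocumentBuilder()
                    .parse(new InputSource(new StringReader(wadlXml)));
            // "*" matches any namespace, so the WADL namespace URI is not hard-coded.
            NodeList resources = doc.getElementsByTagNameNS("*", "resource");
            List<String> paths = new ArrayList<>();
            for (int i = 0; i < resources.getLength(); i++) {
                paths.add(((Element) resources.item(i)).getAttribute("path"));
            }
            return paths;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse WADL document", e);
        }
    }
}
```

For the IDS instance, `wadlUrl("https://clarin.ids-mannheim.de/teilicht")` yields the URL given above.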