Project assignment for the Natural Language course
MEIC-A # 2015/2016
- Troubleshooting using transducers.
Use transducers to implement the following converters:
- hours converter in condensed form hh:mm for a form used in portuguese spoken language.
- Examples:
- 14:25 --> catorze e vinte cinco;
- 10:15 --> dez e quinze ou dez e um quarto;
- The use of the word "fourth / quarto" and "half / meia" is taken into account in the assessment, the converter can generate one or more alternatives.
- The use of the word "hours / horas" and "minutes / minutos" is optional and will not be taken into account in the evaluation.
- Assume that the hours to convert are always valid.
- hours converter as used in the portuguese spoken language to the condensed form.
The file data.sym (available from statement) includes the definition of all symbols that can be used by transducers to develop. This file can NOT be changed.
Please note that all files should use UTF-8 encoding.
To test the proposed solution uses in Linux environment, the tools:
- "Graphviz" (http://www.graphviz.org/);
- "OpenFST" from Google (http://www.openfst.org/twiki/bin/view/FST/FstDownload)
- Install Graphviz and OpenFST
- Clone the repo:
git clone https://github.com/manuelalves/ln-project1.git
- Enter the directory:
cd ln-project1
- Run the script:
./run.sh
- To delete the generated files:
./remove.sh