Visualize language usage in the Norwegian parliament. See it in action at tale.holderdeord.no.
This project consists of two parts:
indexer/
- download and index Stortinget transcripts in ElasticSearchwebapp/
- web frontend to present / visualize the data
$ docker-compose up -d es webapp
$ docker-compose run --rm indexer
- elasticsearch
- node.js
- ruby
Download and index transcripts (requires a local elasticsearch):
$ cd indexer/
$ gem install bundler
$ bundle install
$ bundle exec ruby -Ilib bin/hdo-transcript-indexer
Re-create the index. This is necessary when a mapping is changed:
$ bundle exec ruby -Ilib bin/hdo-transcript-indexer --create-index
Convert a single XML transcript to indexable JSON:
$ bundle exec ruby -Ilib bin/hdo-transcript-converter transcript.xml
Start the webapp in dev mode:
$ cd webapp
$ npm install
$ npm run dev
# open your browser at http://localhost:7575/
- Because of deficiencies in the transcripts, we don't know the correct time for all speeches. The "time" field will in these cases be set to midnight.