Parse (successfully) the PDF containing the represantatives economic interests, etc from stortinget.no into a machine readable format and archive result.
stortinget.no now publish historical versions from 2017 to date. There is some data from 2013 to 2014 only available here.
sudo apt install poppler-utils # debian/ubuntu
brew install poppler # macos
pipenv install --dev
pipenv run python parser.py --help
pipenv run python scraper.py --help
Useful commands for testing the parser
python parser.py --file pdfs/interests-2014-09-24.pdf
pdftohtml -i -xml -stdout pdfs/interests-2014-09-24.pdf > testy.xml # ...and compare
python parser.py --all --verbose # Does anything unexpected change?