POpulation DIstribution-based Gtfs Generator
A realistic public transport dataset generator that serializes its output as RDF/Turtle. It extends the PoDiGG generator by automatically producing Linked Connections RDF in Turtle from the generated GTFS data.
npm install podigg-lc
This tool generates an lc.ttl file in the output_data folder; this file contains the Linked Connections.
Additionally, the folder will also contain the raw GTFS files and a visualization of the data.
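As a quick sanity check after a run, the number of connections in the generated file can be estimated from the shell. This is only a rough sketch: it assumes each connection is typed with the Connection class of the Linked Connections vocabulary (written either as lc:Connection or as a full IRI) and that each such statement sits on its own line. The helper name count_connections is hypothetical and not part of this package.

```shell
# Hypothetical helper: count lines that mention the Linked Connections
# Connection class, either prefixed (lc:Connection) or as a full IRI.
# Assumption: one typed connection per line in the Turtle output.
count_connections() {
  grep -c -E 'lc:Connection|linkedconnections#Connection' "${1:-output_data/lc.ttl}"
}
```

Calling count_connections without arguments inspects output_data/lc.ttl; a different path can be passed as the first argument.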
All generator parameters can be configured using a config file or environment variables, depending on how the generator is invoked. More information about this can be found at https://github.com/PoDiGG/podigg#parameters
For large datasets, the NODE_MEM environment variable can be overridden to raise the maximum memory usage. When running the Docker container, do this by adding the -e NODE_MEM=<memory in MB> option.
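A minimal sketch of how the memory override could be combined with the Docker invocation used elsewhere in this README. The function name run_podigg_lc_docker is hypothetical, and the memory value is whatever suits the machine; only NODE_MEM, GTFS_GEN_SEED, the volume mapping and the image name come from this document.

```shell
# Hypothetical wrapper around the documented docker run command.
# $1: maximum memory in MB (passed as NODE_MEM)
# $2: random seed (passed as GTFS_GEN_SEED)
run_podigg_lc_docker() {
  mem_mb="${1:?memory in MB required}"
  seed="${2:?seed required}"
  docker run -it --rm \
    -v "$(pwd)/output_data:/tmp/output_data" \
    -e "NODE_MEM=$mem_mb" \
    -e "GTFS_GEN_SEED=$seed" \
    podigg/podigg-lc
}
```

For example, run_podigg_lc_docker 8192 123 would start the container with an 8192 MB memory limit and seed 123, writing results to ./output_data.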
The easiest way to run the generator is using the command line tool:
podigg-lc [output folder [path to a JSON config file]]
This config file contains parameters for the generator, as explained in the documentation of the main PoDiGG generator.
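The following sketch shows one way to create a small config file and pass it to the command line tool. The file name podigg-config.json is arbitrary, and the "seed" key is an assumption (taken to be the JSON counterpart of the GTFS_GEN_SEED environment variable); the parameter list of the main generator is the authoritative source for key names.

```shell
# Write a minimal JSON config file.
# Assumption: "seed" is the config key matching GTFS_GEN_SEED.
cat > podigg-config.json <<'EOF'
{
  "seed": 123
}
EOF

# Run the generator only if the CLI is available on the PATH.
if command -v podigg-lc >/dev/null 2>&1; then
  podigg-lc output_data podigg-config.json
else
  echo "podigg-lc not found on PATH; install the package first" >&2
fi
```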
Alternatively, the generator can also be configured using environment variables, as explained below. In that case, the generator must be called as follows:
podigg-lc-env [output folder]
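As an illustration, an environment variable can be set for a single invocation by prefixing the command with it. The wrapper below is a hypothetical convenience function; it assumes that podigg-lc-env reads the same GTFS_GEN_SEED variable that the Docker examples in this README use.

```shell
# Hypothetical wrapper: run the env-configured CLI with a given seed.
# $1: output folder (defaults to output_data)
# $2: random seed (defaults to 123, the value used in the Docker examples)
run_podigg_lc_env() {
  GTFS_GEN_SEED="${2:-123}" podigg-lc-env "${1:-output_data}"
}
```

The variable is only set for that one command, so it does not leak into the rest of the shell session.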
This generator can be run using a Docker container as follows:
Downloading and running the container from Docker Hub:
docker pull podigg/podigg-lc
docker run -it --rm \
-v "$(pwd)/output_data:/tmp/output_data" \
-e GTFS_GEN_SEED=123 \
podigg/podigg-lc
Building and running the container from this repo:
git clone git@github.com:PoDiGG/podigg-lc.git
cd podigg-lc
docker build -t podigg-lc .
docker run -it --rm \
-v "$(pwd)/output_data:/tmp/output_data" \
-e GTFS_GEN_SEED=123 \
podigg-lc
When running in Docker, parameters must be passed as environment variables using the -e option.
The PoDiGG generator is written by Ruben Taelman.
This code is copyrighted by Ghent University – imec and released under the MIT license.