Skip to content

open-data-kazakhstan/ideas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Open Data Kazakhstan
Docs for data engineers

Table of content

English version

Getting started

This is an open-source project fully hosted on GitHub. All contributions are welcome, whether it's a bug report, data engineering or documentation. Below you can find some details about different contributions to this project.

Submitting an idea

Ideas for datasets can be submitted in this issue tracker. Please, provide as much details as possible so that other people can easily understand the idea. Before creating an issue for your idea, make sure it doesn't exist already.

Create a new dataset

When you start working on a new dataset, normally you wouldn't have an existing repository. In that case, you can create one with your own account on Github and transfer it to our organization.

Contributing to an existing dataset

If you found an issue/bug in an existing dataset repository, you can simply open a pull request and assign one of the members of this organization as a reviewer. Once it is reviewed and approved, we make sure it is merged into the main branch.

You can also create an issue for each dataset. Simply open a new issue in the relevant repository. Please, don't use ideas repository for issues in an existing dataset.

Datasets

A single dataset can be placed in a repository, for instance, this is a population dataset: https://github.com/open-data-kazakhstan/population. It should have the following structure:

Metadata specification

We describe data using frictionlessdata specification.

TODO: how to generate datapackage.json.

Data format

We use CSV format for tabular data.

Scripts

We use Python programming language by default and Dataflows library.

Data pipelines

TODO

Releases

No releases published

Packages

No packages published