{hkdatasets} - an R package for Hong Kong data

Background

In the open-source community, sample datasets play an important role for the democratisation of data skills. If not for the accessibility of classic datasets such as iris, mtcars, and Titanic, students of data analysis and data science would have to face one additional barrier to learning. It is not easy to get hold of data that satsifies all the conditions of being (i) interesting, (ii) relevant to the audience, and (iii) free to share publicly.

We noticed that there is a lack of such datasets for those who are interested in practising their data skills on a theme that is meaningful and relevant to them, namely Hong Kong.

For this project, we created an R package to make it easy to practise analysis and visualisation with datasets which are related to Hong Kong.

Our work

If you’re interested, we recommend that you go check out the following links:

Data sets available

  • hkdc - A dataset containing information about Hong Kong District Councillors (elected 2019), with variables such as their constituency, region, share of vote, total votes, link to their Facebook pages, etc. (Sources: https://www.districtcouncils.gov.hk; https://dce2019.hk01.com/)

  • hk_accidents - A dataset containing traffic accidents between 2014 and 2019.

  • hk_collisions - A dataset containing information of collisions by casualties between 2014 and 2019.

  • hk_vehicles - A dataset containing information of vehicles by casualties between 2014 and 2019.

For more information, please visit https://github.com/Hong-Kong-Districts-Info/hkdatasets.

Sources

More coming soon…

project image

LET’S WORK TOGETHER