Datasets

Real-world data can be difficult to obtain. Here I share the datasets collected as part of my research.

Mobile App User Dataset

I surveyed 10,208 people from more than 15 countries on their mobile app usage behavior. The countries include USA, China, Japan, Germany, France, Brazil, UK, Italy, Russia, India, Canada, Spain, Australia, Mexico, and South Korea.

The survey asked respondents about: (1) their mobile app usage behavior, including the app stores they use, what triggers them to look for apps, why they download apps, why they abandon apps, and the types of apps they download; (2) their demographics, including gender, age, marital status, nationality, country of residence, first language, ethnicity, education level, occupation, and household income; and (3) their personality, measured using the Big-Five personality traits.

This dataset contains the results of the survey.

A detailed description of the project and of how I collected the data can be found in my TSE paper (3MB).

Download

Download dataset (7MB).

The datasets are freely available for research use when acknowledged with the following reference:

If you use the data, please tell me your name, research group, and the publications that may result.

For further information please contact me at s.lim [at] cs.ucl.ac.uk

RALIC Dataset

I have collected various datasets on the stakeholders of a real software project and their requirements. A detailed description of the project and of how I collected the data can be found in my thesis (8MB).

The datasets consist of:

  • 1714 recommendations from 61 stakeholders (OpenR)
  • 839 recommendations from 50 stakeholders (ClosedR)
  • 439 ratings from 76 stakeholders on 10 project objectives (RateP-Obj)
  • 1514 ratings from the same 76 stakeholders on 48 requirements (RateP-Req)
  • 3113 ratings from the same 76 stakeholders on 104 specific requirements (RateP-SReq)
  • 262 ratings from 79 stakeholders on 10 project objectives (RankP-Obj)
  • 469 ratings from the same 79 stakeholders on 51 requirements (RankP-Req)
  • 1109 ratings from the same 79 stakeholders on 132 specific requirements (RankP-SReq)
  • 276 ratings from 77 stakeholders on 10 project objectives (PointP-Obj)
  • 670 ratings from the same 77 stakeholders on 45 requirements (PointP-Req)
  • 1219 ratings from the same 77 stakeholders on 83 specific requirements (PointP-SReq)
  • 410 raw textual descriptions of requirements provided by stakeholders (Raw-requirements)
  • stakeholders and their roles (Stakeholders-and-roles)
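
To give a rough sense of how complete each rating dataset is, the counts listed above can be turned into a coverage fraction: the number of ratings divided by the number of possible stakeholder-item pairs. This is only a sketch. It assumes each rating corresponds to one stakeholder rating one item, which the list does not state, and the names `RALIC_RATINGS`, `coverage`, and `summarize` are hypothetical helpers, not part of the dataset.

```python
# Counts copied from the list above: (ratings, stakeholders, items).
# Assumption: one rating = one (stakeholder, item) pair.
RALIC_RATINGS = {
    "RateP-Obj": (439, 76, 10),
    "RateP-Req": (1514, 76, 48),
    "RateP-SReq": (3113, 76, 104),
    "RankP-Obj": (262, 79, 10),
    "RankP-Req": (469, 79, 51),
    "RankP-SReq": (1109, 79, 132),
    "PointP-Obj": (276, 77, 10),
    "PointP-Req": (670, 77, 45),
    "PointP-SReq": (1219, 77, 83),
}


def coverage(ratings, stakeholders, items):
    """Fraction of possible stakeholder-item pairs that have a rating."""
    return ratings / (stakeholders * items)


def summarize(table):
    """Map each dataset name to its coverage, rounded to three decimals."""
    return {name: round(coverage(*counts), 3) for name, counts in table.items()}


if __name__ == "__main__":
    for name, fraction in summarize(RALIC_RATINGS).items():
        print(f"{name:12s} {fraction:.1%}")
```

Under that assumption, the objective-level datasets are the most densely rated and the specific-requirement datasets the sparsest, which may matter when choosing a dataset for a given analysis.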

Download

Download dataset.

Download additional data about the cost (person hours) for each requirement.

The datasets are freely available for research use when acknowledged with the following references:

If you use the data, please tell me your name, research group, and the publications that may result.

For further information please contact me at s.lim [at] cs.ucl.ac.uk

"This is how I did it...I never saved anything for the swim back." – Gattaca (1997) © 2020 Soo Ling Lim