Generate new larger test dataset
As we increase our Performance testing coverage we'll continue to require realistic test data for the features being tested. We currently import a sanitised
gitlab-foss project for our testing, however this is a relatively medium sized repo with much larger ones being used publicly and privately by others. It also doesn't cover other areas of data that we will likely need to test in the future such as a large number of Issues or Branches for example.
Task is to generate a new realistic test data project that's considered large in size as well as having more data points for us to test.
This is difficult though as the largest repos out there are privately own by companies and will contain sensitive info. Strategy then is to explore taking the largest public repo, Linux, and build other data on top of that or expanding our current