Support Viewing Contents of Artifacts Archive Without Downloading It
Everyone can contribute. Help move this issue forward while earning points, leveling up and collecting rewards.
Problem to solve
Add a new API endpoint so that it can list all the files/dirs that the user would get if they download the artifacts archive.
Intended users
Developer, support, unknown
Further details
We need is some way of knowing which artifacts exist, then our scripts can intelligently auto download just the files we care about. For example, we might query the file list, then get all files ending in “.txt” or “.log” and download just those. Since we don’t know what files exist each run, we have to download the entire artifacts.zip, then search for “.txt” or “.log”, and delete everything else. For some of our beefier projects, the download and delete process is slow, bogs down our servers, and is the main bottleneck.
Whenever GitLabs CI runs a pipeline, we need to be able to quickly and easily search an organized set of log files for each job, especially when debugging complex issues. Downloading logs from website GUI takes far too long and is too tedious when we’re debugging something. As a workaround, we have a background daemon that downloads all job artifacts every so often, strips just the small summary logs files we need, and organizes them as per our needs on a shared server.