Add initial Prometheus metrics server to runner manager

Added 2 commits:

2e75f620...4362d0a0 - 2 commits from branch gitlab-org:master

Compare with previous version

Added 1 commit:

9b360181 - 1 commit from branch gitlab-org:master

Compare with previous version

Added 4 commits:

98968b3b - Fix data races around runner health and build stats
226739f1 - Update Prometheus client library vendoring
adc63d4a - Add Prometheus metrics HTTP server
1050bb66 - Add version info Prometheus metric

Compare with previous version

Added 28 commits:

1050bb66...c73f6889 - 25 commits from branch gitlab-org:master
56b621f4 - Update Prometheus client library vendoring
b690b13e - Add Prometheus metrics HTTP server
011f756e - Add version info Prometheus metric

Compare with previous version

Added 1 commit:

75506872 - Add version info Prometheus metric

Compare with previous version

Mentioned in issue gitlab-com/infrastructure#543 (closed)

@ayufan can you or someone in the team comment?

@tmaczukin

Could you review that?

@juliusv Generally it looks good.

So right now, only whatever is configured at process startup takes effect. Is that ok? Should it be a command-line flag instead?

I think it'll be OK for now. We can think how to make HTTP server able to reload in the future.

Generally, this all leads to somewhat more code, but more explicit dependencies and encapsulation.

I agree with such approach :)

@tmaczukin Excellent, thanks. Anything else to change, or should I remove the WIP and get it merged?

@juliusv We should update configuration docs and add a basic documentation for metrics server and the I think we can merge this. After this we would be able to prepare a RC for 1.8, deploy it to our internal runners and add the runners to our Prometheus instance to check how it's working.

@tmaczukin Thanks, I'm adding documentation now, inspired again by https://gitlab.com/gitlab-org/gitlab-ci-multi-runner/merge_requests/219.

Added 1 commit:

24aabf0b - Add documentation about Prometheus HTTP server

Compare with previous version

Added 1 commit:

44d6646c - Add documentation about Prometheus HTTP server

Compare with previous version

Unmarked this merge request as a Work In Progress

Marked the task Documentation created/updated as completed

@tmaczukin I added documentation and removed the WIP marker!

Added 1 commit:

ad7f3f0f - Add documentation about Prometheus HTTP server

Compare with previous version

@tmaczukin Hm, the build pipeline has been stuck in "Pending" state after the successful Prebuild step for about an hour now: https://gitlab.com/juliusv/gitlab-ci-multi-runner/pipelines/4775447 - did I manage to break it by force-pushing too much or something?

@juliusv no it's most likely the problem with sidekiq builds queuing. Can you restart the unit tests build (https://gitlab.com/juliusv/gitlab-ci-multi-runner/builds/5602039)? It fails randomly and this one doesn't seems to be related with your changes 😉

Add initial Prometheus metrics server to runner manager

What does this MR do?

Why was this MR needed?

Are there points in the code the reviewer needs to double check?

General metrics approach

Configuration reloading

Does this MR meet the acceptance criteria?

What are the relevant issue numbers?

Activity

Admin message

Admin message

Add initial Prometheus metrics server to runner manager

What does this MR do?

Why was this MR needed?

Are there points in the code the reviewer needs to double check?

General metrics approach

Configuration reloading

Does this MR meet the acceptance criteria?

What are the relevant issue numbers?

Merge request reports

Activity