
Huge spike on CPU and other resources when couchdb-prometheus-exporter is set database=_all_dbs with 2600 databases #259

Sdas0000 opened this issue Oct 20, 2023 · 8 comments

@Sdas0000

Since the metrics collection frequency is set to 1 minute, couchdb-prometheus-exporter attempts to collect information for all 2600 databases every minute, which impacts the performance of the cluster.
Is there a way to collect database information sequentially, or in batches of a configurable size?
Can we add a parameter to control the database collection frequency (e.g. every 6 hours or 12 hours)?

@gesellix
Owner

I think we'll have to change the collector to continuously (with a configurable frequency) perform scrapes across the databases, just like you suggested in your last question. This might not be a quick fix, though; I'll have to check.

You might work around the issue by running multiple couchdb-prometheus-exporter instances and configuring each for only a subset of your databases. The Prometheus configuration would then have to scrape all those exporters, obviously. This is only a workaround.
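As a sketch of that workaround, the Prometheus side could look like the fragment below. The host names, ports, and the `--databases` flag value in the comments are illustrative assumptions, not taken from a real deployment:

```yaml
# prometheus.yml (fragment) -- scrape two exporter instances,
# each started with a different subset of the CouchDB databases
scrape_configs:
  - job_name: couchdb
    static_configs:
      - targets:
          - exporter-a:9984   # e.g. started with --databases=<first half>
          - exporter-b:9984   # e.g. started with --databases=<second half>
```

Each exporter then only issues requests for its own database subset, so the per-instance load against CouchDB is reduced, at the cost of running and configuring several exporters.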

@gesellix
Owner

@Sdas0000 please have a look at the database.concurrent.requests parameter as introduced with #46. It allows you to limit the number of concurrent requests between the exporter and the CouchDB cluster, which might help in your environment.

Nevertheless I'm going to implement an option to decouple Prometheus' scrape interval (Prometheus -> Exporter) and the exporter's scrape interval (Exporter -> CouchDB). Beware that this might have the undesired effect of collecting stale metrics.

@gesellix
Owner

I just released v30.9.0 with a new flag to perform scrapes at a configurable interval independent of Prometheus scrapes. Example: --scrape.interval=6h for an interval of 6 hours (default is 0s).
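Putting the new flag together with the concurrency limit mentioned earlier, an invocation could look roughly like the sketch below. The CouchDB URI and the exact spelling of flags other than `--database.concurrent.requests` and `--scrape.interval` are assumptions here, so check the exporter's `--help` output for your version:

```shell
# Sketch: collect from CouchDB every 6 hours, with at most 10 concurrent
# requests; Prometheus keeps scraping the exporter as usual and receives
# the most recently cached metrics in between.
couchdb-prometheus-exporter \
  --couchdb.uri=http://localhost:5984 \
  --databases=_all_dbs \
  --database.concurrent.requests=10 \
  --scrape.interval=6h
```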

Please leave some feedback and let me know whether you need more optimization for your setup. Thanks!

@gesellix
Owner

gesellix commented Nov 6, 2023

Closing now, feel free to leave feedback here or file another issue in case you still run into performance issues.

@gesellix gesellix closed this as completed Nov 6, 2023
@Sdas0000
Author

Sdas0000 commented Nov 7, 2023

scrape.interval scrapes everything at that interval; our issue is that _all_dbs (2600 databases) are still scraped at the same time. We are looking for a database-level scraping interval.

@gesellix
Owner

gesellix commented Nov 7, 2023

I think you should give the option described in #259 (comment) a try. It would allow you to define "buckets" for requests to your cluster. Did you have a look at that option?

@gesellix gesellix reopened this Nov 7, 2023
@Sdas0000
Author

We tried database.concurrent.requests = 100, but that didn't help; we still see the same high CPU. What we are looking for is a scrape.interval specific to database-level metrics (like doc count, disk utilization, etc.), while other metrics continue to be collected as usual. It would also help to have a parameter like "database scrape batch size", which would scrape only one batch at a time and pick up the next batch after the first one finishes; that way it may use fewer resources. Basically we need disk usage, doc count, etc. only a few times a day, but the other information we need continuously throughout the day.
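The batching idea could be sketched roughly as below. This is not an existing exporter feature, just an illustration of the requested behavior; `scrape_database` is a hypothetical stand-in for one per-database metrics request:

```python
from itertools import islice

def batches(items, size):
    """Yield successive fixed-size batches from a list of database names."""
    it = iter(items)
    while batch := list(islice(it, size)):
        yield batch

def scrape_in_batches(databases, batch_size, scrape_database):
    """Scrape one batch at a time; the next batch starts only after the
    previous one has finished, so peak load on the cluster stays bounded."""
    for batch in batches(databases, batch_size):
        for db in batch:
            scrape_database(db)

# Example: 2600 databases in batches of 100 -> 26 sequential batches
dbs = [f"db{i:04d}" for i in range(2600)]
seen = []
scrape_in_batches(dbs, 100, seen.append)
```

With a small sleep between batches (or a batch interval flag), the same total work would be spread over time instead of hitting all 2600 databases at once.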

@gesellix
Owner

I think I need to reproduce the issue myself... monitoring 2600 databases... and then try to make it work using fewer resources. For the time being I don't have a better suggestion than the one above in #259 (comment): deploying multiple exporter instances, each dedicated to a specific range of databases.
