0094: How to run ANALYZE (to collect statistics)
@fomin.list @vitabaks @msdousti
- 0094_how_to_run_analyze.md (new file)
# How to run ANALYZE (to collect statistics)

The command `ANALYZE` collects statistics about a database ([docs](https://www.postgresql.org/docs/current/sql-analyze.html)). Maintaining fresh statistics is crucial for achieving good database performance.

Running it is trivial:

```sql
analyze;
```

However, being single-threaded, this can take a lot of time.

## How to run ANALYZE at full speed

To utilize multiple CPU cores, we can use the client program `vacuumdb` with the option `--analyze-only` and multiple workers ([docs](https://www.postgresql.org/docs/current/app-vacuumdb.html)).

The following runs `ANALYZE` on *all* databases (`--all`), using a number of workers matching the number of vCPUs, and limiting the overall duration to 2 hours (connection options such as `-h` and `-U` are not shown here):
```bash
{
while IFS= read -r line; do
  echo "$(date '+%Y-%m-%d %H:%M:%S') $line"
done
} < <(
  PGOPTIONS='-c statement_timeout=2h' \
  vacuumdb \
    --analyze-only \
    --all \
    --jobs $(nproc) \
    --echo
) | tee -a analyze_all_$(date +%Y%m%d).log
```

With this snippet, all the commands will also be printed and logged, with timestamps (alternatively, instead of the `while` loop, one could use `ts` from `moreutils`).

`--jobs $(nproc)` works on Linux and sets the number of workers to match the number of vCPUs. Note that if there are large unpartitioned tables, at some point only a few workers may remain active. A solution to this problem can be partitioning: with many smaller partitions, all workers can remain busy, which can speed up the whole operation drastically on machines with a high number of CPU cores.
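As a quick way to verify the effect of such a run, here is a minimal sketch using the standard `pg_stat_user_tables` view to list the least recently analyzed tables (run it in each database of interest; the `limit` is arbitrary):

```sql
-- when was each table last analyzed, manually or by autovacuum?
select
  schemaname,
  relname,
  last_analyze,
  last_autoanalyze,
  analyze_count,
  autoanalyze_count
from pg_stat_user_tables
-- never-analyzed and least recently analyzed tables first
order by greatest(last_analyze, last_autoanalyze) asc nulls first
limit 20;
```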
This was a very nice HowTo, thanks!

I put a few minor comments. Two things that IMHO can improve the how-to:

- A paragraph about the required permission to run `analyze` on a table/database (see the sketch after this list).
- Pointing out cost-based vacuum delay. We have a pg_cron job that vacuums (and analyzes) our tables during midnight, when the load is low. But we found it rather intrusive, so we adjusted the parameters to diminish its effect. This is the flip side of this HowTo, which shows how to run it as fast as possible. (I saw your Tweet, and I'm wondering whether these parameters affect only vacuum, or analyze as well, since the documentation listed above mentions both: "During the execution of VACUUM and ANALYZE commands...")
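On the first point, a minimal sketch of the permissions involved, assuming PostgreSQL 17+ where the `MAINTAIN` privilege and the predefined `pg_maintain` role are available (on older versions, `ANALYZE` on a table generally requires the table owner, the database owner, or a superuser); the role and table names below are illustrative:

```sql
-- PostgreSQL 17+: grant the MAINTAIN privilege on a single table
-- (covers ANALYZE, VACUUM, and other maintenance commands)
grant maintain on table public.orders to analyze_runner;

-- or grant maintenance rights on all relations via the predefined role
grant pg_maintain to analyze_runner;
```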
Docs say it affects both, despite the name: https://postgres.ai/chats/01928625-70a6-78f6-a20c-17d74bcad95a (checked source code there too)
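A minimal sketch of the knobs in question (values are purely illustrative; for manual `VACUUM`/`ANALYZE` the delay is off by default, `vacuum_cost_delay = 0`, while autovacuum has its own `autovacuum_vacuum_cost_delay`):

```sql
-- inspect the current cost-based delay settings
select name, setting, unit
from pg_settings
where name in (
  'vacuum_cost_delay',
  'vacuum_cost_limit',
  'autovacuum_vacuum_cost_delay',
  'autovacuum_vacuum_cost_limit'
);

-- throttle a manual run in the current session only:
-- a non-zero delay makes VACUUM/ANALYZE sleep each time it has
-- accumulated vacuum_cost_limit "points" of page accesses
set vacuum_cost_delay = '2ms';
set vacuum_cost_limit = 200;
analyze verbose public.orders;  -- table name is illustrative
```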
Thanks!!
In our case, we have a super time-critical service with p99 < 5ms.
A normal vacuum of the whole DB takes only 20 mins, but during it p99 exceeds 1 second, which is not acceptable for us. After some fine-tuning, I could run it with p99 < 200 ms over the course of 2-3 hours, which is acceptable during midnight.
PS: We run a scheduled `vacuum (analyze, skip_locked, verbose);`. Without `vacuum` (only `analyze`), it would not be very intrusive and would run quite fast, as the DB is not super big (100-200 GB).
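A minimal sketch of how such a nightly job could be scheduled, assuming a reasonably recent pg_cron (job name and schedule are illustrative):

```sql
-- requires the pg_cron extension; runs every night at 00:30
select cron.schedule(
  'nightly-vacuum-analyze',
  '30 0 * * *',
  'vacuum (analyze, skip_locked, verbose);'
);
```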
I saw an interesting case today: we have a ~4 TB database with fewer than 20 tables. The largest tables are ~1.5 TB, 1 TB, and 0.5 TB. A nightly `analyze` (not `vacuum`, just `analyze`) is executed on the whole DB.

- This was previously run on RDS (32 GB RAM, 8 vCPUs) and took ~15 mins.
- We recently migrated to Aurora (64 GB RAM, 8 vCPUs) and it takes ~30-60 mins.
We are investigating ...
I'd be curious to learn what you find. I can also share this post I read recently, which shows classic RDS (with a large disk) being faster than Aurora for disk operations; the difference seems similar to what you observed.
We started analyzing table by table. Huge tables (> 1 TB) were quite fast and used very little memory (400 MB). But on one small table (3 GB), the analyze took long, memory grew pretty fast, and the DB server was killed due to OOM.
We used `vacuum` and `vacuum freeze` on the table; both ran without issue.

When running `vacuum full`, we saw the issue:

`ERROR: found xmin 183032816 from before relfrozenxid 514166874`
This means the table visibility map is corrupted.
This is a known issue: https://postgrespro.com/list/thread-id/2501007
A tool like `pg_surgery` offers `heap_force_freeze()`, which we can't use on Aurora.
I found another way to fix the visibility maps, which seems to work. But I'm surprised that running `analyze` on such a table can OOM-kill the whole server, as opposed to gracefully terminating the analyze process.
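For anyone running into a similar `relfrozenxid` error, a minimal sketch of a standard catalog query to find the relations with the oldest frozen XID (the `limit` is arbitrary):

```sql
-- relations with the highest XID age first; these are the first
-- candidates when investigating freezing / relfrozenxid problems
select
  c.oid::regclass as relation_name,
  c.relfrozenxid,
  age(c.relfrozenxid) as xid_age
from pg_class as c
where c.relkind in ('r', 'm', 't')  -- tables, matviews, TOAST
order by age(c.relfrozenxid) desc
limit 10;
```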