Commit Graph

604 Commits

Author SHA1 Message Date
Alberto
ef56eab3d7 Fixed bug in build.sh 2023-06-13 11:25:44 +02:00
0xFugue
996b7323f4 dstats/host-proc analysis (#125)
* cli scaffolding for processings metrics

* pandas description: dstats

* added dstats columns

* added procfs columns

* prune procfs/dstats columns

* sanitize, convert dstats

* re-write dstats and size conversion

* faster Human2ByteConveter

* dstats violin plots

* dstats, all plots, cdf

* all plots, pdf

* added prefix option

* option for cdf/pdf plots

* add jordis propagation delay calc - partial

* dstats takes only one arg now

* refactor jordis module

* added file check to dstats

* make analysis classes singleton

* ProcFS read/processing

* refactoring; added Plot baseclass

* added TODO

* Plot, violin_plots_helper()

* added ContainerID

* procfs plots - v0

* add rpc/metrics and update config.json

* added settling time plots

* setting labels

* added InOctets/OutOctets

* procfs plots - v1

* MiBs

* TODO

* change perms for prometheus and grafana configs

* conditional import tomli/tomllib

* BLKR/BLKW in MiB

* grafana dashboard

* run.sh refactoring

* dstats is now a standalone module

* added dstats dir

* bug fix

* fast pid2node_name

* kurtosis version check

* dstats bug fix

* bug fix

* dstats analysis - v2

* container2node_name via kurtosis inspect

* added dstats to cleanup

* dstats - container2name - v3

* dstats - v3

* host-proc - v2

* delete old host-proc launcher

* host-proc - v2: inline cpu stats

* host proc v2 : analysis : cpu stats

* add kinspect to host-proc

* host-proc-helper.sh

* host-proc v4

* plots 4 total and avg metrics

* clusters: kmeans

* per container clusters

* panelling is now class-wide

* per container clusters

* take care of incomplete samples

* extract the maximal, complete sample set

* grouping is now done by Key + removed HWM

* host-proc bug fix

* uniformise run.sh conditionals

* bug fix run.sh

* --full-uuids

* remove kurtosis inspect altogether

* cleanup run.sh

* kurtosis run cleanup

* added container size

* host-proc-helper.py

* run.sh

* incorporated config.json

* run.sh

* analysis.py

* config.json .plotting.host-proc

* conditional histograms - v0

* host-proc-helper.py omit fields

* conditional histograms

* compare plots done

* config.json dstats/host-proc

* config.json dstats/host-proc

* dstats/host-proc plotting run.sh

* run.sh

* cmd_helper

* analysis requirements.txt

* monitoring/host-proc/analysis.py -> analysis-module/src/hproc.py

* run.sh

* make top_plot strict

* remove/modify prints

* cleanup the old dockers

* set default to compare plots

* hproc.py

* src/main.py

* Dockerfile main.py

* plot_compare_panel

* set common --out-prefix

* build.sh

* work around ifindex issues

* run.sh: synchronous sudo and supressed analysis output

* run.sh

* venv for host-proc

* cleanup container-proc outputs

* add run summary to plot titles

* delete monitoring/host-proc/analysis.py

* remove log.info()

* host-proc: profile only wakunodes with config

* col rename, output plot dir, medians

* CPU % fixed

* export all plots to wakurtosis_log

* setting defaults to config.json

* config.json: num_nodes >= fanout

* run.sh plot dirs

* MemUse to VmPeak/VmSize is now configurable

* install python3-venv

* K -> KiB; i -> fo in summary
2023-06-08 14:11:36 +05:30
Daimakaimura
244b3e5128 Merge pull request #111 from logos-co/AnalysisMasterMerge
Container process level monitoring and metrics
2023-05-23 07:07:17 +01:00
Daimakaimura
554ee2a40c Added explicit infra conditions to run the analysis 2023-05-22 13:09:06 +01:00
Daimakaimura
3db85b3d82 Fixed wrong parameter function call 2023-05-22 13:08:08 +01:00
Daimakaimura
6acfe22698 Deleted old container-proc monitoring venv build 2023-05-22 12:55:46 +01:00
Daimakaimura
fbf0e39027 Make sure we are running the analysis properly depending on the infra 2023-05-22 07:50:22 +01:00
Daimakaimura
5deab3b34d Merge remote-tracking branch 'origin/master' into AnalysisMasterMerge 2023-05-22 07:49:52 +01:00
Daimakaimura
aaf86045f1 Ignoring temp files from monitoring 2023-05-19 11:45:15 +01:00
Daimakaimura
271d9ed9c4 Minor 2023-05-19 11:44:39 +01:00
Daimakaimura
5451b29a79 Minor 2023-05-19 11:20:24 +01:00
Daimakaimura
6ccc9fd609 Container-proc image builder 2023-05-19 11:19:42 +01:00
Daimakaimura
a52048cf41 Fix the paths 2023-05-19 11:18:48 +01:00
Daimakaimura
4c06bfb64f Added logic to call container-proc container 2023-05-19 11:18:14 +01:00
Daimakaimura
5895030fc4 Added container-proc build image 2023-05-19 11:17:57 +01:00
Alberto Soutullo
e7bd7baa16 Merge pull request #115 from logos-co/Alberto/analysis_multinode
Accumulated metrics into analysis image
2023-05-18 18:16:22 +02:00
Daimakaimura
35c18ac220 Move cproc venv to root 2023-05-18 08:46:50 +01:00
Daimakaimura
cee707220a Ignore cproc venv 2023-05-18 08:46:03 +01:00
Alberto
d21242868e Updated README.md 2023-05-17 17:02:03 +02:00
Alberto
8fafb3a847 Modified tests 2023-05-17 17:01:22 +02:00
Alberto
ea9b8d1d2b Now plots are correctly plotted 2023-05-17 17:01:17 +02:00
Alberto
ef71404111 Corrected plotting configurations 2023-05-17 17:00:44 +02:00
Alberto
01fb6fd58a Changed config structure again 2023-05-17 17:00:21 +02:00
Daimakaimura
349d17cae6 Typo 2023-05-16 13:26:53 +01:00
Daimakaimura
f3c991ddc7 Fixed unused imports 2023-05-16 13:26:11 +01:00
Daimakaimura
57f59117a5 Fixed imports 2023-05-16 13:24:46 +01:00
Daimakaimura
608e6b4e6d Refactored main and added cproc and cadvisor helpers 2023-05-16 13:20:45 +01:00
Daimakaimura
90e16f5f44 Added container-prioc run example 2023-05-16 09:45:35 +01:00
Daimakaimura
c65af20ccc Added -i arg to the test 2023-05-16 09:42:44 +01:00
Daimakaimura
49b9436fc2 Added container-prioc analysis logic 2023-05-16 09:38:51 +01:00
Daimakaimura
466e458952 Added plotting function for cproc 2023-05-16 09:03:39 +01:00
Daimakaimura
855500318a Fixed master merge issues 2023-05-16 08:59:34 +01:00
Daimakaimura
e9fc710137 Merge branch 'master' into AnalysisMasterMerge 2023-05-16 08:35:55 +01:00
Alberto Soutullo
967e6b8fdc Added extra information in README.md 2023-05-15 17:37:50 +02:00
Alberto Soutullo
e539a2846b Changed plotting.py to match latest changes 2023-05-15 17:37:39 +02:00
Alberto Soutullo
ec84d1edba Created file to save metrics plot configuration 2023-05-15 17:37:20 +02:00
Alberto Soutullo
ce33e9ec3b Changed prometheus to deal with changes in config.json 2023-05-15 17:36:47 +02:00
Alberto Soutullo
087a562446 Modified config.json to reduce verbosity. 2023-05-15 17:36:20 +02:00
Daimakaimura
db834cf2af Switched to default probe 2023-05-15 12:07:21 +01:00
Daimakaimura
91dead2485 Added tests 2023-05-15 12:07:02 +01:00
Daimakaimura
2026527cb5 Fixed tests 2023-05-15 12:06:38 +01:00
Alberto Soutullo
d7b3b3b010 Added analysis to run.sh 2023-05-11 13:16:36 +02:00
Daimakaimura
d223ca8b40 Removed rounding from latencies 2023-05-11 09:43:16 +01:00
Daimakaimura
96638c8745 Refactoring 2023-05-11 09:42:32 +01:00
Daimakaimura
fe823290f4 Updated test_parse_args to test all args 2023-05-11 07:53:59 +01:00
Daimakaimura
fea0d10384 Calls analysis build script instead 2023-05-11 07:47:13 +01:00
Daimakaimura
4611bc356e Added docker imgae build script 2023-05-11 07:46:47 +01:00
Daimakaimura
330a91103b Cleanup commented function 2023-05-11 07:16:50 +01:00
Daimakaimura
cf88885e16 For dash source should be . 2023-05-11 07:13:45 +01:00
Daimakaimura
ec63fbb9fb Ignore container-proc venv 2023-05-11 07:02:46 +01:00