CMX crashes and web uploading
Posted: Tue 04 Jul 2023 7:01 am
Just reporting that my CMX has crashed three times in the last 3 weeks. All seem related to uploads to the web server, so maybe this is the same issue as in viewtopic.php?p=172284#p172284 but I'm not sure of this, hence this separate thread.
On the first two occasions … 13 and 29 June, see the first two diags files … the crashes were associated with uploads in the early hours of the morning. It may be of interest that typically around 00:30, on many/most mornings, a MySQL upload fails, an alarm is triggered, but usually the data do end up being uploaded. I have assumed/guessed this “routine” problem was triggered by a the web server issue that was, maybe, not handled smoothly by CMX. Though note the 29 June crash seems to have started with a PHP upload failure.
The third “crash”, again in the early hours of the morning ... this morning 04 July, see third diag file ... was a little different but was again associated with web uploading. The timeline seems to be this:
00:31:32 MySQL uploading error. ie the “regular” occurrence that “seems” benign, but there are other uploading errors occurring in the early hours of the morning, none seemingly fatal.
02:03:14 This error seems more significant. From this point on, uploads are unsuccessful - all of them.
04:09:07 Something more sinister happens here and it is the last entry in the diag file for about 2 hours until:
06:11:08 Exiting system due to external SIGTERM signal ... that's me attempting a restart. This didn't "seem" to work so I rebooted the rpi. I was in a hurry, so maybe didn't give it enough time.
I noticed that during this two hour gap, the monthly file continues to be updated. But the “RecentData” table in cumulusmx.db goes from an entry every minute to once every 5 minutes. (I guess the RecentData table is where CMX gets it's one minute data for the recent graphs?)
I've only picked the eyes out of the diag files, so maybe those with a better understanding will see significance in things I haven't paid attention to. Anyhow I would be grateful for insights. From my experience crashes are rare so I don't have much experience in reading these tea leaves.
Edit: and I forgot to add that WLL stopped reporting during some of the time this morning
On the first two occasions … 13 and 29 June, see the first two diags files … the crashes were associated with uploads in the early hours of the morning. It may be of interest that typically around 00:30, on many/most mornings, a MySQL upload fails, an alarm is triggered, but usually the data do end up being uploaded. I have assumed/guessed this “routine” problem was triggered by a the web server issue that was, maybe, not handled smoothly by CMX. Though note the 29 June crash seems to have started with a PHP upload failure.
The third “crash”, again in the early hours of the morning ... this morning 04 July, see third diag file ... was a little different but was again associated with web uploading. The timeline seems to be this:
00:31:32 MySQL uploading error. ie the “regular” occurrence that “seems” benign, but there are other uploading errors occurring in the early hours of the morning, none seemingly fatal.
02:03:14 This error seems more significant. From this point on, uploads are unsuccessful - all of them.
04:09:07 Something more sinister happens here and it is the last entry in the diag file for about 2 hours until:
06:11:08 Exiting system due to external SIGTERM signal ... that's me attempting a restart. This didn't "seem" to work so I rebooted the rpi. I was in a hurry, so maybe didn't give it enough time.
I noticed that during this two hour gap, the monthly file continues to be updated. But the “RecentData” table in cumulusmx.db goes from an entry every minute to once every 5 minutes. (I guess the RecentData table is where CMX gets it's one minute data for the recent graphs?)
I've only picked the eyes out of the diag files, so maybe those with a better understanding will see significance in things I haven't paid attention to. Anyhow I would be grateful for insights. From my experience crashes are rare so I don't have much experience in reading these tea leaves.
Edit: and I forgot to add that WLL stopped reporting during some of the time this morning