APC UPS Data Center & Enterprise Solutions Forum
Schneider, APC support forum to share knowledge about installation and configuration for Data Center and Business Power UPSs, Accessories, Software, Services.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:50 PM
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:50 PM
I have an AP9631 that seems to "Network Interface restarted" every couple days or so. I have upgraded the firmware to v.6.4.6 , but this has not solved the issue. I looked through the logs files but dont see anything helps narrow down where the problem is.
I am attaching the sanitized files if anyone could help narrow down the issue.
Thanks
Link copied. Please paste this link to share this article on your social media post.
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:49 PM
@APC I could reproduce the behavior again and got a core dump where should I send the snapshot?
Snippets:
dump.txt
09/07/2018 10:17:26 Failsafe Reset Specific code = 200 AOS v6.5.6 rpdu2 v6.5.6 Serial Number: 5A1750E04651 AOS Binary Date/Time: Mar 30 2018 16:51:08 APP Binary Date/Time: Apr 24 2018 11:42:41 Task Dump Task ID 166 Data in RAM is incomplete - dump aborted!
events.txt:
09/12/2018 21:53:37 System Network Interface restarted. 0x0002 (occurs multiple times)
debug.txt:
09/12/2018 21:20:27 Requested Reset
09/12/2018 21:53:38 Failsafe Reset
09/12/2018 22:49:52 Failsafe Reset
...
09/07/2018 09:18:33 .\os\os_heap.c:191 check_heap: block size mismatch. 09/07/2018 09:18:40 Software Exception Reset
09/07/2018 10:14:21 Less than 100 bytes in stack, task 220
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:50 PM
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:50 PM
There's a watchdog timer built into the AP9631 platform. If the network is quiet enough, it might sense the lack of traffic as a problem and restart the management card. There is also a ping test that takes place from time to time and a failure there could also restart the card.
Link copied. Please paste this link to share this article on your social media post.
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:49 PM
It looks like a known issue with that firmware -- you could try reducing notifications ie removing SNMP trap receivers and email recipients.
Link copied. Please paste this link to share this article on your social media post.
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:49 PM
Will there be a possibility to disable this watchdog and/or ping feature in the future?
I tried to disable all unecessary services: ftp, snmp but I still get the network card restart and it takes up to 4 minutes
It is especially annoying for me because I use the APC7920B in automated tests which then fail because I cannot connect to the device when the interface is restarting.
Additional information:
aos: v6.5.6
rpdu2g: v6.5.6
bootmon: v1.0.8
Link copied. Please paste this link to share this article on your social media post.
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:49 PM
If you request a "Technical Support Debug Information Download" from the About / Support web page on your device, you'll get a .tar.gz file which contains a number of useful things. I don't remember which file it is (maybe the debug one), but one of them has a code identifying the cause of the watchdog reset. One of the APC folks can probably look that up for you and tell you why your card is resetting.
Link copied. Please paste this link to share this article on your social media post.
Link copied. Please paste this link to share this article on your social media post.
Posted: 2021-06-29 06:15 AM . Last Modified: 2024-03-12 11:49 PM
@APC I could reproduce the behavior again and got a core dump where should I send the snapshot?
Snippets:
dump.txt
09/07/2018 10:17:26 Failsafe Reset Specific code = 200 AOS v6.5.6 rpdu2 v6.5.6 Serial Number: 5A1750E04651 AOS Binary Date/Time: Mar 30 2018 16:51:08 APP Binary Date/Time: Apr 24 2018 11:42:41 Task Dump Task ID 166 Data in RAM is incomplete - dump aborted!
events.txt:
09/12/2018 21:53:37 System Network Interface restarted. 0x0002 (occurs multiple times)
debug.txt:
09/12/2018 21:20:27 Requested Reset
09/12/2018 21:53:38 Failsafe Reset
09/12/2018 22:49:52 Failsafe Reset
...
09/07/2018 09:18:33 .\os\os_heap.c:191 check_heap: block size mismatch. 09/07/2018 09:18:40 Software Exception Reset
09/07/2018 10:14:21 Less than 100 bytes in stack, task 220
Link copied. Please paste this link to share this article on your social media post.
Create your free account or log in to subscribe to the board - and gain access to more than 10,000+ support articles along with insights from experts and peers.