Help
  • Explore Community
  • Get Started
  • Ask the Community
  • How-To & Best Practices
  • Contact Support
Notifications
Login / Register
Community
Community
Notifications
close
  • Forums
  • Knowledge Center
  • Events & Webinars
  • Ideas
  • Blogs
Help
Help
  • Explore Community
  • Get Started
  • Ask the Community
  • How-To & Best Practices
  • Contact Support
Login / Register
Sustainability
Sustainability

Join our "Ask Me About" community webinar on May 20th at 9 AM CET and 5 PM CET to explore cybersecurity and monitoring for Data Center and edge IT. Learn about market trends, cutting-edge technologies, and best practices from industry experts.
Register and secure your Critical IT infrastructure

Failsafe Reset on AP9631

APC UPS Data Center & Enterprise Solutions Forum

Schneider, APC support forum to share knowledge about installation and configuration for Data Center and Business Power UPSs, Accessories, Software, Services.

cancel
Turn on suggestions
Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.
Showing results for 
Show  only  | Search instead for 
Did you mean: 
  • Home
  • Schneider Electric Community
  • APC UPS, Critical Power, Cooling and Racks
  • APC UPS Data Center & Enterprise Solutions Forum
  • Failsafe Reset on AP9631
Options
  • Subscribe to RSS Feed
  • Mark Topic as New
  • Mark Topic as Read
  • Float this Topic for Current User
  • Bookmark
  • Subscribe
  • Mute
  • Printer Friendly Page
Invite a Co-worker
Send a co-worker an invite to the portal.Just enter their email address and we'll connect them to register. After joining, they will belong to the same company.
You have entered an invalid email address. Please re-enter the email address.
This co-worker has already been invited to the Exchange portal. Please invite another co-worker.
Please enter email address
Send Invite Cancel
Invitation Sent
Your invitation was sent.Thanks for sharing Exchange with your co-worker.
Send New Invite Close
Top Experts
User Count
BillP
Administrator BillP Administrator
5060
voidstar_apc
Janeway voidstar_apc
196
Erasmus_apc
Sisko Erasmus_apc
112
TheNotoriousKMP_apc
Sisko TheNotoriousKMP_apc
108
View All

Invite a Colleague

Found this content useful? Share it with a Colleague!

Invite a Colleague Invite
Solved Go to Solution
Back to APC UPS Data Center & Enterprise Solutions Forum
Solved
Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
15
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

Failsafe Reset on AP9631

I have a SMT1500RM2U UPS with an AP9631 at a customer site. The AP9631 has been restarting with "Failsafe Reset" every few days. The NMC2 has been running 6.4.0 since it was installed. The UPS had been at ID18 9.2 and was updated this past weekend to 9.3 in case that was the problem.

The start of the dump.txt file is:

07/26/2016 11:42:05 Failsafe Reset
Specific code = 201


AOS v6.4.0 sumx v6.4.0
Serial Number: 5A1116T0xxxx
AOS Binary Date/Time: Dec 18 2015 15:04:27
APP Binary Date/Time: Dec 18 2015 15:14:26

Task Dump Task ID 167

OSIntNesting 1
inUioFlag 0
uioErr 0

Current stack at _SS:_SP 03ad: 2510

The complete dump.txt is attached. Can you take a look at this and let me know if it looks like a hardware problem or a software bug, and what steps I should take to investigate further?

 

Labels
  • Labels:
  • UPS Management Devices & PowerChute Software
Reply

Link copied. Please paste this link to share this article on your social media post.

  • All forum topics
  • Previous Topic
  • Next Topic

Accepted Solutions
Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

2 Symmetra 1P, 2 Matrix, dozens of various generations of Smart-UPS, and 1 microlink UPS. Unfortunately, the only AP933x cards are in one of the Symmetras and the microlink UPS. Everything else is AP961x.

See Answer In Context

Reply

Link copied. Please paste this link to share this article on your social media post.

Replies 15
BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-01-31 02:57 AM

Hi Terry, we need the entire .tar file/bundle please. You can sanitize it too first before posting but dump.txt in conjunction with the config, event, and debug.txt is most helpful to debug why this is occurring.

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

Here you go. It isn't worth unzipping it to sanitize.

 

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:56 AM . Last Modified: ‎2024-02-14 02:37 AM

Bump

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Sorry Terry, I've been on vacation. I'll try to look at this later today while playing catch up.

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Hi Terry,

In looking at this, we need to see what was happening in the event log at the same time usually as the dump is only kept from the last reboot. So, the event log only goes back to 7/22 and we won't be able to research these:

07/03/2016 06:41:23 Failsafe Reset
07/11/2016 08:41:31 Failsafe Reset

07/23/2016 14:03:36 Netsafe Reset - Netsafe reset is the the watchdog mechanism that the NMC uses to reboot itself if network traffic is too little or too much in an effort to rule out any problems with itself not being able to talk on the network. This is normal behavior.

07/26/2016 11:42:05 Failsafe Reset - this one I will investigate a little more and update you when I know anything further.

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

Right - I'm interested in the last one. The netsafe was expected as I was working on the network at the time.

Thanks!

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Hi Terry,

This looks to potentially be something related to the email format task/process. It is possible something else caused this task to crash (most likely) or there is a problem specifically with this task somewhere. I haven't seen this specifically before yet either.

One question was, were all of the expected emails received when logging in via FTP around 11:42 on 7/26 (which is shown in the event log)?

I would see if this crash can be replicated frequently and we can log a bug on it but it may be a needle in a haystack considering the NMC being a real time OS and whatever was happening on the system at the exact time could've contributed to a one off. But, if we replicate it multiple times and all of the failsafes generate the same dump.txts, then I certainly will log a bug and provide the log files for review to see what can be done in the future releases.

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

The FTP script runs hourly on all of my APC devices (close to 100 or so, I'd say, of which maybe 6 are NMC2) to track configuration changes. It is not expected that the FTP script would cause any email to be sent by the NMC.

What normally happens with these resets (which I am only seeing on this one device) is that I get a flurry of emails from the UPS about the NMC restarting, discovering the UIO probes, connecting with the UPS, and so on. My SNMP script may report a temporary inability to reach the device (it polls every 5 minutes) and I may get a "configuration change" alert from my FTP monitoring script where I will get a "UPS not discovered" change:

 

There was another reset on the 30th. I'm attaching that debug file. Generally, I can get you a new one every couple of days. tongue-out

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

I should probably add that this is the only "modern" Smart-UPS that I have (with the blue LCD display). All of my other units are either LED-only Smart-UPS units or Symmetra and Matrix units. So if it is related to the new-style UPS and NMC protocol, then I wouldn't see it on any other unit.

If necessary, I can format / update / configure a replacement NMC2 and send it to the remote site for someone to swap, just to confirm it isn't a hardware problem in that card. But I'd rather keep collecting diagnostic information with this one, as long as it leads to a resolution.

 

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Hi Terry,

We don't feel this is a hardware problem and I'd assume you'd agree with that if you did end up sending them a replacement card to compare. I was going to ask what's different about this card configuration-wise versus your others but you've sort of mentioned that now - the UPS model is micro-link but other config items are pretty much the same (right?).

We do have a similar bug logged already that was under investigation and I am thinking to add your specific debug files to. It appears to potentially be a resource utilization problem depending on the specific environment or configuration (including UPS type), what tasks are running and what their priority is. 

There is not a smoking gun in your log files where I can give you a straight answer unless you know it started right after you enabled/disabled something or made a change somewhere. I imagine someone needs to look closely at your configuration and log files and comb through how tasks are prioritized to make sure resources are managed as efficiently as possible to avoid failsafes.

The other thing we can look at and note in the bug is the specific pattern - does it happen every X hours on the dot? Always after an FTP log in? Are the dump.txt files always identical with the same codes/tasks mentioned?

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

Right - this is the only micro-link UPS I have. Other than that, the config is the same as I use on a number of other NMC2 cards.

It has been happening randomly - since all the timestamps on the crashes are around xx:4x, I'd say that was the FTP login.

It doesn't happen on any specific timeframe. The recent history is:

07/03/2016 06:41:23 Failsafe Reset

07/11/2016 08:41:31 Failsafe Reset

07/23/2016 14:03:36 Netsafe Reset

07/26/2016 11:42:05 Failsafe Reset

07/27/2016 13:41:36 Failsafe Reset

07/29/2016 22:41:45 Failsafe Reset

07/30/2016 01:41:13 Failsafe Reset

I just started monitoring this UPS on 07/02/2016, so the failsafe resets started soon after that. To compare, a different UPS has been running since March 2015 and has never had a failsafe reset, being monitored via the same script. The FTP script is pretty simple - it just logs into the NMC, gets the config.ini file, and logs out. You can find it at ftp://ftp.shrubbery.net/pub/rancid/contrib/rancid-apc.tar.gz

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Hi Terry,

Can you share a .tar of one of these different, older style UPSs for comparison, which do not show any issue?

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

Here you go. This is a Symmetra, but as the bug is probably in AOS and not the APP file, hopefully it will be helpful.

Reply

Link copied. Please paste this link to share this article on your social media post.

BillP
Administrator BillP Administrator
Administrator

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

0 Likes
0
5821
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-01-31 02:57 AM

Thanks Terry. It seems to be resources related as I noted so ideally, we'd want something using sumx app to compare apples to apples in case it is specific to resources in the sumx app specifically since some tasks across sumx and sy apps may be different. Or, are all your other UPS units Symmetras?

Reply

Link copied. Please paste this link to share this article on your social media post.

Terry_Kennedy_apc
Commander Terry_Kennedy_apc
Commander

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

0 Likes
0
5822
  • Mark as New
  • Bookmark
  • Subscribe
  • Mute
  • Subscribe to RSS Feed
  • Permalink
  • Print
  • Email to a Friend
  • Report Inappropriate Content

Link copied. Please paste this link to share this article on your social media post.

Posted: ‎2021-07-26 02:57 AM . Last Modified: ‎2024-02-14 02:37 AM

2 Symmetra 1P, 2 Matrix, dozens of various generations of Smart-UPS, and 1 microlink UPS. Unfortunately, the only AP933x cards are in one of the Symmetras and the microlink UPS. Everything else is AP961x.

Reply

Link copied. Please paste this link to share this article on your social media post.

Preview Exit Preview

never-displayed

You must be signed in to add attachments

never-displayed

 
To The Top!

Forums

  • APC UPS Data Center Backup Solutions
  • EcoStruxure IT
  • EcoStruxure Geo SCADA Expert
  • Metering & Power Quality
  • Schneider Electric Wiser

Knowledge Center

Events & webinars

Ideas

Blogs

Get Started

  • Ask the Community
  • Community Guidelines
  • Community User Guide
  • How-To & Best Practice
  • Experts Leaderboard
  • Contact Support
Brand-Logo
Subscribing is a smart move!
You can subscribe to this board after you log in or create your free account.
Forum-Icon

Create your free account or log in to subscribe to the board - and gain access to more than 10,000+ support articles along with insights from experts and peers.

Register today for FREE

Register Now

Already have an account? Login

Terms & Conditions Privacy Notice Change your Cookie Settings © 2025 Schneider Electric

This is a heading

With achievable small steps, users progress and continually feel satisfaction in task accomplishment.

Usetiful Onboarding Checklist remembers the progress of every user, allowing them to take bite-sized journeys and continue where they left.

of