We’re running EMC Control Center 6.1 UB12, and we use it primarly for it’s robust performance data collection and reporting capabilities. Performance Manager is a great tool and I use it frequently.
Over the years I’ve had occasional issues with the WLA Archives not collecting performance data and I’ve had to open service requests to get it fixed. Now that I’ve been doing this for a while, I’ve collected enough info to troubleshoot this issue and correct it without EMC’s assistance in most cases.
Check your ..\WLAArchives\Archives directory and look under the Clariion (or Celerra) folder, then the folder with your array’s serial number, then the interval folder. This is where the “*.ttp” (text) and “*.btp” (binary) performance data files are stored for Performance Manager. Sort by date. If there isn’t a new file that’s been written in the last few hours data is not being collected.
Here are the basic items I generally review when data isn’t being collected for an array:
- Log in to every array in Unisphere, go to system properties, and on the ‘General’ tab make sure statistics logging is enabled. I’ve found that if you don’t have an analyzer license on your array and start the 7 day data collection for a “naz” file, after the 7 days is up the stats logging option will be disabled. You’ll have to go back in and re-enable it after the 7 day collection is complete. If stats logging isn’t enabled on the array the WLA data collection will fail.
- If you recently changed the password on your clarion domain account, Make sure that naviseccli is updated properly for security access to all of your arrays (use the “addusersecurity” CLI option) and perform a rediscovery of all your arrays as well from within the ECC console. There is no way from within the ECC console to update the password on an array, you must go through the discovery process again for all of them.
- Verify the agents are running. In the ECC console, click on the gears icon in the lower right hand corner. It will create a window that shows the status of all the agents, including the WLA Archiver. If WLA isn’t started, you can start it by right clicking on any array, choosing Agents, then start. Check the WLAArchives directories again (after waiting about an hour) and see if it’s collecting data again.
If those basic steps don’t work, checking the logs may point you in the right direction:
- Review the Clariion agent logs for errors. You’re not looking for anything specific here, just do a search for “error”, “unreachable” or for the specific IP’s of your arrays and see if there is anything obvious wrong.
Here’s an example of an error I found in one case:MGL 14:10:18 C P I 2536 (29.94 MB) [MGLAgent::ProcessAlert] => Processing SP Unreachable alert. MO = APM00100600999, Context = Clariion, Category = SP Element = Unreachable
2. Review the WLA Agent logs. Again, just search for errors and see if there is anything obvious that’s wrong.%ECC_INSTALL_ROOT%\exec\ENW610\ENW.log %ECC_INSTALL_ROOT%\exec\ENW610\ENW_Bx.log.gz %ECC_INSTALL_ROOT%\exec\ENW610\ENW.ini %ECC_INSTALL_ROOT%\exec\ENW610\ENW_Err.log %ECC_INSTALL_ROOT%\exec\ENW610\ENW_Bx_Err.log
If the logs don’t show anything obvious, here are the steps I take to restart everything. This has worked on several occasions for me.
- From the Control Center console, stop all agents on the ECC Agent server. Do this by right clicking on the agent server (in the left pane), choose agents and stop. Follow the prompts from there.
- Log in to the ECC Agent server console and stop the master agent. You can do this in Computer Management | Services, stop the service titled “EMC ControlCenter Master Agent”.
- From the Control Center console, stop all agents on the Infrastructure server. Do this by right clicking on the agent server (in the left pane), choose agents and stop. Follow the prompts from there.
- Verify that all services have stopped properly.
- From the ECC Agent server console, go to C:\Windows\ECC\ and delete all .comfile and .lck files.
- Restart all agents on the Infrastructure server.
- Restart the Master Agent on the Agent server.
- Restart all other services on the Agent server.
- Verify that all services have restarted properly.
- Wait at least an hour and check to see if the WLA Archive files are being written.
If none of these steps resolve your problem and you don’t see any errors in the logs, it’s time to open an SR with EMC. I’ve found the EMC staff that supports ECC to be very knowledgeable and helpful.