Kevin Holman's System Center Blog

Posts in this blog are provided "AS IS" with no warranties, and confer no rights. Use of included script samples is subject to the terms specified in the Terms of Use.

UR1 for SCOM 2012 R2 – Step by Step


 

KB Article:   http://support.microsoft.com/kb/2904678/en-us

Download catalog site:  http://catalog.update.microsoft.com/v7/site/Search.aspx?q=2904678

 

Key fixes:

Issue 1 - An error occurs when you run the p_DataPurging stored procedure. This error occurs when the query processor runs out of internal resources and cannot produce a query plan.
Issue 2 - Data warehouse BULK INSERT commands use an unchangeable, default 30-second time-out value that may cause query time-outs.
Issue 3 - Many 26319 errors are generated when you use the Operator role. This issue causes performance problems.
Issue 4 - The diagram component does not publish location information in the component state.
Issue 5 - Renaming a group works correctly on the console. However, the old name of the group appears when you try to override a monitor or scope a view based on group.
Issue 6 - SCOM synchronization is not supported in the localized versions of Team Foundation Server.
Issue 7 - An SDK process deadlock causes the Exchange correlation engine to fail.
Issue 8 - The "Microsoft System Center Advisor monitoring server" reserved group is visible in a computer or group search.
Issue 9 - Multiple Advisor Connectors are discovered for the same physical computer when the computer hosts a cluster.
Issue 10 - A Dashboard exception occurs if the criteria that are used for a query include an invalid character or keyword.

Xplat updates:

Issue 1 - On a Solaris-based computer, an error message that resembles the following is logged in the Operations Manager log. This issue occurs if a Solaris-based computer that has many monitored resources runs out of file descriptors and does not monitor the resources. Monitored resources may include file systems, physical disks, and network adapters.
    errno = 24 (Too many open files)
Note The Operations Manager log is located at /var/opt/microsoft/scx/log/scx.log.
This issue occurs because the default user limit on Solaris is too low to allocate a sufficient number of file descriptors. After the rollup update is installed, the updated agent overrides the default user limit by using a user limit for the agent process of 1,024.

Issue 2 - If Linux Container (cgroup) entries in the /etc/mtab path on a monitored Linux-based computer begin with the "cgroup" string, a warning that resembles the following is logged in the agent log.
    Warning [scx.core.common.pal.system.disk.diskdepend:418:29352:139684846989056] Did not find key 'cgroup' in proc_disk_stats map, device name was 'cgroup'.
Note When this issue occurs, some physical disks may not be discovered as expected.

Issue 3 - Physical disk configurations that cannot be monitored, or failures in physical disk monitoring, cause failures in system monitoring on UNIX and Linux computers. When this issue occurs, logical disk instances are not discovered by Operations Manager for a monitored UNIX-based or Linux-based computer.

Issue 4 - A monitored Solaris zone that is configured to use dynamic CPU allocation with dynamic resource pools may log errors in the agent logs as CPUs are removed from the zone and do not identify the CPUs currently in the system. In rare cases, the agent on a Solaris zone with dynamic CPU allocation may hang during routine monitoring.  Note This issue applies to any monitored Solaris zones that are configured to use dynamic resource pools and a "dedicated-cpu" configuration that involves a range of CPUs.

Issue 5 - An error that resembles the following is generated on Solaris 9-based computers when the /opt/microsoft/scx/bin/tools/setup.sh script does not set the library path correctly. When this issue occurs, the omicli tool cannot run.
    ld.so.1: omicli: fatal: libssl.so.0.9.7: open failed: No such file or directory

Issue 6 - If the agent does not retrieve process arguments from the getargs subroutine on an AIX-based computer, the monitored daemons may be reported incorrectly as offline. An error message that resembles the following is logged in the agent log:
    Calling getargs() returned an error

Issue 7 - The agent on AIX-based computers considers all file cache to be available memory and does not treat minperm cache as used memory. After this update rollup is installed, available memory on AIX-based computers is calculated as: free memory + (cache – minperm).

Issue 8 - The Universal Linux agent is not installed on Linux computers that have OpenSSL versions greater than 1.0.0 if the library file libssl.so.1.0.0 does not exist. An error message that resembles the following is logged:
    /opt/microsoft/scx/bin/tools/.scxsslconfig: error while loading shared libraries: libssl.so.1.0.0: cannot open shared object file: No such file or directory
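
To make the AIX memory change in Xplat Issue 7 concrete, here is a minimal Python sketch of the formula quoted above, free memory + (cache - minperm). This is only an illustration and not the agent's actual code. The function name, the megabyte units, and the clamp that stops the cache term from going negative are my own assumptions.

```python
def aix_available_memory(free_mb: float, cache_mb: float, minperm_mb: float) -> float:
    """Available memory per the Issue 7 description: free + (cache - minperm)."""
    # File cache up to the minperm threshold is treated as used memory,
    # so only the cache above that threshold counts as available.
    # Assumption: clamp at zero so a cache smaller than minperm never
    # reduces the result below the free memory figure.
    reclaimable_cache_mb = max(cache_mb - minperm_mb, 0.0)
    return free_mb + reclaimable_cache_mb
```

With 512 MB free, 2048 MB of file cache, and a 300 MB minperm threshold, this yields 2260 MB. Counting all file cache as available, which is the pre-update behavior described above, would give 2560 MB.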

 

I have seen *several* customers having issues with the OpsDB grooming/purging process, so that looks like a good one to get implemented, especially if this was affecting you.

 

Let's get started.

From reading the KB article – the order of operations is:

 

  1. Install the update rollup package on the following server infrastructure:
    • Management servers
    • Gateway servers
    • Web console server role computers
    • Operations console role computers
  2. Apply SQL scripts (see installation information).
  3. Manually import the management packs.

There are no agent updates in this UR1.  Agents will still be placed into pending, but there is nothing to apply to them, so you must reject the agents in pending.

Now we need to add another step: if we are using Xplat monitoring, we also need to update the Unix/Linux MPs and agents.

4.  Update Unix/Linux MPs and Agents.

 

1.  Management Servers

Since there is no RMS anymore, it doesn’t matter which management server I start with.  There is no need to begin with whichever server holds the RMSe role.  I simply make sure I only patch one management server at a time to allow for agent failover without overloading any single management server.
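
If you script your patching, the "one management server at a time" rule could be sketched roughly like this in Python. Everything here is hypothetical: `apply_update` and `is_healthy` are placeholders for whatever you use to install the MSP and to confirm the server is back in service, and neither is part of any SCOM API.

```python
from typing import Callable, Iterable, List


def patch_one_at_a_time(
    servers: Iterable[str],
    apply_update: Callable[[str], None],  # hypothetical: installs the update on one server
    is_healthy: Callable[[str], bool],    # hypothetical: True once the server is back in service
) -> List[str]:
    """Patch servers strictly in sequence and stop at the first unhealthy one."""
    patched: List[str] = []
    for server in servers:
        apply_update(server)
        # Do not move on until this server is healthy again, so agents that
        # failed over never pile up on a shrinking set of servers.
        if not is_healthy(server):
            raise RuntimeError(f"{server} did not come back healthy; stopping the rollout")
        patched.append(server)
    return patched
```

The point of the sketch is the ordering: at most one management server is out of service at any moment, and a failure halts the rollout instead of taking down the next server.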

I can apply this update manually via the MSP files, or I can use Windows Update.  I have 3 management servers, so I will demonstrate both.  I will do the first management server manually.  This management server holds 3 roles, and each must be patched:  Management Server, Web Console, and Console.

The first thing I do when I download the updates from the catalog is copy the cab files for my language to a single location:

image

Then extract the contents:

image

Once I have the MSP files, I am ready to start applying the update to each server by role.

***Note:  You MUST log on to each server role as a Local Administrator and a SCOM Admin, AND your account must also have the System Administrator (SA) role on the database instances that host your OpsMgr databases.

My first server is a management server that also hosts the web console and has the OpsMgr console installed, so I copy those update files locally and execute them per the KB from an elevated command prompt:

image

This launches a quick UI which applies the update.  It will bounce the SCOM services as well.  The update does not report whether it succeeded or failed.  You can check the application log for the MsiInstaller events to confirm.

You can also spot check a couple of DLL files for the file version attribute.
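
If you want to automate that spot check, the comparison itself is simple. Below is a small Python sketch, under the assumption that you have already read the file version string from the DLL by some other means. The function names are mine, and any version numbers you pass in are your own values to look up, not numbers taken from this post.

```python
from typing import Tuple


def parse_file_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted file version such as '1.2.300.4' into a tuple of ints."""
    return tuple(int(part) for part in version.strip().split("."))


def is_patched(file_version: str, expected_minimum: str) -> bool:
    """True when the file version is at or above the expected patched version."""
    # Compare numerically, field by field. A plain string comparison would
    # wrongly rank a build ending in '.999' above one ending in '.1009'.
    return parse_file_version(file_version) >= parse_file_version(expected_minimum)
```

Call `is_patched()` with the version you read from the file and the version you expect after the update; a `False` result means that role still needs the MSP applied.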

image

 

Next up – run the Web Console update:

image

This runs much faster.   A quick file spot check:

image

Lastly – install the console update (make sure your console is closed):

image

A quick file spot check:

image

 

Secondary Management Servers:

I now move on to my secondary management servers, applying the server update, then the console update. 

On this next management server, I will use Windows Update.  I check online, and make sure that I have configured Windows Update to give me updates for additional products:

image

This shows me two applicable updates for this server:

image

I apply these updates (along with some additional Windows Server updates I was missing), and reboot each management server, until all management servers are updated.

 

Updating Gateways:

I can use Windows Update or manual installation.

image

The update launches a UI and quickly finishes.

Then I will spot check the DLLs:

image

 

2. Apply the SQL Script

In the path on your management servers, where you installed/extracted the update, there is a SQL script file: 

%SystemDrive%\Program Files\System Center 2012\Operations Manager\Server\SQL Script for Update Rollups

Open a SQL Server Management Studio query window, connect it to your Operations Manager database, and then open the script file.  Make sure it is pointing to your OperationsManager database, then execute the script.

****Note – at the time of this writing, the KB article says to run this against the DataWarehouse.  The KB article is in error; run the script against the OperationsManager database.

image

Click the “Execute” button in SQL mgmt. studio.  The execution could take a considerable amount of time and you might see a spike in processor utilization on your SQL database server during this operation.

You will see the following (or similar) output:

image

 

3. Manually import the management packs?

We have four updated MPs to import  (MAYBE!).

image

The TFS MP bundles are only used for specific scenarios, such as DevOps scenarios where you have integrated APM with TFS, etc.  If you are not currently using these MPs, there is no need to import or update them.  I’d skip this MP import unless you already have these MPs present in your environment.

The Advisor MPs are only needed if you are using System Center Advisor services.

However, the Image and Visualization libraries deal with Dashboard updates, and these need to be updated.

I import all of these without issue.

 

 

Reject the agent update

Agents are placed into pending actions by this update.  HOWEVER – there are no updates for the agents in the Update Rollup.  You must REJECT the agents in pending, using the console or PowerShell.

 

4.  Update Unix/Linux MPs and Agents

Next up – I download and extract the updated Unix/Linux MPs for SCOM 2012 R2:

http://www.microsoft.com/en-us/download/details.aspx?id=29696

7.5.101 is current at this time for SCOM 2012 R2. 

****Note – take GREAT care when downloading – that you select the correct download for R2.  You must scroll down in the list and select the MSI for 2012 R2:

image

 

Download the MSI and run it.  It will extract the MPs to C:\Program Files (x86)\System Center Management Packs\System Center 2012 R2 Management Packs for Unix and Linux\

Update any MPs you are already using.

image

You will likely observe VERY high CPU utilization on your management servers and database server during and immediately following these MP imports.  Give the import and the MPB deployments plenty of time to complete.

Next up – you would upgrade the agents on your monitored Unix/Linux computers.  You can now do this straight from the console:

image

image

You can input credentials or use existing RunAs accounts if those have enough rights to perform this action.

image

I have an environmental issue that caused the upgrade on my Ubuntu server to fail.

 

5.  Update the remaining deployed consoles

This is an important step.  I have consoles deployed around my infrastructure – on my Orchestrator server, on my personal workstation, on the workstations of the other SCOM admins on my team, on a Terminal Server we use as a tools machine, etc.  These should all get the UR1 update.

 

 

Review:

Now at this point, we would check the OpsMgr event logs on our management servers, check for any new or strange alerts coming in, and ensure that there are no issues after the update.

image

Known issues:

See the existing list of known issues documented in the KB article.

1.  Many people are reporting that the SQL script is failing to complete when executed.  You should attempt to run this multiple times until it completes without error.  You might need to stop the Exchange correlation engine, stop the services on the management servers, or bounce the SQL server services in order to get a successful completion in a busy management group.  The errors reported appear as below:

------------------------------------------------------

(1 row(s) affected)

(1 row(s) affected)

Msg 1205, Level 13, State 56, Line 1

Transaction (Process ID 152) was deadlocked on lock resources with another process and has been chosen as the deadlock victim. Rerun the transaction.

Msg 3727, Level 16, State 0, Line 1

Could not drop constraint. See previous errors.

--------------------------------------------------------
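
The "run it again until it completes" advice can be sketched as a small retry loop. This Python example is only an outline under stated assumptions: `run_script` is a hypothetical callable that executes the SQL script through whatever client you use, and `SqlScriptError` is a made-up exception type that carries the SQL Server message number. Only the number 1205 comes from the output above.

```python
import time
from typing import Callable

DEADLOCK_ERROR = 1205  # "Msg 1205" in the output above: chosen as the deadlock victim


class SqlScriptError(Exception):
    """Hypothetical error type carrying the SQL Server message number."""

    def __init__(self, number: int, message: str) -> None:
        super().__init__(message)
        self.number = number


def run_until_clean(
    run_script: Callable[[], None],  # hypothetical: executes the update rollup SQL script
    max_attempts: int = 5,
    delay_seconds: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Rerun the script after deadlocks; return the attempt number that succeeded."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            run_script()
            return attempt
        except SqlScriptError as err:
            # Retry only on deadlocks; any other error, or running out of
            # attempts, is passed back to the caller unchanged.
            if err.number != DEADLOCK_ERROR or attempt == max_attempts:
                raise
            sleep(delay_seconds)
    raise AssertionError("unreachable")
```

The attempt count and the 30 second pause are arbitrary defaults. If the script still deadlocks after several tries, stopping the Exchange correlation engine or the management server services, as described above, is the next thing to try.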

Comments
  • Hi Kevin, Thanks for such a detailed post. However, you've specified at the top of the article that you need to reject the pending agent updates, but then specify further down: "You must approve the agents in pending, using the console or PowerShell".

  • Fixed - thanks

  • I got repeated locks running the sql script (stopped/restarted SC services, restarting sql etc.). I then stopped the exchange correlation service but the sql script was running while I did that .. I had thought the script had finished its run ... that run of the sql script succeeded. So, in summary I can't definitively say stopping the exchange correlation service was the fix, but if anyone else has the same issue, try that first, stop the service and run the script, and if that doesn't work, maybe run the sql script several times? HTH.

  • Hi Kevin, why should I reject the pending agents? I already applied the RU and approved the agents! Would this have a bad impact?

  • Hi Kevin, Super post as always (only wish I had seen this before doing the rollup following MS instructions). As per Metto's post, there was no mention of "DO Not Approve Pending Agents" in the article from MS, so I did this for all 900 of my agents. You said DO NOT UPDATE; is this going to be an issue for my environment? I am also having the SQL script error, so I am planning to stop all MGMT server services to get the final part complete (as this is the only one not run). Regards, Ash

  • There is no problem if you do approve agent updates - it won't really update anything.... it's just a waste of time. I think it will reinstall the agent binaries at worst, you'd have to check the application log on an approved agent to verify. There is no agent update in this UR, however, the mechanism to put all agents into pending is handled whenever the management server gets an update.... a different part of the update code.

  • Thanks !

  • Thanks Kevin :)

  • Hi Kevin, When opening the scom web console in SSL, after applying UR1 i get this message: System.ServiceModel.CommunicationException: [HttpWebRequest_WebException_RemoteServer] Arguments: NotFound Debugging resource strings are unavailable. Often the key and arguments provide sufficient information to diagnose the problem. See http://go.microsoft.com/fwlink/?linkid=106663&Version=5.1.20913.0&File=System.Windows.dll&Key=HttpWebRequest_WebException_RemoteServer ---> System.Net.WebException: [HttpWebRequest_WebException_RemoteServer] Arguments: NotFound Debugging resource strings are unavailable. Often the key and arguments provide sufficient information to diagnose the problem. See http://go.microsoft.com/fwlink/?linkid=106663&Version=5.1.20913.0&File=System.Windows.dll&Key=HttpWebRequest_WebException_RemoteServer ---> System.Net.WebException: [HttpWebRequest_WebException_RemoteServer] Arguments: NotFound Debugging resource strings are unavailable. Often the key and arguments provide sufficient information to diagnose the problem. 
See http://go.microsoft.com/fwlink/?linkid=106663&Version=5.1.20913.0&File=System.Windows.dll&Key=HttpWebRequest_WebException_RemoteServer at System.Net.Browser.BrowserHttpWebRequest.InternalEndGetResponse(IAsyncResult asyncResult) at System.Net.Browser.BrowserHttpWebRequest.<>c__DisplayClassa.b__9(Object sendState) at System.Net.Browser.AsyncHelper.<>c__DisplayClass4.b__0(Object sendState) --- End of inner exception stack trace --- at System.Net.Browser.AsyncHelper.BeginOnUI(SendOrPostCallback beginMethod, Object state) at System.Net.Browser.BrowserHttpWebRequest.EndGetResponse(IAsyncResult asyncResult) at System.ServiceModel.Channels.HttpChannelFactory.HttpRequestChannel.HttpChannelAsyncRequest.CompleteGetResponse(IAsyncResult result) --- End of inner exception stack trace --- at System.ServiceModel.AsyncResult.End[TAsyncResult](IAsyncResult result) at System.ServiceModel.Channels.ServiceChannel.EndCall(String action, Object[] outs, IAsyncResult result) at System.ServiceModel.ClientBase`1.ChannelBase`1.EndInvoke(String methodName, Object[] args, IAsyncResult result) at Microsoft.EnterpriseManagement.Presentation.Security.ServiceProxies.LogonServiceClient.LogonServiceClientChannel.EndGetConfiguration(IAsyncResult result) at Microsoft.EnterpriseManagement.Presentation.Security.ServiceProxies.LogonServiceClient.Microsoft.EnterpriseManagement.Presentation.Security.ServiceProxies.ILogonService.EndGetConfiguration(IAsyncResult result) at Microsoft.EnterpriseManagement.Presentation.Security.ServiceProxies.LogonServiceClient.OnEndGetConfiguration(IAsyncResult result) at System.ServiceModel.ClientBase`1.OnAsyncCallCompleted(IAsyncResult result) When i open the console with l http it works fine, but before applying the update it also worked with ssl. Any ideas?

  • Hi, after doing this I noticed I can't add any more servers to the advisor. if I click add server/groups the search window opens without any servers or groups. Any ideas?

  • Has anyone seen after an R2 upgrade that their Request Counts are missing from AppAdvisor reports? They just show up as N/A?

  • Thanks Kevin!

    After the rollup, I'm having error 31551 and I couldn't find a solution for it..anybody else with the same error after the rollup?



  • @Yasar - can you post the details of a 31551 event?

  • I'm sorry that I didn't post the details of the event

    Log Name: Operations Manager
    Source: Health Service Modules
    Date: 20-3-2014 12:45:57
    Event ID: 31551
    Task Category: Data Warehouse
    Level: Error
    Keywords: Classic
    User: N/A
    Computer: *******
    Description:
    Failed to store data in the Data Warehouse. The operation will be retried.
    Exception 'InvalidOperationException': The given value of type String from the data source cannot be converted to type nvarchar of the specified target column.

    One or more workflows were affected by this.

    Workflow name: Microsoft.SystemCenter.DataWarehouse.CollectAlertData
    Instance name: 655a7694-1e54-44ad-8fd1-8b3977f57320
    Instance ID: {99432F95-4CA3-DC77-B55E-C145CA0F4176}
    Management group: ****BG
