Translate

Wednesday, July 16, 2014

Finding the Server Needle in the Infrastructure Haystack – Addressing Infrastructure Complexity



When it comes to managing a large complex data center and the server infrastructure within, many CIOs know they have severe challenges.  Some might even characterize their predicament as being akin to ordered chaos!  Most know where and what their business critical servers are and what they are doing.  These might comprise 10 to 20% of their overall IT infrastructure.  It’s the other 80 to 90% that few know where and what these systems do AND, more importantly, how they impact factors like capacity management (IT flexibility), response and recovery time objectives (DR), labor efficiency (costs), and more.  Wouldn’t it be nice if only there was some type of metric that would assist in understanding where major areas of complexity reside, how to deal with them, and how they compare with other IT environments of similar size, industry, and make up?

Good news!  There is now.  Using IT infrastructure data that the IBM Systems and Technology Group’s Lab Services team has amassed from 1,000 plus completed IT optimization assessments (ITOs) over the last 10 years, one can actually begin to calculate a metric we call the “Infrastructure Complexity Index” (ICI for short).  This ICI can be (and has been!) used to find and educate IT personnel on specific areas of IT complexity and the factors involved in addressing these areas.

The ICI metric itself is composed of a number of components, each of which can have an impact (positive or negative) on the level of IT infrastructure complexity being shown.  By recognizing the relative impact of each of these components and the effect on their ICI, a governing IT management team can advise, and make course corrections to simplification, standardization, and overall optimization of the IT environment.

The major components affecting the ICI metric include the following:

  • Server hardware vendor variation (count of unique hardware vendors in use) 
  •  Server hardware model variation (count of unique hardware models in use)
  • Server physical servers (count of unique serial numbered servers in use)
  • Server operating system vendor variation (count of unique O. S. vendors in use)
  • Server operating system version/releases (count of unique O. S. version/releases in use)
  • Server logical servers (count of unique logical servers using an O. S. in use)

Using these components together, one can calculate the ICI(1) for a given IT environment and share this with others in the organization as part of an overall ITO analysis and set of recommendations on how to decrease IT complexity and the resulting benefits of doing so.  In addition, the ICI can be used to benchmark your IT environment with another of similar size and type.

With the proper data in hand, one can even decompose the ICI down to the operating system (AIX vs Windows vs Linux), vendor (Oracle/SUN vs IBM/AIX vs HP/UX), or even platform type (Unix/RISC vs Linux/x86), thus providing further insights as to where deeper areas of infrastructure complexity might be impacting the previously mentioned areas of IT flexibility, recoverability, and cost.

Although currently focused on server infrastructure complexity, work is ongoing to expand the use of ICI to other areas of IT infrastructure complexity, including storage, network, and even software stacks.  For further clarification on the ICI and how it could be used for your situation, please contact the author (John F. Ryan jfryan1@us.ibm.com).

Thanks to John Ryan for his guest contribution!


(1)    The actual formula used to calculate the ICI is the intellectual property of the IBM and is currently under patent review.  Please contact the author for specifics on how to introduce the ICI to your situation

Tuesday, July 1, 2014

Small Things about Big Data




Common questions that bother most IT decision makers today are:

  • Do I need a Big Data solution?
  • How will my organization benefit from it?
  • Should I implement one now or wait for a more opportune time?
  • If I implement one, what is the scale I should look at?
Before answering these questions, it might be helpful to understand some “Small things about Big Data.”

An obvious requirement of any IT solution is the ability to provide reports on a regular basis. These reports represent anything from financial health, project status, and customer data to audit & security related information.  

Most times, organizations tend to generate reports during low peak periods to ensure their infrastructure is highly utilized and gives better ROI.  However, some of these reports if available at more regular intervals in a 24 hour period will benefit the organization much more than just generating them once in a 24 hour period.  This is nothing but Analytics.

Any workload that analyzes static or streaming data to produce reports useful for business & IT are essentially called “Analytic Workloads.”  

As evident from this case study at Coca Cola, timely delivery of reports can make a huge difference to an organization’s business and strategy.
http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?subtype=AB&infotype=PM&appname=STGE_TS_ZU_USEN&htmlfid=TSC03243USEN&attachment=TSC03243USEN.PDF
               
Therefore, for organizations to understand if they need to ride the Big Data wave or not, they first should identify the analytical pieces that form their IT backbone.  Important questions here would be:

  1. What are the critical reports generated?
  2. When are these reports generated? (time of day)
  3. Are they based upon accurate and timely data?
  4. If the information from these reports can be generated at a higher frequency or on-demand, will it enable quicker decision making or faster go-to market strategy?
  5. If the answer to 4 is, “Yes” for certain reports, what stops the organization from generating these reports at a higher frequency? 
If the answer to question 3 is “No,” the first step is to address data quality with the business.  This gets into data structure and data governance.  It also requires cooperation with the business as the data owners.

Moving down the list, most often, the organizations I work with say the answer to question 5 lies in availability of IT resources needed to generate these reports.   A common reason to generate reports in the night (or during low peak window) is to avoid performance issues during peak load and to use resources effectively.

To address this, I see the need for a thought process shift to view analytics as an investment that can help bring higher efficiency to an organization’s business, thereby staying ahead of the competition.  Business cases that are developed around the opportunities these insights will bring to the business tend to be the most successful ones.  They justify investing in the right tools to facilitate this versus trying to limp along with analytics workloads taking second priority over other workloads.  

From the standpoint of balancing optimization of resources with business value potential, my experience has been the best way to get better and timely analytical reports is to create a separate set of analytics workloads that can work in real-time using faster systems & storage.   Compute and Storage systems have matured in terms of performance to an extent where one can run real-time reports without impacting the performance of online workloads.  And if the data analyzed contain untapped insights, the business value usually justifies using this approach.

Having identified analytics workloads and the need, the next step is to understand “Why Big Data?”  For any organization that benefits from real-time data sources like social media, complaint records, and service requests etc., Big Data is the way to go. These data sources helps organizations arrive at quick decisions and react quickly to the market or customer needs.

A common example is a customer recording their experience in a public domain which can either help or harm an organization’s reputation. In such cases, when real-time data are made available to the customer service team, they can quickly step-in, understand the situation and help manage the expectation with the respective customer.   However, there are many more.  Customers use these insights to target marketing to customers more effectively, identify patent infringements, identify trademark misuse, improve product design, gauge overall product and brand sentiment, etc…  There are countless strong business uses.

The CIO can initiate the Big Data discussion with the business in business terms.  Often this is a way for the CIO to earn a “seat at the table” with the other business executives.  But, it’s good to be ready by thinking about some practical “small things about big data” before you do.


Article by Subramaniam Meenakshisundaram, a guest contributor and IBM Executive Consultant in the Systems Group Lab Services organization. 

Many thanks Subbu for your guest column!