Operational analytical processing

From HandWiki

Operational analytical processing, or more popularly known as operational analytics, is a subset of data analytics that focuses on improving the operational nature of a business or entity. The main characteristic that distinguishes operational analytics from other types of analytics is that it is analytics on the fly,[1] which means that signals emanating from various parts of a business are processed in real-time to feed back into instant decision-making for the business. Some people refer to this as "continuous analytics," which is another way to emphasize the continuous digital feedback loop that can exist from one part of a business to its other parts.

Overview

The rapid digital transformation of many businesses means that an increasing number of business signals are being recorded and stored in digital form. Businesses are using these signals to improve their efficiency, improve their performance and provide better experiences to their users and customers. A Forrester Report[2] details how digitization of a business is impacting its customer experiences by leveraging data. Operational analytics allows you to process various types of information from different sources and then decide what to do next: what action to take, whom to talk to, what immediate plans to make. Gartner defines this as Continuous Intelligence[3] in a research report and goes on to describe this as a design pattern in which real-time analytics are integrated within a business operation, processing current and historical data to prescribe actions in response to events. Andreessen Horowitz[4][5] describes this as ...more and more decisions are automated away altogether—think of Amazon continually updating prices for its products throughout the day. This form of analytics has become popular with the digitization trend in almost all industry verticals, because it is digitization that furnishes the data needed for operational decision-making.

A few examples of operational analytics include... a product manager who looks at product-usage logs to determine which features of the product are liked by its users, which features slow them down, and which features are disliked by its users. The product manager can gather all these answers by querying data that records usage patterns from the product's user base; and he or she can immediately feed that information back to make the product better. Similarly, in the case of marketing analytic in the pre-digitized world, a marketing manager would organize a few focus groups, try out a few experiments based on their own creativity and then implement them. Depending on the results of experimentation, they would then decide what to do next. An experiment may take weeks or months. In the digitized world, there is the "marketing engineer," a person who is well-versed in using data systems. These marketing engineers can run multiple experiments at once, gather results from experiments in the form of data, terminate the ineffective experiments and nurture the ones that work, all through the use of data-based software systems. The more experiments they can run and the quicker the turnaround times of results, the better their effectiveness in marketing their product.

An MIT Technology Review article[6] describes how a ride-sharing application uses algorithms for real-time monitoring of traffic and trip times to balance demand and supply for ride sourcing—and to adjust fees accordingly and rapidly. The use of operations analytics is not confined to the field of information technology. Data from business intelligence, finance, science, weather, and even current events are combined and then analyze together to extract valuable insight from it, and this in turn, drives quick decision making in almost every conceivable use. A metrics collection system like Scuba[7] is an operational analytics system because it is used extensively for interactive, ad hoc, analysis queries that run in under a second over live data.

Definition of an operational analytics processing engine

The definition of an operational analytics processing engine (OPAP) [8] can be expressed in the form of the following six propositions:

  1. Complex queries: Support for queries like inner & outer joins, aggregations, sorting, relevance, etc.
  2. Low data latency: An update to any data record is visible in query results in under than a few seconds.
  3. Low query latency: A simple search query returns in under a few milliseconds.
  4. High query volume: Able to serve at least a few hundred concurrent queries per second.
  5. Live sync with data sources: Ability to keep itself in sync with various external sources without having to write external scripts. This can be done via change-data-capture of an external database, or by tailing streaming data sources.
  6. Mixed types: Allows values of different types in the same column. This is needed to be able to ingest new data without needing to manipulate them at write time.

System requirements

Operational Analytics is a subset of the broader set of processes that characterizes OLAP (online analytical processing). As such, it inherits the large data sizes and complex queries that OLAP systems typically has to handle. However, the characteristics that uniquely identify operational analytics is the requirement for quick predictions based on most recent signals. This means that the data latency and query latency are very small. For example, operational analytics applied to real time business processes specify that data latency be zero. It also means that queries should be fast and finish at interactive speeds. Because these decisions are taken at a micro-level and are very personalized[9] to each individual entity, operational analytics processing is characterized by how easy it is to deliver personalized recommendations using such a system.

References

  1. "What is Operational Analytics? - Definition from Techopedia" (in en). https://www.techopedia.com/definition/29495/operational-analytics. 
  2. Forrester (Feb 26, 2018). "Digital Rewrites the Rules of Business". http://partners.lantelligence.com/wp-content/uploads/2018/07/Forrester-Digital-Rewrites-The-Rules-Of-Business.pdf. 
  3. "Gartner Identifies Top 10 Data and Analytics Technology Trends for 2019" (in en). https://www.gartner.com/en/newsroom/press-releases/2019-02-18-gartner-identifies-top-10-data-and-analytics-technolo. 
  4. Naous, Jad (2019-05-16). "Everyone is an Analyst: Opportunities in Operational Analytics" (in en-US). https://a16z.com/2019/05/16/everyone-is-an-analyst-opportunities-in-operational-analytics/. 
  5. Naous, Jad; Chen, Frank (2019-04-13). "Re-tooling How Organizations Make Decisions" (in en-US). https://a16z.com/2019/04/13/re-tooling-how-organizations-make-decisions/. 
  6. "Data Analytics and Machine Learning: Driving Speed to Insight" (in en-US). https://www.technologyreview.com/s/602405/data-analytics-and-machine-learning-driving-speed-to-insight/. 
  7. "Scuba: Diving into Data at Facebook" (in en-US). https://research.fb.com/publications/scuba-diving-into-data-at-facebook/. 
  8. "Operational Analytics: What every software engineer should know about low-latency queries on large data sets" (in en). https://rockset.com/blog/operational-analytics-what-every-software-engineer-should-know/. 
  9. "Personalization" (in en), Wikipedia, 2019-08-07, https://en.wikipedia.org/w/index.php?title=Personalization&oldid=909724504, retrieved 2019-08-11