How to Succeed with Self-Service Analytics, Part V: Tools and Technologies

Succeeding with Self-Service Analytics - Part V

Read - How to Succeed with Self-Service Analytics, Part I: Pitfalls and Paradoxes

Read - How to Succeed with Self-Service Analytics, Part II: Know Thy Customer

Read - How to Succeed With Self-Service Analytics, Part III: Trust But Verify New Reports

Read - How to Succeed With Self-Service Analytics, Part IV: Roles, Teams, and Push-Down Development

So far in this series, we’ve focused on the importance of governance processes and organizational models to achieve success with self-service analytics. But tools and technologies also play an important role.

For decades, software vendors proclaimed the dawning of the era of self-service analytics. The reality has rarely matched the hype. Data analytics products have always been harder to configure and use than most users anticipate. But now that’s changed.

The advent of machine learning (ML) has dramatically changed the equation for self-service analytics. ML has made data and analytics tools both more powerful and easier to use for a wide spectrum of users, automating tasks that once took one or more experts to accomplish. These tools turn business users into citizen data analysts and data analysts into citizen data scientists and data engineers.


The advent of machine learning has dramatically changed the equation for self-service analytics.


Self-service tools have permeated every aspect of data analytics. Both casual and power users have benefitted. (See figure 1.)

Figure 1. The Spectrum of AI-Powered Data Analytics Tools

Casual User Tools. Data consumers now have AI-powered reporting and dashboard tools that automatically generate analyses in the background, displaying the most relevant relationships, including correlations, anomalies, and trends detected in a selected chart, dashboard, or data set. Data explorers can now generate natural language queries by speaking or writing words into a search box as a way to kickstart an analysis. They also can create dashboards by pinning the results to a display. (For more information about these types of tools, see “AI is the New BI: How Algorithms are Transforming Business Intelligence and Analytics”.)

For casual users, these AI-infused tools are like having a personal data analyst in their pocket.

They enable casual users to take on some of the functions of a data analyst, allowing data analysts to focus on more complex business problems.

Power User Tools. Power users are perhaps the biggest beneficiaries of the self-service movement. In the past, power users relied on the IT department for data, and when that took too long, they used Excel and Access to query databases and stitch together the information into a report. The process was cumbersome and time-consuming and left little time for analysis.

Today, self-service tools empower data analysts to do the work formerly done by BI developers, data engineers, data scientists and application developers. For instance, data catalogs enable data analysts to find, profile and annotate relevant data sets.  New data preparation tools make it easy to combine data sources, clean and transform data, and share these workflows with others. And new visual analysis tools make it easy for data analysts to query data, visualize and analyze the results, compile into a dashboard, and publish for others to consume. (See “Self-Service Triumvirate: The New Data Analyst Workbench.”)

Data analysts who aspire to become data scientists now have self-service tools that make it easy to build predictive and prescriptive models. These so-called AutoML tools provide a cloud-based, graphical environment for building and deploying machine learning models, including automated data prep, feature generation, and model-build selection. The tools profile a data set, automatically select algorithms to test and apply the appropriate data preparation routines for each selected algorithm. Most come with built-in APIs to score the models against production data and monitor the results over time.

Finally, data analysts can embed charts, tables, and an entire self-service environment into other applications (e.g., portals, cloud applications, custom or mobile applications) with new “low-code” or “no-code” tools. They can also white-label a data analytics tool using a point-and-click interface or by configuring cascading style sheets.

Self-Service Shifts

The advent of powerful new self-service tools is shifting what business and technical users can do with data. Soon, this shift will turn into an expectation written into job descriptions and performance reviews. (See figure 2.)

Figure 2. AI Technology Expands the Range of Self-Service Capabilities

For example, data consumers will be expected to query and analyze data and modify dashboards thanks to new AI-enabled BI tools. Data explorers will be expected to create new dashboards or modify analytics applications with new queries. Data analysts will reallocate time currently spent finding and preparing data for more value-added activities, such as analyzing data and creating lightweight ML models, thanks to a host of tools mentioned above. Data scientists will create their own data pipelines and deploy models into production thanks to codeless data pipelining tools. And data engineers will focus more time building a robust, self-service oriented enterprise data platform rather than creating data pipelines for data analysts and data scientists.

The Impact of a Self-Service Data Platform

The most impactful way organizations can facilitate self-service analytics is to build a robust data platform that provides domain-specific data sets and is geared to empower business and technical users. As a data platform becomes more robust, business users and technical developers become more self-sufficient. They do more work themselves without help from an upstream specialist. This frees up resources to focus on more value-added activities. (See figure 3.)

Figure 3. Self-Service Increases as a Data Platform Matures

As a data platform supports more shared metrics and domain models, the more self-sufficient business and technical users become. 

Data Refinery. A self-service platform provides appropriate access points for each type of business and technical user. To do this, the data team creates a “data refinery” that transforms raw data into various types of assets. The landing zone contains raw data extracted from a host of sources; the integrated zone combines data into flat, subject-area building blocks; a dimensional zone converts that data into a dimensional model with conformed dimensions and trusted data, and a domain zone creates a department-specific view of data pulled from the other three zones. (See figure 4.)

Figure 4. Modern Data Architecture with a Data Refinery

Business users armed with dashboards, reports, analytic applications, and AI-based search tools access data through specially crafted business views that run against data in the refinery. Power users equipped with a bevy of self-service tools and appropriate permissions use a data catalog to find appropriate data in the refinery, a sandbox to explore that data, and data prep tools to extract and transform the data into custom data sets for analysis. A well-designed data platform makes self-service tools more powerful and users more productive.

Summary

Technology plays a critical role in making self-service a reality. The advent of AI-infused products has changed the equation for self-service analytics. As powerful as new products are, organizations need to first implement governance processes and appropriate organizational models before turning to technology. Then, they can reap huge benefits from the tailored deployment of self-service tools, technology, and data platforms.

Wayne Eckerson

Wayne Eckerson is an internationally recognized thought leader in the business intelligence and analytics field. He is a sought-after consultant and noted speaker who thinks critically, writes clearly and presents...

More About Wayne Eckerson