What Is Data Analytics?

Last updated June 14, 2026

Key takeaways

Data analytics is the practice of examining, cleaning, modeling, and interpreting data to answer questions and support decisions — turning raw rows into insight people can act on.
It's usually split into four types that answer different questions: descriptive (what happened), diagnostic (why), predictive (what's likely next), and prescriptive (what to do about it).
The workflow is a loop, not a line: define the question, collect and clean the data, model and analyze it, interpret the result, and feed what you learn back into the next question.
The tool stack has layers — a warehouse for storage and compute (Snowflake, BigQuery, Redshift, Databricks), a semantic layer for governed metric definitions, and BI, notebooks, or embedded apps that consume them.
Analytics is shifting from people driving dashboards to AI agents doing analytical work; the catch is that an LLM pointed at raw tables re-derives metric logic on every prompt, so the same question returns different numbers.
A semantic layer is what makes AI-driven analytics trustworthy — agents select from certified metrics instead of inventing SQL. Cube is the agentic analytics platform built on a semantic layer; as Brex put it, the semantic layer is what makes the AI useful.

"Data analytics" covers a lot of ground — from a single SQL query to a forecasting model to an AI agent answering a question in plain language. This is a plain-language explainer for data leaders and practitioners: what data analytics actually is, the four types it's usually split into, the workflow and tools behind it, and how the shift to AI agents is changing the work.

TL;DR

Data analytics is the practice of examining, cleaning, modeling, and interpreting data to answer questions and support decisions — turning raw rows into insight people can act on. It's commonly split into four types by the question they answer: descriptive (what happened), diagnostic (why), predictive (what's likely next), and prescriptive (what to do). The workflow is a loop — define the question, collect and clean the data, model and analyze it, interpret, act — running on a layered stack: a warehouse for storage, a semantic layer for governed metric definitions, and BI tools, notebooks, embedded apps, or AI agents on top. The newest shift is agents doing the analytical work, and a semantic layer is what keeps their answers trustworthy. Cube is the agentic analytics platform built on a semantic layer.

A working definition

Data analytics is the process of examining, cleaning, transforming, modeling, and interpreting data to answer questions and support decisions. The point is conversion: raw rows — orders, events, clicks, sensor readings — on one end, and on the other a thing you can act on, a trend worth chasing, a cause worth fixing, a forecast worth planning against.

It's a broad discipline, which is why the term feels slippery. The same phrase covers an analyst running a one-off SQL query, a finance team living in a governed dashboard, a data scientist training a churn model, and a business user asking an AI agent "why did margin drop in EMEA last quarter?" in plain language. What unites them is the goal — decisions grounded in evidence rather than intuition — and a rough shared workflow underneath.

A quick note on a neighbor term. Business intelligence is often used as a synonym, but it's really one large slice of analytics: the tools and practice of reporting on and exploring business data, leaning toward what-happened and why. Data analytics is the wider field that also includes forecasting, optimization, statistics, and data science. BI is inside analytics, not beside it.

The four types of data analytics

The most useful way to organize the field is by the question each kind of analysis answers. The four build on each other: most teams start with descriptive analytics and add the others as the questions get harder and the data gets richer.

Type	Question it answers	Example	What it takes
Descriptive	What happened?	Revenue last quarter; weekly active users; the trend over the year	Aggregations, summaries, dashboards
Diagnostic	Why did it happen?	Why churn rose in one region; which segment drove the dip	Drill-down, segmentation, correlation
Predictive	What's likely to happen next?	Projected demand; churn risk; expected revenue	Historical data, statistical and ML models
Prescriptive	What should we do about it?	Optimal pricing; the best inventory plan	Optimization, decision models, simulation

Descriptive analytics is the foundation and the most common — it summarizes what already happened, the totals and trends and counts that fill most dashboards. Diagnostic analytics goes a step deeper, drilling into causes and correlations to explain a number rather than just report it. Predictive analytics turns to look forward, using historical data and statistical or machine-learning models to estimate what's likely next. Prescriptive analytics goes furthest: not just forecasting an outcome but recommending an action, using optimization and decision models to suggest the best move.
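To make the first three types concrete, here is a minimal Python sketch over a made-up set of monthly revenue figures. The region names, the numbers, and the straight-line trend used for the forecast are illustrative assumptions, not a recommended modeling approach. Prescriptive analytics is left out because it needs an explicit objective and constraints to optimize against.

```python
# Hypothetical monthly revenue by region (illustrative numbers only).
revenue = {
    "EMEA": [100, 110, 120, 90],
    "AMER": [200, 210, 220, 230],
}

# Descriptive: what happened? Total revenue per month across regions.
monthly_totals = [sum(month) for month in zip(*revenue.values())]

# Diagnostic: why did the last month change? Break the change down by region.
change_by_region = {
    region: values[-1] - values[-2] for region, values in revenue.items()
}


# Predictive: what is likely next? Fit a least-squares line to the totals
# and extrapolate one month ahead (a deliberately simple stand-in for a model).
def forecast_next(values):
    n = len(values)
    xs = range(n)
    mean_x = sum(xs) / n
    mean_y = sum(values) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, values)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return intercept + slope * n


next_month = forecast_next(monthly_totals)
```

In this toy data the descriptive totals dip in the last month, the diagnostic breakdown shows the dip comes from EMEA while AMER grew, and the forecast simply extends the overall trend.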

You'll see other techniques named alongside these — exploratory data analysis (poking at a dataset to find patterns before formal modeling), inferential analytics (generalizing from a sample to a population), and real-time analytics (analyzing data as it's generated, for cases where a decision can't wait). They're methods that show up across the four types rather than a separate tier.

The data analytics workflow

Under the variety, most analytics follows a recognizable loop. It's worth seeing it as a loop, not a line: what you learn at the end reshapes the next question.

Define the question. The step that's easiest to skip and most expensive to get wrong. "Are we growing?" and "did paid acquisition in EMEA pay back within 90 days?" lead to completely different analyses.
Collect the data. Pull it from its sources — application databases, event streams, third-party APIs, files. In a mature stack this lands in a central data warehouse or lakehouse.
Clean and transform. Real data is messy: duplicates, nulls, inconsistent units, three columns that all sort of mean "customer." Getting it into a usable, consistent shape is usually the most time-consuming part of the whole process.
Model and analyze. Define the metrics and the relationships between entities, then apply the right technique — an aggregation, a segmentation, a regression, a forecast. This is where the question actually gets answered.
Interpret and act. Translate the result into something a human can decide on, communicate it clearly, and feed what you learned back into the next question.
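The clean-and-transform step can be sketched in a few lines of Python. The records, field names, and the cents-to-dollars rule below are hypothetical, chosen only to show the three problems named above: duplicates, nulls, and inconsistent units.

```python
# Hypothetical raw order rows: one duplicate, one missing amount,
# and one amount recorded in cents instead of dollars.
raw_orders = [
    {"order_id": 1, "amount": 25.0, "unit": "usd"},
    {"order_id": 1, "amount": 25.0, "unit": "usd"},    # duplicate
    {"order_id": 2, "amount": None, "unit": "usd"},    # null amount
    {"order_id": 3, "amount": 1999, "unit": "cents"},  # inconsistent unit
]


def clean(rows):
    seen = set()
    cleaned = []
    for row in rows:
        if row["order_id"] in seen:
            continue  # drop repeated order IDs, keeping the first occurrence
        seen.add(row["order_id"])
        if row["amount"] is None:
            continue  # skip rows with no amount rather than guessing a value
        # Normalize every amount to dollars.
        amount = row["amount"] / 100 if row["unit"] == "cents" else row["amount"]
        cleaned.append({"order_id": row["order_id"], "amount_usd": amount})
    return cleaned


orders = clean(raw_orders)
```

Whether to drop a row with a missing amount or fill it in some other way is a judgment call that depends on the question being asked; the sketch drops it only to keep the example short.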

The cleaning and modeling steps are where consistency lives or dies. If every analysis re-defines what "revenue" or "active user" means in its own query, you get three different numbers for the same metric in the same meeting — which is the problem a semantic layer exists to solve, by defining each metric once for everyone to reuse.

The data analytics tool stack

A modern analytics stack is layered, and it helps to keep the layers straight because they do genuinely different jobs.

Storage and compute. A data warehouse or lakehouse — Snowflake, BigQuery, Redshift, Databricks — holds the data and runs the queries. This is the center of gravity for any team past the spreadsheet stage.
Transformation. A tool like dbt models and cleans the raw data into analysis-ready tables, versioned as code.
Semantic layer. A governed layer that defines metrics, dimensions, join paths, and access rules once, on top of the warehouse, so every downstream tool returns the same numbers.
Consumption. The tools people actually use: BI tools for dashboards and exploration, notebooks (Python, SQL) for deeper analysis, spreadsheets for ad-hoc work, embedded analytics inside products for customers, and, increasingly, AI agents.

SQL is the common language running underneath most of these layers. And a recurring source of confusion is worth flagging: the semantic layer sits on top of the warehouse and dbt is a partner to it, not a competitor — dbt models the data, the semantic layer governs the metrics and serves them. Neither replaces your warehouse.

How AI is changing data analytics

The biggest shift in 2026 is in who — or what — does the analytical work. For decades the model was a person driving a tool: dragging fields onto a dashboard, writing the SQL behind it, building the forecast in a notebook. That's now moving toward agentic analytics — AI agents that answer questions, build calculations, and take action, with a person framing the question and judging the answer instead of hand-writing every query.

This is genuinely useful: someone in support or ops who can't write SQL can ask a hard question in plain language, and analysts who can write SQL stop hand-writing the same joins for the hundredth time. But it has a structural catch. Point an LLM at raw tables and it has to re-derive your business on every prompt. A table named orders doesn't encode whether revenue is gross or net, includes tax, or excludes refunds; the join graph has fan-outs and three tables that all look like "the customer"; and nothing in a SELECT distinguishes a correct query from one that leaks another tenant's data. So "what was revenue last quarter?" can return three different numbers across three sessions. That's not a prompt-engineering problem — it's a missing layer.
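A small Python sketch shows how that divergence can arise. The order rows and the three definitions are hypothetical; each is a plausible reading of "revenue" that a query generated from raw tables could land on.

```python
# Hypothetical order rows for one quarter.
orders = [
    {"subtotal": 100.0, "tax": 10.0, "refunded": False},
    {"subtotal": 200.0, "tax": 20.0, "refunded": True},
    {"subtotal": 50.0, "tax": 5.0, "refunded": False},
]

# Reading 1: gross revenue including tax, refunds not excluded.
gross_with_tax = sum(o["subtotal"] + o["tax"] for o in orders)

# Reading 2: gross revenue excluding tax, refunds not excluded.
gross_without_tax = sum(o["subtotal"] for o in orders)

# Reading 3: net revenue excluding tax and excluding refunded orders.
net_without_tax = sum(o["subtotal"] for o in orders if not o["refunded"])
```

All three are defensible, and nothing in the rows themselves says which one the business has agreed on.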

The fix is to ground the agent in a semantic layer. Instead of inventing SQL, the agent selects from certified metrics by name, so answers are consistent, governed, and explainable — you can see which named metrics produced a number rather than auditing a wall of generated SQL. This isn't theoretical: Brex evaluated approaches for grounding AI on their data, chose Cube, and built Brex Spaces, an embedded AI financial analyst, on top of it. Their one-line summary is the cleanest case for the whole shift: the semantic layer is what makes the AI useful.
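The select-by-name idea can be illustrated with a toy registry in Python. This is not Cube's API or data model: the `CERTIFIED_METRICS` dictionary, the `answer` function, and the sample rows are hypothetical stand-ins, meant only to show the difference between looking up a governed definition and generating a new one.

```python
# Hypothetical order rows.
orders = [
    {"subtotal": 100.0, "refunded": False},
    {"subtotal": 200.0, "refunded": True},
    {"subtotal": 50.0, "refunded": False},
]

# Toy registry: each certified metric is defined once, under a fixed name.
CERTIFIED_METRICS = {
    "net_revenue": lambda rows: sum(
        r["subtotal"] for r in rows if not r["refunded"]
    ),
    "order_count": lambda rows: len(rows),
}


def answer(metric_name, rows):
    """Resolve a metric by name; refuse anything that is not certified."""
    if metric_name not in CERTIFIED_METRICS:
        raise KeyError(f"'{metric_name}' is not a certified metric")
    value = CERTIFIED_METRICS[metric_name](rows)
    # Return the metric name alongside the value so the result is traceable.
    return {"metric": metric_name, "value": value}


result = answer("net_revenue", orders)
```

Because the caller can only pick a name from the registry, the same question resolves to the same definition every time, and the returned name records which definition produced the number.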

A note on history, since it gets invoked here: pre-aggregated, multidimensional analysis goes back to the OLAP era of the 1990s, and the query languages and protocols that grew out of that lineage (MDX, XMLA, and later DAX) still connect tools like Excel to modern data. That lineage is real, but it's plumbing, not the story. The story now is AI-native analytics over a governed model.

Where Cube fits

Cube is the agentic analytics platform built on a semantic layer. Its open-source foundation, Cube Core (Apache 2.0), is the semantic layer: you model metrics, dimensions, joins, and access rules once, and serve them over SQL, REST, GraphQL, an MCP server for AI agents, and DAX/MDX for spreadsheet tools. It sits on top of your warehouse — Snowflake, BigQuery, Redshift, Databricks — which stays your storage and compute; row-level, multi-tenant security is applied at compile time; and pre-aggregation caching keeps queries fast. On top of that foundation, the platform adds the AI agent interfaces, workbooks, dashboards, and embedded surfaces — so the same governed model powers both internal business intelligence for your teams and embedded analytics for your customers. That's why 400+ companies build on Cube across both use cases.

For data analytics specifically, the value is consistency and trust across every kind of analysis. Descriptive dashboards, diagnostic drill-downs, and an AI agent answering in plain language all draw from the same certified definitions, so "revenue" means one thing everywhere. And dbt is a partner, not something Cube replaces: model the data in dbt, serve it through Cube, which reads dbt models.

Our verdict

Data analytics is the practice of turning raw data into decisions. It is usually organized into four types (descriptive, diagnostic, predictive, and prescriptive) and run as a loop (question, collect, clean, model, analyze, act) on a layered stack of warehouse, semantic layer, and consumption tools. The defining shift now is AI agents doing the analytical work, which only stays trustworthy when the agent is grounded in a governed semantic layer rather than pointed at raw tables. The strongest fit is a platform that's SQL-first, governed at compile time, and serves internal BI and embedded analytics from one model: that's Cube, built on the open-source Cube Core.

Methodology

This explainer describes data analytics as the term is used in 2026, organized around the four-type framing (descriptive, diagnostic, predictive, prescriptive) and the analyze-and-act workflow that practitioners share across them, then weighted toward what's changing — AI agents doing analytical work and the governed semantic layer that keeps their answers consistent, access-controlled, and explainable. As the publisher, Cube builds a semantic layer and an agentic analytics platform on top of it, so we have an obvious interest here; we've tried to define the field neutrally and be explicit about where Cube fits versus the broader discipline. Tools and product capabilities move quickly — treat them as version-dependent and confirm against current documentation.

Frequently asked questions

What is data analytics?

Data analytics is the practice of examining, cleaning, transforming, modeling, and interpreting data to answer questions and support decisions. The point is to turn raw rows (orders, events, sensor readings) into insight a person or a system can act on: a trend to chase, a cause to fix, a forecast to plan against. It spans everything from a one-off SQL query to a governed BI dashboard to an AI agent answering a question in plain language.

What are the four types of data analytics?

Descriptive analytics asks what happened (summaries, trends, totals). Diagnostic analytics asks why it happened (drilling into causes and correlations). Predictive analytics asks what is likely to happen next (forecasts from historical data and statistical or machine-learning models). Prescriptive analytics asks what to do about it (recommended actions from optimization and decision models). They build on each other: most teams start with descriptive and add the others as the questions get harder.

What is the difference between data analytics and business intelligence?

They overlap heavily. Business intelligence usually refers to the tools and practice of reporting on and exploring business data (dashboards, metrics, ad-hoc analysis) and leans descriptive and diagnostic. Data analytics is the broader discipline of working with data to answer questions, which also covers predictive and prescriptive work, statistics, and data science. In practice BI is one large slice of data analytics, not a separate thing.

What is the data analytics workflow?

A common version: define the question, collect the data from its sources, clean and transform it into a usable shape, model it (define the metrics and relationships), analyze it with the right technique, interpret the result, and act. It's a loop rather than a straight line, since what you learn at the end reshapes the next question, and the cleaning and modeling steps usually take the most time.

What tools are used for data analytics?

A modern stack has layers. A data warehouse or lakehouse (Snowflake, BigQuery, Redshift, Databricks) stores the data and runs the compute. A transformation tool like dbt models and cleans it. A semantic layer defines metrics and access rules once. On top sit the consumers: BI tools, notebooks (Python, SQL), spreadsheets, embedded analytics in products, and increasingly AI agents. SQL is the common language underneath most of it.

What is the difference between descriptive and predictive analytics?

Descriptive analytics looks backward: it summarizes what already happened, such as last quarter's revenue, this week's signups, or the trend over the year. Predictive analytics looks forward: it uses that historical data, plus statistical or machine-learning models, to estimate what is likely to happen next, such as projected demand, churn risk, or expected revenue. Descriptive tells you where you've been; predictive estimates where you're headed.

How is AI changing data analytics?

Analytics is shifting from people driving dashboards to AI agents doing analytical work: answering questions, building calculations, and taking action in plain language. The shift is real but has a catch: an LLM pointed at raw tables doesn't know your join paths, metric definitions, or access rules, so it re-derives them on every prompt and the same question can return different numbers. Grounding the agent in a governed semantic layer is what makes its answers consistent and trustworthy.

Do I need a data warehouse to do data analytics?

Not for small, one-off analysis, where a spreadsheet or a single database can be enough. But once data lives in several systems, grows large, or many people need consistent numbers, a warehouse or lakehouse becomes the practical center of gravity: it stores the data and runs the queries. The analytics tools sit on top of it rather than replacing it.

What skills do you need for data analytics?

The durable core is asking a sharp question, then cleaning and shaping data to answer it honestly. SQL is the common language; statistics keeps you from fooling yourself; a BI tool or notebook is where most analysis happens; and clear communication is what turns a result into a decision. As AI agents take over more of the query-writing, the premium shifts toward framing questions well and judging whether an answer is trustworthy.

What does Cube have to do with data analytics?

Cube is the agentic analytics platform built on a semantic layer. It sits on top of your warehouse and defines metrics, dimensions, joins, and access rules once, then serves them to BI tools, embedded apps, and AI agents over SQL, REST, GraphQL, MCP, and DAX/MDX, so every consumer returns the same governed numbers. Its open-source core, Cube Core (Apache 2.0), is the semantic layer itself. 400+ companies build on it across internal BI and embedded analytics.
