ArviZ migration guide#

We have been working on refactoring ArviZ to allow more flexibility and extensibility of its elements while keeping as much as possible a friendly user-interface that gives sensible results with little to no arguments.

One important change is enhanced modularity. Everything will still be available through a common namespace arviz, but ArviZ will now be composed of 3 smaller libraries:

arviz-base data related functionality, including converters from different PPLs.
arviz-stats for statistical functions and diagnostics.
arviz-plots for visual checks built on top of arviz-stats and arviz-base.

Each library has a minimal set of dependencies, with a lot of functionality built on top of optional dependencies. This keeps ArviZ smaller and easier to install as you can install only the components you really need. The main examples are:

arviz-base has no I/O library as a dependency, but you can use netcdf4, h5netcdf or zarr to read and write your data, allowing you to install only the one you need.
arviz-plots has no plotting library as a dependency, but it can generate plots with matplotlib, bokeh or plotly if they are installed.

import arviz as az

Check all 3 libraries have been exposed correctly:

print(az.info)

Status information for ArviZ 1.0.0rc0

arviz_base 0.9.0.dev0 available, exposing its functions as part of the `arviz` namespace
arviz_stats 0.9.0.dev available, exposing its functions as part of the `arviz` namespace
arviz_plots 0.9.0.dev0 available, exposing its functions as part of the `arviz` namespace

`arviz-base`#

Credible intervals and rcParams#

Some global configuration settings have changed. For example, the default credible interval probability (ci_prob) has been updated from 0.94 to 0.89. Using 0.89 produces intervals with lower variability, leading to more stable summaries. At the same time, keeping a non-standard value (rather than 0.90 or 0.95) serves as a friendly reminder that the choice of interval can depend on the problem at hand.

In addition, a new setting ci_kind has been introduced, which defaults to “eti” (equal-tailed interval). This controls the method used to compute credible intervals. The alternative is “hdi” (highest density interval), which was previously the default.

Defaults set via rcParams are not fixed rules, they’re meant to be adjusted to fit the needs of your analysis. rcParams offers a convenient way to establish global defaults for your workflow, while most functions that compute credible intervals also provide ci_prob and ci_kind arguments to override these settings locally.

You can check all default settings with:

az.rcParams

RcParams({'data.http_protocol': 'https',
          'data.index_origin': 0,
          'data.sample_dims': ('chain', 'draw'),
          'data.save_warmup': False,
          'plot.backend': 'matplotlib',
          'plot.density_kind': 'kde',
          'plot.max_subplots': 40,
          'stats.ci_kind': 'eti',
          'stats.ci_prob': 0.89,
          'stats.envelope_prob': 0.99,
          'stats.ic_compare_method': 'stacking',
          'stats.ic_pointwise': True,
          'stats.ic_scale': 'log',
          'stats.module': 'base',
          'stats.point_estimate': 'mean',
          'stats.round_to': '2g'})

`DataTree`#

One of the main differences is that the arviz.InferenceData object doesn’t exist anymore. arviz-base uses xarray.DataTree instead. This is a new data structure in xarray, so it might still have some rough edges, but it is much more flexible and powerful. To give some examples, I/O will now be more flexible, and any format supported by xarray is automatically available to you, no need to add wrappers on top of them within ArviZ. It is also possible to have arbitrary nesting of variables within groups and subgroups.

Important

Not all the functionality on xarray.DataTree will be compatible with ArviZ as it would be too much work for us to cover and maintain. If there are things you have always wanted to do but were not possible with InferenceData and are now possible with DataTree please try them out, give feedback on them and on desired behaviour for things that still don’t work. After a couple releases the “ArviZverse” will stabilize much more, and it might not be possible to add support for that anymore.

What about my existing netcdf/zarr files?#

They are still valid. There have been no changes on this end and we don’t plan to make any. The underlying functions handling I/O operations have changed, but the effect on your workflows should be minimal; the arguments continue to be mostly the same, and only some duplicated aliases have been removed:

Function in legacy ArviZ	New equivalent in xarray
arviz.from_netcdf	`arviz.from_netcdf()`[1]
arviz.from_zarr	`arviz.from_zarr()`[1]
arviz.to_netcdf	-
arviz.to_zarr	-
arviz.InferenceData.from_netcdf	-
arviz.InferenceData.from_zarr	-
arviz.InferenceData.to_netcdf	`xarray.DataTree.to_netcdf()`
arviz.InferenceData.to_zarr	`xarray.DataTree.to_zarr()`

from_zarr is a functools.partial wrapper of open_datatree with engine="zarr" already set
from_netcdf is exactly open_datatree so you can use the engine keyword to choose explicitly between netcdf4, h5netcdf or leave it to xarray’s default behaviour and netcdf_engine_order setting.

Here is an example where we read a file that was saved from an InferenceData object using idata.to_netcdf("example.nc").

Enhanced converter flexibility#

Were you constantly needing to add an extra axis to your data because it didn’t have any chain dimension? No more!

import numpy as np
rng = np.random.default_rng()
data = rng.normal(size=1000)

# arviz_legacy.from_dict({"posterior": {"mu": data}}) would fail
# unless you did data[None, :] to add the chain dimension
az.rcParams["data.sample_dims"] = "sample"

dt = az.from_dict({"posterior": {"mu": data}})
dt

<xarray.DataTree>
Group: /
└── Group: /posterior
        Dimensions:  (sample: 1000)
        Coordinates: (1)
        Data variables:
            mu       (sample) float64 8kB 0.4879 -0.4405 0.4089 ... 1.147 -0.3469 0.4506
        Attributes: (4)

xarray.DataTree

Groups: (1)

/posterior

Dimensions:
- sample: 1000
Coordinates: (1)
- sample
  (sample)
  int64
  0 1 2 3 4 5 ... 995 996 997 998 999
```
array([  0,   1,   2, ..., 997, 998, 999], shape=(1000,))
```

Data variables: (1)

mu

(sample)

float64

0.4879 -0.4405 ... -0.3469 0.4506

array([ 4.87927208e-01, -4.40453578e-01,  4.08943957e-01,  5.39275039e-01,3.62951326e-01,  1.19677234e+00,  7.88189072e-01,  1.06676064e-01,-7.36228253e-01, -7.31135774e-01, -7.71890069e-01,  9.11383226e-01,-1.52432228e+00,  6.95761493e-01,  1.32664411e+00, -3.69769985e-01,-8.81975681e-02, -7.44162102e-01, -2.03359859e+00,  3.16519863e-01,-3.67247767e-01,  4.48977910e-01, -1.28167007e+00,  5.66847596e-01,1.32617725e+00, -7.30068054e-01, -2.23105682e+00, -4.01281005e-02,4.87534118e-01,  1.10825706e+00, -1.73902378e-01,  5.66271727e-01,4.23503426e-01, -6.52794947e-01, -1.54704976e+00,  1.46808814e+00,-2.49351527e-01,  2.72952636e-01,  1.36728975e+00,  5.33739991e-01,-8.82663284e-01,  2.26089437e-01,  1.41335266e-02,  7.41070884e-01,3.78358741e-01,  5.06218757e-01, -3.03716925e-01,  1.43852746e+00,7.75648521e-02, -1.28719598e+00,  5.61962305e-01,  9.38142740e-01,4.49909867e-01,  1.69952369e-01, -3.18646036e-01, -1.54238186e+00,1.13031247e+00,  1.66132761e-01,  2.49978430e-01,  1.12068678e+00,6.97426759e-01, -5.49580702e-02,  1.23344005e+00,  7.87315422e-02,-3.14296962e-01, -7.64642379e-01,  3.49165578e-01, -1.25117691e+00,3.67686558e-01, -1.65546600e-01, -1.53759522e+00, -1.44988728e+00,1.06949821e+00,  1.11132173e+00, -1.21958570e+00, -8.16663444e-01,3.77405031e-01, -1.38881663e-02,  1.72788845e-02,  1.98166664e-01,...-5.74526053e-01,  8.30769983e-01, -1.08342845e-02,  1.00904408e+00,-3.15430585e-01, -4.28653268e-01, -3.47548977e-01, -2.84774373e-01,-7.30336214e-02, -1.11835728e-01, -6.47768401e-01, -9.47951287e-02,1.78540762e+00,  1.48108860e+00,  4.43868653e-01,  4.93975986e-01,-1.34238371e+00, -3.05760676e-02, -8.72690640e-01, -8.41344987e-01,-2.00572956e-01,  7.48184783e-02, -2.34924959e-01, -2.68697484e-01,1.24233561e+00,  8.73413502e-01,  9.68342512e-01, -9.26592815e-01,1.42315366e+00, -3.85957629e-01,  4.26815818e-01, -8.16280769e-01,9.94896561e-01, -1.42430940e-01,  1.09563274e+00,  1.15086582e+00,-6.02472374e-01, -9.90600233e-01, -1.35527158e+00,  1.66839079e+00,-3.16778290e+00, -4.75740083e-01,  4.06252297e-01, -8.40444582e-02,-1.04952254e+00, -9.30461555e-01, -1.67642508e+00,  2.76770067e-01,7.72906881e-01, -8.08701765e-01, -5.95059280e-01,  1.18902618e+00,-1.30398812e+00,  5.12601571e-01,  8.67095333e-01, -7.80923775e-01,2.23715820e+00,  2.33861639e-01, -2.70369876e-01,  2.61798265e-01,2.01252843e-01,  1.50124471e+00,  1.07540922e+00, -9.87473599e-01,-2.99717259e-01, -1.66750559e+00,  5.81663283e-01,  2.28605905e-01,-1.43756680e-01,  1.24263983e+00, -1.58030752e+00, -3.79686376e-01,-8.20560651e-01, -9.49150570e-01, -1.24796548e+00,  6.42440665e-01,1.36799692e+00,  1.14672816e+00, -3.46944466e-01,  4.50586790e-01])

Attributes: (4)
created_at :
2026-02-03T23:35:41.769579+00:00
creation_library :
ArviZ
creation_library_version :
0.9.0.dev0
creation_library_language :
Python

# arviz-stats and arviz-plots also take it into account
az.plot_dist(dt);

../_images/e935faa8b5d919991eeabdd6ae9297fec4baef03e67b135ed1e197994212941b.png

Note

It is also possible to modify sample_dims through arguments to the different functions.

New data wrangling features#

We have also added multiple functions to help with common data wrangling tasks, mostly from and to xarray.Dataset. For example, you can convert a dataset to a wide format dataframe with unique combinations of sample_dims as its rows, with dataset_to_dataframe():

# back to default behaviour
az.rcParams["data.sample_dims"] = ["chain", "draw"]
dt = az.load_arviz_data("centered_eight")
az.dataset_to_dataframe(dt.posterior.dataset)

	mu	theta[Choate]	theta[Deerfield]	theta[Phillips Andover]	theta[Phillips Exeter]	theta[Hotchkiss]	theta[Lawrenceville]	theta[St. Paul's]	theta[Mt. Hermon]	tau
(0, 0)	1.715723	2.317391	1.450174	2.085550	2.227076	3.071507	2.712972	3.083764	1.460448	0.877494
(0, 1)	1.903481	0.889170	0.742949	3.125869	2.779524	2.834705	1.558939	2.487503	1.984379	0.802714
(0, 2)	1.903481	0.889170	0.742949	3.125869	2.779524	2.834705	1.558939	2.487503	1.984379	0.802714
(0, 3)	1.903481	0.889170	0.742949	3.125869	2.779524	2.834705	1.558939	2.487503	1.984379	0.802714
(0, 4)	2.017497	1.109120	0.818893	2.750620	1.928670	1.983162	1.029620	3.662744	2.167574	0.767934
...	...	...	...	...	...	...	...	...	...	...
(3, 495)	7.750625	11.477589	5.578327	9.321531	5.812095	5.437099	3.096142	9.731409	7.948321	3.020477
(3, 496)	6.922368	2.710763	8.646136	3.807844	7.543669	6.788881	6.595036	4.003042	5.275016	2.704639
(3, 497)	5.408836	11.406390	4.446937	9.210775	6.331074	4.150778	4.812302	9.693257	4.914656	2.236486
(3, 498)	7.721440	7.086139	12.311889	6.584301	10.286093	10.050167	11.859938	7.952268	9.754468	2.989656
(3, 499)	10.237157	10.464390	13.714306	10.261666	15.180098	10.916030	15.070900	14.923210	14.023129	3.051559

2000 rows × 10 columns

Note it is also aware of ArviZ naming conventions in addition to using the sample_dims rcParam. It can be further customized through a labeller argument.

Tip

If you want to convert to a long format dataframe, you should use xarray.Dataset.to_dataframe() instead.

`arviz-plots`#

Out of the 3 libraries, arviz-plots is the one with the most changes at all levels, breaking changes, new features more layers to explore.

More and better supported backends!#

One of they key efforts of the refactor has been simplifying the way we interface with the different plotting backends supported. arviz-plots has more backends: matplotlib, bokeh and plotly are all supported now, with (mostly) feature parity among them. All while having less backend related code!

This also means that az.style is no longer an alias to matplotlib.style but its own module with similar (reduced API) that sets the style for all compatible and installed backends (unless a backend is requested explicitly):

az.style.use("arviz-vibrant")
dt = az.load_arviz_data("centered_eight")
az.plot_rank(dt, var_names=["mu", "tau"], backend="matplotlib");

../_images/85b3b016ec3b7a5274b4101d1af7e811b51bbb718ff50143d32ea9ee4d4ca519.png

import plotly.io as pio
pio.renderers.default = "notebook"
pc = az.plot_rank(dt, var_names=["mu", "tau"], backend="plotly")
pc.show()

At the time of writing, there are three cross-backend themes defined by ArviZ: arviz-variat, arviz-vibrant and arviz-cetrino.

Plotting function inventory#

The following functions have been renamed or restructured:

ArviZ <1	ArviZ >=1
plot_bpv	plot_ppc_pit, plot_ppc_tstat
plot_dist_comparison	plot_prior_posterior
plot_ecdf	plot_dist, plot_ecdf_pit
plot_ess	plot_ess, plot_ess_evolution
plot_forest	plot_forest, plot_ridge
plot_ppc	plot_ppc_dist
plot_posterior, plot_density	plot_dist
plot_trace	plot_trace_dist, plot_trace_rank

Others have had their code rewritten and their arguments updated to some extent, but kept the same name:

plot_autocorr
plot_bf
plot_compare
plot_energy
plot_khat
plot_lm
plot_loo_pit
plot_mcse
plot_pair
plot_parallel
plot_rank

The following functions have been added:

Some functions have been removed and we don’t plan to add them:

plot_dist (notice we have plot_dist but it is a different function)
plot_kde (this is now part of plot_dist)
plot_violin

And there are also functions we plan to add but aren’t available yet.

plot_elpd
plot_ppc_residuals
plot_ts

Note

For now, the documentation for arviz-plots defaults to latest which is built from GitHub with each commit. If you see some of the functions in the last block already on the example gallery you should be able to try them, but only if you install the development version! See Installation

You can see all of them at the arviz-plots gallery.

What to expect from the new plotting functions#

There are two main differences with the plotting functions here in legacy ArviZ:

The way of forwarding arguments to the plotting backends.
The return type is now PlotCollection, one of the key features of arviz-plots. A quick overview in the context of plot_xyz is given here but it later has a section of its own.

Other than that, some arguments have been renamed or gotten different defaults, but nothing major. Note, however, that we have incorporated elements from grammar of graphics into arviz-plots, now that we’ll cover the internals of plot_xyz in passing we’ll use some terms from grammar of graphics. If you have never heard about grammar of graphics we recommend you take a look at Overview before continuing.

kwarg forwarding#

Most plot_xyz functions now have a visuals and a stats argument. These arguments are dictionaries whose keys define where their values are forwarded too. The values are also dictionaries representing keyword arguments that will be passed downstream via **kwargs. This allows you to send arbitrary keyword arguments to all the different visual elements or statistical computations that are part of a plot without bloating the call signature with endless xyz_kwargs arguments like in legacy ArviZ.

These same arguments also allow indicating a visual element should not be added to the plot, or providing pre computed statistical summaries for faster re-rendering of plots (at the time of writing pre-computed inputs are only working in plot_forest but should be extended soon).

In addition, the call signature of new plotting functions is plot_xyz(..., **pc_kwargs), with these pc_kwargs being forwarded to the initialization of PlotCollection. This argument allows controlling the layout of the figure as well as any aesthetic mappings that might be used by the plotting function.

For a complete notebook introduction on this see Introduction to batteries-included plots

New return type: `PlotCollection`#

All plot_xyz functions now return a “plotting manager class”. At the time of writing this means either PlotCollection (vast majority of plots) or PlotMatrix (for upcoming plot_pair for example).

These classes are the ones that handle faceting and aesthetic mappings and allow the plot_xyz functions to focus on the visuals and not on the plot layout or encodings.

See Using PlotCollection objects for more details on how to work with existing PlotCollection instances.

Plotting manager classes#

As we have just mentioned, plot_xyz use these plotting manager classes and then return them as their output. In addition, we hope users will use these classes directly to help them write custom plotting functions more easily and with more flexibility.

By using these classes, users should be able to focus on writing smaller functions that take care of a “unit of plotting”. You can then use their .map methods to apply these plotting functions as many times as needed given the faceting and aesthetic mappings defined by the user. Different layouts and different mappings will generally not require changes to these plotting functions, only to the arguments that define aesthetic mappings and the faceting strategy.

Take a look at Create your own figure with PlotCollection if that sounds interesting!

Other arviz-plots features#

There are also helper functions to help compose or extend existing plotting functions. For example, we can create a new plot, with a similar layout to that of plot_trace_dist or plot_rank_dist but custom diagnostics in each column: distribution, rank and ess evolution:

az.combine_plots(
    dt,
    [
        (az.plot_dist, {"kind": "ecdf"}),
        (az.plot_rank, {}),
        (az.plot_ess_evolution, {}),
    ],
    var_names=["theta", "mu", "tau"],
    coords={"school": ["Hotchkiss", "St. Paul's"]},
);

../_images/f4377bc948b07baf4fd6383ae078a7d8d7f107a23d8fd4448fe3ef845b1a8b30.png

Other nice features#

Citation helper#

We have also added a helper to cite ArviZ in your publications and also the methods implemented in it. You can get the citation in BibTeX format through citation():

Extended documentation#

One recurring feedback we have received is that the documentation was OK for people very familiar with Bayesian statistics and probabilistic programming, but not so much for newcomers. Thus, we have added more introductory material and examples to the documentation, including a separated resource that show how to use ArviZ “in-context”, see EABM. And we attempted to make the documentation easier to navigate and understand for everyone.

ArviZ migration guide#

`arviz-base`#

Credible intervals and rcParams#

`DataTree`#

What about my existing netcdf/zarr files?#

Other key differences#

`InferenceData.extend`#

`InferenceData.map`#

`InferenceData.groups`#

Enhanced converter flexibility#

New data wrangling features#

`arviz-stats`#

Model comparison#

`dim` and `sample_dims`#

Accessors on xarray objects#

Computational backends#

Array interface#

`arviz-plots`#

More and better supported backends!#

Plotting function inventory#

What to expect from the new plotting functions#

kwarg forwarding#

New return type: `PlotCollection`#

Plotting manager classes#

Other arviz-plots features#

Other nice features#

Citation helper#

Extended documentation#

ArviZ migration guide#

arviz-base#

Credible intervals and rcParams#

DataTree#

What about my existing netcdf/zarr files?#

Other key differences#

InferenceData.extend#

InferenceData.map#

InferenceData.groups#

Enhanced converter flexibility#

New data wrangling features#

arviz-stats#

Model comparison#

dim and sample_dims#

Accessors on xarray objects#

Computational backends#

Array interface#

arviz-plots#

More and better supported backends!#

Plotting function inventory#

What to expect from the new plotting functions#

kwarg forwarding#

New return type: PlotCollection#

Plotting manager classes#

Other arviz-plots features#

Other nice features#

Citation helper#

Extended documentation#

`arviz-base`#

`DataTree`#

`InferenceData.extend`#

`InferenceData.map`#

`InferenceData.groups`#

`arviz-stats`#

`dim` and `sample_dims`#

`arviz-plots`#

New return type: `PlotCollection`#