For context: I'd like to make a plot in plotly showing the evolution of an investment portfolio where the value of each asset is plotted on top of each other. Since assets are bought and sold, not every asset should be shown for the entire range of the curve. The below example can clarify this. Leading or trailing zeros indicate that the asset was not in the portfolio at that moment.
import pandas as pd
import plotly.express as px
import numpy as np
data = {"Asset 1": [0, 1, 2, 3, 4, 5], "Asset 2": [0, 0, 2, 3, 2, 2], "Asset 3": [1, 1, 3, 0, 0, 0]}
df = pd.DataFrame(data)
fig = px.area(df)
fig.show()
This results in the following figure:
The problem is now that at the indicated time (index=4), Asset 3 is not in the portfolio anymore, hence its value 0. However it is still shown, and the bigger problem is that it makes it impossible to see the value of Asset 2 which is in the portfolio.
I tried changing the zeros to NaN
values to indicate that they don't exist but that gives the exact same figure.
data2 = {"a": [np.nan, 1, 2, 3, 4, 5], "b": [np.nan, np.nan, 2, 3, 2, 2], "c": [1, 1, 3, np.nan, np.nan, np.nan]}
df2 = pd.DataFrame(data2)
fig2 = px.area(df2)
fig2.show()
I am afraid I cannot construct an elegant solution. However this will work for most requirements you stated. How it works:
plotly.express
offers limited custom control. Instead of usingplotly.express
, useplotly.graph_objects
. They have similar syntax.The resulting plot would look like:
The green line, representing values of
df['a']+df['b']+df['c']
still sits on the top. However, the hover label is now showing the value ofdf['a']+df['b']+df['c']
instead of either of the assets.In fact, I found these asset-allocation-y plot prettier without the edge lines:
and this can be done by setting
mode='none'
for each of the 3 plot objects.Remarks: