When I try to combine torch.compile with flash-attention, I get the following error:

```
torch._dynamo.variables.higher_order_ops: [WARNING] speculate_subgraph: while introspecting the user-defined autograd.Function, we were unable to trace function trampoline_autograd_fwd into a single graph. This means that Dynamo was unable to prove safety for this API and will fall back to eager-mode PyTorch, which could lead to a slowdown.
[rank2]:[2023-10-09 15:38:00,809] [4/0] torch._dynamo.variables.higher_order_ops: [ERROR] call_method UserDefinedObjectVariable(fwd) call [TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(float), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(NoneType)] {}
```

Any ideas??

Each of them runs properly on its own; it is only the combination that fails.
