when i try to run an azure ML pipeline that has inside it a connection to datalake storage using compute cluster (python sdk v2), i get an error that the compute cluster doesnt have access to the storage but when i run the pipeline using compute instance that is assigned to me i dont have any problem but im trying to do automation using synapse pipeline so i need to use a compute cluster not compute instance.
i tried to give the azure ml workspace owner access to data lake gen2 storage but that didnt fix the problem,i tried to use managed identity on the cluster and i got a principal id but when i try to add that given principal id to datalake storage in IAM access i cant enter a principal id in it i only can write a user name
Follow the steps below to grant access to your cluster using managed identity.
Microsoft Entra ID.Give
Storage Blob Data ContributororStorage Blob Data Readerbased on your requirement.