Hi support team! I've been having some issues with Estuary pipelines hanging when they write to Motherduck, and the usual fix of bumping up the duckling size is no longer working. I was told on the Estuary Slack channel to get support here, as this was a Motherduck issue. I've granted the Motherduck team inbound access to our S3 bucket containing the Estuary data. Could someone DM me for more details?
Hi Ajit, thanks for reaching out. We've been working on this issue in collaboration with Estuary...they actually just sent over a backend core dump from one of your materializations to help us identify the root cause and work on a fix. Can you try disabling and then enabling the Estuary pipeline? We have seen that this often clears the hanging client-side commit from Estuary and allows the pipeline to run.
Ajjit N. we have identified the root cause of this issue and will have a fix shipped in this week's release. Here's what was happening: we use short-lived tokens for authorization between the client and the server, and they get refreshed automatically when they expire. We identified an extremely rare race condition in this refresh path where, under a specific timing of background activity, the refresh could end up waiting on a lock it already held, producing a deadlock that prevented the client from obtaining a new token and reaching the server. The fix removes that potential loop. Let us know if you have any questions!
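For anyone following along, here is a minimal Python sketch of the general failure mode described above: a refresh path that tries to take a non-reentrant lock it already holds. This is not Motherduck's client code. The class `TokenClient`, its method names, and the token format are all hypothetical, and the sketch leaves out the background-activity timing that made the real bug rare. The buggy path uses a timeout so the example returns instead of hanging the way a real deadlock would.

```python
import threading
import time


class TokenClient:
    """Hypothetical client that caches a short-lived auth token."""

    def __init__(self, ttl_seconds=60.0):
        self._lock = threading.Lock()  # non-reentrant lock
        self._token = None
        self._expires_at = 0.0  # starts out expired
        self._ttl = ttl_seconds
        self._issued = 0

    def _expired(self):
        return time.monotonic() >= self._expires_at

    def _issue_token(self):
        # Stand-in for the real round trip to an auth server.
        self._issued += 1
        self._token = f"token-{self._issued}"
        self._expires_at = time.monotonic() + self._ttl

    def get_token_buggy(self, timeout=0.2):
        # Takes the lock, then calls a helper that takes the same lock.
        with self._lock:
            if self._expired():
                return self._refresh_buggy(timeout)
            return self._token

    def _refresh_buggy(self, timeout):
        # Without the timeout this acquire would block forever,
        # because the caller already holds self._lock.
        if not self._lock.acquire(timeout=timeout):
            return None  # the point where a real client would hang
        try:
            self._issue_token()
            return self._token
        finally:
            self._lock.release()

    def get_token_fixed(self):
        # Refreshes while holding the lock once; no second acquire.
        with self._lock:
            if self._expired():
                self._issue_token()
            return self._token


client = TokenClient()
print(client.get_token_buggy(timeout=0.05))
print(client.get_token_fixed())
```

The fixed variant simply does the refresh inside the critical section it already owns, which is one common way to remove this kind of loop; the actual fix in the release may look different.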
OK awesome and thanks for the updates! Only thing that might be helpful in the future is some guidance on duckling sizes recommended for either streaming data into Motherduck, or batch processing into MD from Estuary/Fivetran/related tools. Right now our production Azure SQL DBs are around 250 GB, but we batch process CDC data once every 2 hours with Estuary and insert into our Motherduck DBs. These are typically <75 MB of data each every 2 hours. I am not sure whether the service accounts we connect to Estuary should be Normal, or Pulse (or larger with bigger data updates down the line).
Also, I realize this may not be an issue after the weekly release. Roughly when can we expect that? Happy to wait if it's just a day or two.
Awesome! I will try restarting the one materialization that is still giving me issues.
I still have access to the S3 bucket enabled for the Motherduck team, so I would appreciate some investigation. I'm also happy to bring this up with the Estuary team.